HEX
Server: Apache/2.4.6 (CentOS) OpenSSL/1.0.2k-fips PHP/7.4.30
System: Linux iZj6c1151k3ad370bosnmsZ 3.10.0-1160.76.1.el7.x86_64 #1 SMP Wed Aug 10 16:21:17 UTC 2022 x86_64
User: root (0)
PHP: 7.4.30
Disabled: NONE
Upload Files
File: //usr/local/cloudmonitor/local_data/logs/argusagent.log.20260602201211
[ERROR] 2026-05-31 20:23:33.760 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 20:23:37.729 [19074] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:23:41.042 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-05-31 20:23:41.042 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421202,ok=421202,error=0, records=41
[INFO ] 2026-05-31 20:23:48.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:23:48.761 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 20:23:52.734 [19053] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:23:56.048 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-05-31 20:23:56.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421203,ok=421203,error=0, records=41
[INFO ] 2026-05-31 20:24:03.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=23.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:24:07.739 [19074] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:24:08.953 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20894196},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:24:09.125 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:24:09.125 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 20:24:09.125 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 20:24:09.125 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 20:24:09.125 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 20:24:09.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:24:09.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:24:09.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:24:09.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:24:09.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:24:09.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:24:11.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-05-31 20:24:11.055 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421204,ok=421204,error=0, records=41
[INFO ] 2026-05-31 20:24:18.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=24.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:24:22.744 [19087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:24:26.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 20:24:26.063 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421205,ok=421205,error=0, records=41
[INFO ] 2026-05-31 20:24:33.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:24:37.749 [19012] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:24:41.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-05-31 20:24:41.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421206,ok=421206,error=0, records=41
[INFO ] 2026-05-31 20:24:41.153 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21062/300s
[INFO ] 2026-05-31 20:24:42.750 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21066/300s
[INFO ] 2026-05-31 20:24:48.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:24:52.753 [19074] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:24:56.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 20:24:56.158 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421207,ok=421207,error=0, records=41
[INFO ] 2026-05-31 20:25:00.424 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21075/300s
[INFO ] 2026-05-31 20:25:03.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:25:07.765 [19087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:25:11.163 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-05-31 20:25:11.163 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421208,ok=421208,error=0, records=41
[INFO ] 2026-05-31 20:25:18.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:25:22.770 [19074] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:25:26.168 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-05-31 20:25:26.168 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421209,ok=421209,error=0, records=41
[INFO ] 2026-05-31 20:25:33.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:25:37.776 [19053] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:25:40.259 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21075/300s
[INFO ] 2026-05-31 20:25:40.577 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21062/300s
[INFO ] 2026-05-31 20:25:41.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-05-31 20:25:41.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421210,ok=421210,error=0, records=41
[INFO ] 2026-05-31 20:25:48.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:25:52.780 [19087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:25:56.192 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-05-31 20:25:56.192 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421211,ok=421211,error=0, records=41
[INFO ] 2026-05-31 20:26:03.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:26:07.785 [19117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:26:11.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-05-31 20:26:11.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421212,ok=421212,error=0, records=41
[INFO ] 2026-05-31 20:26:18.768 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:26:22.790 [19053] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:26:26.203 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 20:26:26.203 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421213,ok=421213,error=0, records=41
[INFO ] 2026-05-31 20:26:26.291 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21071/300s
[INFO ] 2026-05-31 20:26:33.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:26:37.795 [19087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:26:41.209 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-05-31 20:26:41.209 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421214,ok=421214,error=0, records=41
[INFO ] 2026-05-31 20:26:48.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:26:52.800 [19117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:26:56.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-05-31 20:26:56.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421215,ok=421215,error=0, records=41
[INFO ] 2026-05-31 20:27:03.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:27:03.770 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21074/300s
[WARN ] 2026-05-31 20:27:07.806 [19012] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:27:09.126 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17546/300s
[INFO ] 2026-05-31 20:27:09.127 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20894124},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:27:09.277 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:27:09.277 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 20:27:09.277 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 20:27:09.277 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 20:27:09.277 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 20:27:09.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:27:09.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:27:09.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:27:09.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:27:09.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:27:09.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:27:11.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-05-31 20:27:11.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421216,ok=421216,error=0, records=41
[INFO ] 2026-05-31 20:27:18.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:27:22.812 [19643] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:27:26.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-05-31 20:27:26.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421217,ok=421217,error=0, records=41
[INFO ] 2026-05-31 20:27:33.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:27:34.622 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21072/300s
[INFO ] 2026-05-31 20:27:36.524 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21072/300s
[WARN ] 2026-05-31 20:27:37.817 [19663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:27:41.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 20:27:41.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421218,ok=421218,error=0, records=41
[INFO ] 2026-05-31 20:27:44.431 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21072/300s
[INFO ] 2026-05-31 20:27:48.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:27:52.822 [19648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:27:56.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-05-31 20:27:56.247 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421219,ok=421219,error=0, records=41
[INFO ] 2026-05-31 20:28:03.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:28:07.827 [19663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:28:11.252 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-05-31 20:28:11.252 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421220,ok=421220,error=0, records=41
[INFO ] 2026-05-31 20:28:18.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:28:22.833 [19677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:28:26.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-05-31 20:28:26.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421221,ok=421221,error=0, records=41
[INFO ] 2026-05-31 20:28:33.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:28:37.838 [19663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:28:41.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 20:28:41.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421222,ok=421222,error=0, records=41
[INFO ] 2026-05-31 20:28:48.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:28:52.842 [19648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:28:56.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 20:28:56.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421223,ok=421223,error=0, records=41
[INFO ] 2026-05-31 20:29:03.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:29:07.849 [19648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:29:11.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-05-31 20:29:11.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421224,ok=421224,error=0, records=41
[INFO ] 2026-05-31 20:29:18.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:29:22.854 [19663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:29:26.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-05-31 20:29:26.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421225,ok=421225,error=0, records=41
[INFO ] 2026-05-31 20:29:33.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:29:37.860 [19754] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:29:41.359 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-05-31 20:29:41.359 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421226,ok=421226,error=0, records=41
[INFO ] 2026-05-31 20:29:41.359 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21063/300s
[INFO ] 2026-05-31 20:29:42.863 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21067/300s
[INFO ] 2026-05-31 20:29:48.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:29:52.866 [19782] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:29:56.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-05-31 20:29:56.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421227,ok=421227,error=0, records=41
[INFO ] 2026-05-31 20:30:00.428 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21076/300s
[INFO ] 2026-05-31 20:30:03.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:30:07.870 [19648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:30:09.279 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20894048},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:30:09.442 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:30:09.443 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 20:30:09.443 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 20:30:09.443 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 20:30:09.443 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 20:30:09.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:30:09.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:30:09.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:30:09.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:30:09.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:30:09.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:30:11.389 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 20:30:11.389 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421228,ok=421228,error=0, records=41
[INFO ] 2026-05-31 20:30:18.778 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:30:22.875 [19822] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:30:26.395 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 20:30:26.395 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421229,ok=421229,error=0, records=41
[INFO ] 2026-05-31 20:30:33.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:30:37.880 [19782] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:30:40.266 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21076/300s
[INFO ] 2026-05-31 20:30:40.764 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21063/300s
[INFO ] 2026-05-31 20:30:41.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 20:30:41.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421230,ok=421230,error=0, records=41
[INFO ] 2026-05-31 20:30:48.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:30:52.886 [19828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:30:56.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-05-31 20:30:56.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421231,ok=421231,error=0, records=41
[INFO ] 2026-05-31 20:31:03.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:31:07.891 [19864] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:31:11.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10382, records=41
[INFO ] 2026-05-31 20:31:11.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421232,ok=421232,error=0, records=41
[INFO ] 2026-05-31 20:31:18.781 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:31:22.896 [19881] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:31:26.355 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21072/300s
[INFO ] 2026-05-31 20:31:26.418 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-05-31 20:31:26.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421233,ok=421233,error=0, records=41
[INFO ] 2026-05-31 20:31:33.781 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:31:37.901 [19898] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:31:41.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-05-31 20:31:41.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421234,ok=421234,error=0, records=41
[INFO ] 2026-05-31 20:31:48.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:31:52.906 [19897] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:31:56.431 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-05-31 20:31:56.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421235,ok=421235,error=0, records=41
[INFO ] 2026-05-31 20:32:03.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:32:03.782 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21075/300s
[WARN ] 2026-05-31 20:32:07.912 [19929] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:32:11.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-05-31 20:32:11.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421236,ok=421236,error=0, records=41
[INFO ] 2026-05-31 20:32:18.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:32:22.918 [19930] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:32:26.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 20:32:26.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421237,ok=421237,error=0, records=41
[INFO ] 2026-05-31 20:32:33.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:32:34.694 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21073/300s
[INFO ] 2026-05-31 20:32:36.595 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21073/300s
[WARN ] 2026-05-31 20:32:37.924 [19966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:32:41.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-05-31 20:32:41.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421238,ok=421238,error=0, records=41
[INFO ] 2026-05-31 20:32:44.504 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21073/300s
[INFO ] 2026-05-31 20:32:48.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:32:52.930 [19982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:32:56.457 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-05-31 20:32:56.457 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421239,ok=421239,error=0, records=41
[INFO ] 2026-05-31 20:33:03.785 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:33:07.936 [19992] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:33:09.443 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17547/300s
[INFO ] 2026-05-31 20:33:09.445 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893980},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:33:09.591 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:33:09.592 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 20:33:09.592 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 20:33:09.592 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 20:33:09.592 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 20:33:09.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:33:09.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:33:09.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:33:09.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:33:09.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:33:09.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:33:11.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-05-31 20:33:11.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421240,ok=421240,error=0, records=41
[INFO ] 2026-05-31 20:33:18.786 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:33:22.942 [20010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:33:26.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 20:33:26.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421241,ok=421241,error=0, records=41
[INFO ] 2026-05-31 20:33:33.786 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 20:33:33.786 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 20:33:37.947 [20028] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:33:41.473 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-05-31 20:33:41.473 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421242,ok=421242,error=0, records=41
[INFO ] 2026-05-31 20:33:48.787 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:33:52.953 [20042] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:33:56.480 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-05-31 20:33:56.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421243,ok=421243,error=0, records=41
[INFO ] 2026-05-31 20:34:03.787 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:34:07.958 [20042] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:34:11.485 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 20:34:11.485 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421244,ok=421244,error=0, records=41
[INFO ] 2026-05-31 20:34:18.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:34:22.963 [20042] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:34:26.491 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-05-31 20:34:26.491 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421245,ok=421245,error=0, records=41
[INFO ] 2026-05-31 20:34:33.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:34:37.967 [20011] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:34:41.497 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-05-31 20:34:41.497 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421246,ok=421246,error=0, records=41
[INFO ] 2026-05-31 20:34:41.497 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21064/300s
[INFO ] 2026-05-31 20:34:42.968 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21068/300s
[INFO ] 2026-05-31 20:34:48.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:34:52.973 [20011] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:34:56.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-05-31 20:34:56.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421247,ok=421247,error=0, records=41
[INFO ] 2026-05-31 20:35:00.431 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21077/300s
[INFO ] 2026-05-31 20:35:03.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:35:07.977 [20070] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:35:11.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-05-31 20:35:11.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421248,ok=421248,error=0, records=41
[INFO ] 2026-05-31 20:35:18.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:35:22.982 [20011] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:35:26.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-05-31 20:35:26.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421249,ok=421249,error=0, records=41
[INFO ] 2026-05-31 20:35:33.791 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:35:37.987 [20011] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:35:40.273 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21077/300s
[INFO ] 2026-05-31 20:35:40.946 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21064/300s
[INFO ] 2026-05-31 20:35:41.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-05-31 20:35:41.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421250,ok=421250,error=0, records=41
[INFO ] 2026-05-31 20:35:48.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:35:52.993 [20154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:35:56.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 20:35:56.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421251,ok=421251,error=0, records=41
[INFO ] 2026-05-31 20:36:03.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:36:07.998 [20154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:36:09.594 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893920},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:36:09.778 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:36:09.778 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-05-31 20:36:09.779 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 20:36:09.779 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 20:36:09.779 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 20:36:09.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:36:09.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:36:09.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:36:09.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:36:09.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:36:09.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:36:11.533 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-05-31 20:36:11.533 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421252,ok=421252,error=0, records=41
[INFO ] 2026-05-31 20:36:18.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:36:23.003 [20183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:36:26.408 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21073/300s
[INFO ] 2026-05-31 20:36:26.538 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-05-31 20:36:26.538 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421253,ok=421253,error=0, records=41
[INFO ] 2026-05-31 20:36:33.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:36:38.009 [20197] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:36:41.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-05-31 20:36:41.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421254,ok=421254,error=0, records=41
[INFO ] 2026-05-31 20:36:48.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:36:53.014 [20211] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:36:56.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-05-31 20:36:56.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421255,ok=421255,error=0, records=41
[INFO ] 2026-05-31 20:37:03.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:37:03.795 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21076/300s
[WARN ] 2026-05-31 20:37:08.019 [20154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:37:11.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-05-31 20:37:11.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421256,ok=421256,error=0, records=41
[INFO ] 2026-05-31 20:37:18.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:37:23.024 [20239] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:37:26.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-05-31 20:37:26.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421257,ok=421257,error=0, records=41
[INFO ] 2026-05-31 20:37:33.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:37:34.746 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21074/300s
[INFO ] 2026-05-31 20:37:36.647 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21074/300s
[WARN ] 2026-05-31 20:37:38.029 [20098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:37:41.566 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 20:37:41.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421258,ok=421258,error=0, records=41
[INFO ] 2026-05-31 20:37:44.552 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21074/300s
[INFO ] 2026-05-31 20:37:48.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:37:53.034 [20098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:37:56.571 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-05-31 20:37:56.571 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421259,ok=421259,error=0, records=41
[INFO ] 2026-05-31 20:38:03.797 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:38:08.038 [20098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:38:11.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-05-31 20:38:11.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421260,ok=421260,error=0, records=41
[INFO ] 2026-05-31 20:38:18.797 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:38:23.043 [20294] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:38:26.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-05-31 20:38:26.581 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421261,ok=421261,error=0, records=41
[INFO ] 2026-05-31 20:38:33.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:38:38.047 [20277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:38:41.587 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 20:38:41.587 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421262,ok=421262,error=0, records=41
[INFO ] 2026-05-31 20:38:48.799 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:38:48.799 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 20:38:53.052 [20333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:38:56.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-05-31 20:38:56.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421263,ok=421263,error=0, records=41
[INFO ] 2026-05-31 20:39:03.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:39:07.556 [20353] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:39:09.779 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17548/300s
[INFO ] 2026-05-31 20:39:09.780 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893844},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:39:09.941 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:39:09.941 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-05-31 20:39:09.941 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 20:39:09.941 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 20:39:09.942 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 20:39:09.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:39:09.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:39:09.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:39:09.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:39:09.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:39:09.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:39:11.641 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 20:39:11.641 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421264,ok=421264,error=0, records=41
[INFO ] 2026-05-31 20:39:18.801 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:39:22.560 [20367] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:39:26.646 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-05-31 20:39:26.646 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421265,ok=421265,error=0, records=41
[INFO ] 2026-05-31 20:39:33.801 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:39:37.565 [20362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:39:41.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 20:39:41.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421266,ok=421266,error=0, records=41
[INFO ] 2026-05-31 20:39:41.650 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21065/300s
[INFO ] 2026-05-31 20:39:43.066 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21069/300s
[INFO ] 2026-05-31 20:39:48.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:39:52.569 [20402] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:39:56.655 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-05-31 20:39:56.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421267,ok=421267,error=0, records=41
[INFO ] 2026-05-31 20:40:00.434 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21078/300s
[INFO ] 2026-05-31 20:40:03.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:40:07.574 [20423] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:40:11.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-05-31 20:40:11.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421268,ok=421268,error=0, records=41
[INFO ] 2026-05-31 20:40:18.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:40:22.580 [20445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:40:26.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-05-31 20:40:26.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421269,ok=421269,error=0, records=41
[INFO ] 2026-05-31 20:40:33.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:40:37.585 [20462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:40:40.279 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21078/300s
[INFO ] 2026-05-31 20:40:41.123 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21065/300s
[INFO ] 2026-05-31 20:40:41.673 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-05-31 20:40:41.673 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421270,ok=421270,error=0, records=41
[INFO ] 2026-05-31 20:40:48.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:40:52.590 [20481] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:40:56.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-05-31 20:40:56.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421271,ok=421271,error=0, records=41
[INFO ] 2026-05-31 20:41:03.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:41:07.595 [20482] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:41:11.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-05-31 20:41:11.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421272,ok=421272,error=0, records=41
[INFO ] 2026-05-31 20:41:18.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:41:22.599 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:41:26.457 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21074/300s
[INFO ] 2026-05-31 20:41:26.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 20:41:26.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421273,ok=421273,error=0, records=41
[INFO ] 2026-05-31 20:41:33.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:41:37.605 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:41:41.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-05-31 20:41:41.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421274,ok=421274,error=0, records=41
[INFO ] 2026-05-31 20:41:48.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:41:52.610 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:41:56.703 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-05-31 20:41:56.703 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421275,ok=421275,error=0, records=41
[INFO ] 2026-05-31 20:42:03.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:42:03.807 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21077/300s
[WARN ] 2026-05-31 20:42:07.617 [20445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:42:09.943 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893780},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:42:10.109 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:42:10.110 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 20:42:10.110 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 20:42:10.110 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 20:42:10.110 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 20:42:10.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:42:10.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:42:10.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:42:10.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:42:10.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:42:10.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:42:11.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 20:42:11.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421276,ok=421276,error=0, records=41
[INFO ] 2026-05-31 20:42:18.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:42:22.622 [20508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:42:26.713 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-05-31 20:42:26.713 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421277,ok=421277,error=0, records=41
[INFO ] 2026-05-31 20:42:33.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:42:34.765 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21075/300s
[INFO ] 2026-05-31 20:42:36.666 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21075/300s
[WARN ] 2026-05-31 20:42:37.626 [20445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:42:41.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 20:42:41.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421278,ok=421278,error=0, records=41
[INFO ] 2026-05-31 20:42:44.572 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21075/300s
[INFO ] 2026-05-31 20:42:48.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:42:52.631 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:42:56.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 20:42:56.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421279,ok=421279,error=0, records=41
[INFO ] 2026-05-31 20:43:03.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:43:07.636 [20508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:43:11.729 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10382, records=41
[INFO ] 2026-05-31 20:43:11.729 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421280,ok=421280,error=0, records=41
[INFO ] 2026-05-31 20:43:18.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:43:22.641 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:43:26.800 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10391, records=41
[INFO ] 2026-05-31 20:43:26.800 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421281,ok=421281,error=0, records=41
[INFO ] 2026-05-31 20:43:33.811 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 20:43:33.811 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 20:43:37.646 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:43:41.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-05-31 20:43:41.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421282,ok=421282,error=0, records=41
[INFO ] 2026-05-31 20:43:48.811 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:43:52.652 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:43:56.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10365, records=41
[INFO ] 2026-05-31 20:43:56.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421283,ok=421283,error=0, records=41
[INFO ] 2026-05-31 20:44:03.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:44:07.658 [20497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:44:11.820 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-05-31 20:44:11.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421284,ok=421284,error=0, records=41
[INFO ] 2026-05-31 20:44:18.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:44:22.663 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:44:26.826 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-05-31 20:44:26.826 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421285,ok=421285,error=0, records=41
[INFO ] 2026-05-31 20:44:33.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:44:37.668 [20445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:44:41.832 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 20:44:41.832 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421286,ok=421286,error=0, records=41
[INFO ] 2026-05-31 20:44:41.832 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21066/300s
[INFO ] 2026-05-31 20:44:43.170 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21070/300s
[INFO ] 2026-05-31 20:44:48.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:44:52.673 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:44:56.874 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-05-31 20:44:56.874 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421287,ok=421287,error=0, records=41
[INFO ] 2026-05-31 20:45:00.438 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21079/300s
[INFO ] 2026-05-31 20:45:03.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:45:07.679 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:45:10.110 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17549/300s
[INFO ] 2026-05-31 20:45:10.112 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893696},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:45:10.272 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:45:10.273 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 20:45:11.879 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 20:45:11.879 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421288,ok=421288,error=0, records=41
[INFO ] 2026-05-31 20:45:18.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:45:22.685 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:45:26.886 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-05-31 20:45:26.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421289,ok=421289,error=0, records=41
[INFO ] 2026-05-31 20:45:33.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:45:37.690 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:45:40.285 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21079/300s
[INFO ] 2026-05-31 20:45:41.301 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21066/300s
[INFO ] 2026-05-31 20:45:41.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-05-31 20:45:41.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421290,ok=421290,error=0, records=41
[INFO ] 2026-05-31 20:45:48.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:45:52.695 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:45:56.901 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-05-31 20:45:56.901 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421291,ok=421291,error=0, records=41
[INFO ] 2026-05-31 20:46:03.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:46:07.701 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:46:11.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-05-31 20:46:11.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421292,ok=421292,error=0, records=41
[INFO ] 2026-05-31 20:46:18.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:46:22.707 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:46:26.508 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21075/300s
[INFO ] 2026-05-31 20:46:26.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 20:46:26.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421293,ok=421293,error=0, records=41
[INFO ] 2026-05-31 20:46:33.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:46:37.711 [20508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:46:41.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-05-31 20:46:41.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421294,ok=421294,error=0, records=41
[INFO ] 2026-05-31 20:46:48.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:46:52.716 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:46:56.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 20:46:56.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421295,ok=421295,error=0, records=41
[INFO ] 2026-05-31 20:47:03.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:47:03.819 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21078/300s
[WARN ] 2026-05-31 20:47:07.723 [20508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:47:11.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-05-31 20:47:11.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421296,ok=421296,error=0, records=41
[INFO ] 2026-05-31 20:47:18.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:47:22.728 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:47:26.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10354, records=41
[INFO ] 2026-05-31 20:47:26.936 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421297,ok=421297,error=0, records=41
[INFO ] 2026-05-31 20:47:33.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:47:34.814 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21076/300s
[INFO ] 2026-05-31 20:47:36.715 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21076/300s
[WARN ] 2026-05-31 20:47:37.733 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:47:41.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-05-31 20:47:41.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421298,ok=421298,error=0, records=41
[INFO ] 2026-05-31 20:47:44.619 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21076/300s
[INFO ] 2026-05-31 20:47:48.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:47:52.739 [20445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:47:56.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10385, records=41
[INFO ] 2026-05-31 20:47:56.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421299,ok=421299,error=0, records=41
[INFO ] 2026-05-31 20:48:03.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:48:07.744 [20445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:48:10.274 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893632},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:48:10.428 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:48:10.428 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-05-31 20:48:10.428 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 20:48:10.429 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 20:48:10.429 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 20:48:10.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:48:10.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:48:10.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:48:10.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:48:10.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:48:10.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:48:11.953 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-05-31 20:48:11.953 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421300,ok=421300,error=0, records=41
[INFO ] 2026-05-31 20:48:18.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:48:22.749 [20497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:48:26.958 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-05-31 20:48:26.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421301,ok=421301,error=0, records=41
[INFO ] 2026-05-31 20:48:33.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:48:37.753 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:48:41.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 20:48:41.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421302,ok=421302,error=0, records=41
[INFO ] 2026-05-31 20:48:48.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:48:52.759 [20508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:48:57.003 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-05-31 20:48:57.004 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421303,ok=421303,error=0, records=41
[INFO ] 2026-05-31 20:49:03.824 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:49:07.765 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:49:12.009 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-05-31 20:49:12.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421304,ok=421304,error=0, records=41
[INFO ] 2026-05-31 20:49:18.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:49:22.770 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:49:27.014 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-05-31 20:49:27.014 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421305,ok=421305,error=0, records=41
[INFO ] 2026-05-31 20:49:33.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:49:37.776 [20445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:49:42.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-05-31 20:49:42.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421306,ok=421306,error=0, records=41
[INFO ] 2026-05-31 20:49:42.019 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21067/300s
[INFO ] 2026-05-31 20:49:43.278 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21071/300s
[INFO ] 2026-05-31 20:49:48.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:49:52.782 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:49:57.025 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-05-31 20:49:57.025 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421307,ok=421307,error=0, records=41
[INFO ] 2026-05-31 20:50:00.441 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21080/300s
[INFO ] 2026-05-31 20:50:03.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:50:07.787 [20457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:50:12.030 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-05-31 20:50:12.030 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421308,ok=421308,error=0, records=41
[INFO ] 2026-05-31 20:50:18.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:50:22.794 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:50:27.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-05-31 20:50:27.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421309,ok=421309,error=0, records=41
[INFO ] 2026-05-31 20:50:33.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:50:37.800 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:50:40.291 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21080/300s
[INFO ] 2026-05-31 20:50:41.481 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21067/300s
[INFO ] 2026-05-31 20:50:42.042 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-05-31 20:50:42.042 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421310,ok=421310,error=0, records=41
[INFO ] 2026-05-31 20:50:48.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:50:52.806 [21021] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:50:57.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 20:50:57.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421311,ok=421311,error=0, records=41
[INFO ] 2026-05-31 20:51:03.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:51:07.810 [21036] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:51:10.429 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17550/300s
[INFO ] 2026-05-31 20:51:10.430 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893568},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:51:10.610 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:51:10.611 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 20:51:10.611 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 20:51:10.611 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 20:51:10.611 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 20:51:10.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:51:10.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:51:10.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:51:10.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:51:10.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:51:10.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:51:12.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-05-31 20:51:12.055 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421312,ok=421312,error=0, records=41
[INFO ] 2026-05-31 20:51:18.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:51:22.815 [21021] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:51:26.563 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21076/300s
[INFO ] 2026-05-31 20:51:27.061 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-05-31 20:51:27.061 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421313,ok=421313,error=0, records=41
[INFO ] 2026-05-31 20:51:33.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:51:37.821 [20378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:51:42.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-05-31 20:51:42.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421314,ok=421314,error=0, records=41
[INFO ] 2026-05-31 20:51:48.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:51:52.826 [20445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:51:57.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10134, records=41
[INFO ] 2026-05-31 20:51:57.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421315,ok=421315,error=0, records=41
[INFO ] 2026-05-31 20:52:03.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:52:03.831 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21079/300s
[WARN ] 2026-05-31 20:52:07.832 [21021] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:52:12.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 20:52:12.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421316,ok=421316,error=0, records=41
[INFO ] 2026-05-31 20:52:18.832 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:52:22.839 [21036] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:52:27.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-05-31 20:52:27.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421317,ok=421317,error=0, records=41
[INFO ] 2026-05-31 20:52:33.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:52:34.857 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21077/300s
[INFO ] 2026-05-31 20:52:36.758 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21077/300s
[WARN ] 2026-05-31 20:52:37.844 [21021] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:52:42.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-05-31 20:52:42.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421318,ok=421318,error=0, records=41
[INFO ] 2026-05-31 20:52:44.665 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21077/300s
[INFO ] 2026-05-31 20:52:48.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:52:52.849 [21021] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:52:57.094 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-05-31 20:52:57.094 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421319,ok=421319,error=0, records=41
[INFO ] 2026-05-31 20:53:03.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:53:07.855 [21021] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:53:12.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 20:53:12.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421320,ok=421320,error=0, records=41
[INFO ] 2026-05-31 20:53:18.835 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:53:22.860 [21161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:53:27.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-05-31 20:53:27.103 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421321,ok=421321,error=0, records=41
[INFO ] 2026-05-31 20:53:33.835 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 20:53:33.835 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 20:53:37.865 [21176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:53:42.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-05-31 20:53:42.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421322,ok=421322,error=0, records=41
[INFO ] 2026-05-31 20:53:48.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:53:48.836 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 20:53:52.871 [21133] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:53:57.113 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-05-31 20:53:57.113 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421323,ok=421323,error=0, records=41
[INFO ] 2026-05-31 20:54:03.837 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:54:07.877 [21190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:54:10.612 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893492},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:54:10.760 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:54:10.760 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 20:54:10.760 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 20:54:10.760 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 20:54:10.760 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 20:54:10.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:54:10.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:54:10.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:54:10.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:54:10.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:54:10.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:54:12.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-05-31 20:54:12.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421324,ok=421324,error=0, records=41
[INFO ] 2026-05-31 20:54:18.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:54:22.883 [21222] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:54:27.152 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 20:54:27.152 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421325,ok=421325,error=0, records=41
[INFO ] 2026-05-31 20:54:33.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:54:37.889 [21133] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:54:42.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-05-31 20:54:42.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421326,ok=421326,error=0, records=41
[INFO ] 2026-05-31 20:54:42.159 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21068/300s
[INFO ] 2026-05-31 20:54:43.391 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21072/300s
[INFO ] 2026-05-31 20:54:48.839 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:54:52.895 [21239] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:54:57.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-05-31 20:54:57.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421327,ok=421327,error=0, records=41
[INFO ] 2026-05-31 20:55:00.444 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21081/300s
[INFO ] 2026-05-31 20:55:03.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:55:07.902 [21269] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:55:12.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10378, records=41
[INFO ] 2026-05-31 20:55:12.170 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421328,ok=421328,error=0, records=41
[INFO ] 2026-05-31 20:55:18.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:55:22.907 [21269] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:55:27.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10399, records=41
[INFO ] 2026-05-31 20:55:27.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421329,ok=421329,error=0, records=41
[INFO ] 2026-05-31 20:55:33.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:55:37.913 [21279] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:55:40.297 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21081/300s
[INFO ] 2026-05-31 20:55:41.661 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21068/300s
[INFO ] 2026-05-31 20:55:42.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-05-31 20:55:42.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421330,ok=421330,error=0, records=41
[INFO ] 2026-05-31 20:55:48.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:55:52.919 [21190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:55:57.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-05-31 20:55:57.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421331,ok=421331,error=0, records=41
[INFO ] 2026-05-31 20:56:03.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:56:07.924 [21339] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:56:12.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-05-31 20:56:12.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421332,ok=421332,error=0, records=41
[INFO ] 2026-05-31 20:56:18.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:56:22.930 [21269] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:56:26.612 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21077/300s
[INFO ] 2026-05-31 20:56:27.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-05-31 20:56:27.314 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421333,ok=421333,error=0, records=41
[INFO ] 2026-05-31 20:56:33.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:56:37.936 [21370] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:56:42.319 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 20:56:42.319 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421334,ok=421334,error=0, records=41
[INFO ] 2026-05-31 20:56:48.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:56:52.943 [21354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:56:57.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-05-31 20:56:57.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421335,ok=421335,error=0, records=41
[INFO ] 2026-05-31 20:57:03.845 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:57:03.845 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21080/300s
[WARN ] 2026-05-31 20:57:07.948 [21381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:57:10.761 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17551/300s
[INFO ] 2026-05-31 20:57:10.762 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893424},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 20:57:10.916 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 20:57:10.916 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 20:57:10.916 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 20:57:10.916 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 20:57:10.916 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 20:57:10.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:57:10.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 20:57:10.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:57:10.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 20:57:10.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:57:10.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 20:57:12.366 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-05-31 20:57:12.366 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421336,ok=421336,error=0, records=41
[INFO ] 2026-05-31 20:57:18.845 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:57:22.953 [21354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:57:27.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-05-31 20:57:27.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421337,ok=421337,error=0, records=41
[INFO ] 2026-05-31 20:57:33.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 20:57:34.887 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21078/300s
[INFO ] 2026-05-31 20:57:36.790 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21078/300s
[WARN ] 2026-05-31 20:57:37.958 [21403] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:57:42.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-05-31 20:57:42.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421338,ok=421338,error=0, records=41
[INFO ] 2026-05-31 20:57:44.693 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21078/300s
[INFO ] 2026-05-31 20:57:48.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:57:52.963 [21414] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:57:57.382 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-05-31 20:57:57.382 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421339,ok=421339,error=0, records=41
[INFO ] 2026-05-31 20:58:03.847 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:58:07.968 [21398] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:58:12.388 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-05-31 20:58:12.388 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421340,ok=421340,error=0, records=41
[INFO ] 2026-05-31 20:58:18.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:58:22.972 [21354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:58:27.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-05-31 20:58:27.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421341,ok=421341,error=0, records=41
[INFO ] 2026-05-31 20:58:33.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:58:37.978 [21354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:58:42.404 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 20:58:42.405 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421342,ok=421342,error=0, records=41
[INFO ] 2026-05-31 20:58:48.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:58:52.982 [21414] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:58:57.410 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-05-31 20:58:57.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421343,ok=421343,error=0, records=41
[INFO ] 2026-05-31 20:59:03.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:59:07.988 [21354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:59:12.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-05-31 20:59:12.417 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421344,ok=421344,error=0, records=41
[INFO ] 2026-05-31 20:59:18.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:59:22.992 [21469] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:59:27.421 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-05-31 20:59:27.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421345,ok=421345,error=0, records=41
[INFO ] 2026-05-31 20:59:33.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:59:37.998 [21524] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:59:42.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-05-31 20:59:42.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421346,ok=421346,error=0, records=41
[INFO ] 2026-05-31 20:59:42.427 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21069/300s
[INFO ] 2026-05-31 20:59:43.499 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21073/300s
[INFO ] 2026-05-31 20:59:48.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 20:59:53.003 [21483] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 20:59:57.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-05-31 20:59:57.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421347,ok=421347,error=0, records=41
[INFO ] 2026-05-31 21:00:00.447 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21082/300s
[INFO ] 2026-05-31 21:00:03.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:00:08.008 [21524] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:00:10.918 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893348},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:00:11.065 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:00:11.065 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 21:00:11.065 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:00:11.065 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:00:11.065 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:00:11.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:00:11.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:00:11.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:00:11.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:00:11.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:00:11.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:00:12.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-05-31 21:00:12.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421348,ok=421348,error=0, records=41
[INFO ] 2026-05-31 21:00:18.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:00:23.012 [21572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:00:27.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10147, records=41
[INFO ] 2026-05-31 21:00:27.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421349,ok=421349,error=0, records=41
[INFO ] 2026-05-31 21:00:33.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:00:38.017 [21540] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:00:40.303 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21082/300s
[INFO ] 2026-05-31 21:00:41.841 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21069/300s
[INFO ] 2026-05-31 21:00:42.449 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-05-31 21:00:42.449 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421350,ok=421350,error=0, records=41
[INFO ] 2026-05-31 21:00:48.854 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:00:53.023 [21414] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:00:57.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10145, records=41
[INFO ] 2026-05-31 21:00:57.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421351,ok=421351,error=0, records=41
[INFO ] 2026-05-31 21:01:03.854 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:01:08.028 [21572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:01:12.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 21:01:12.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421352,ok=421352,error=0, records=41
[INFO ] 2026-05-31 21:01:18.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:01:23.033 [21639] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:01:26.667 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21078/300s
[INFO ] 2026-05-31 21:01:27.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-05-31 21:01:27.465 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421353,ok=421353,error=0, records=41
[INFO ] 2026-05-31 21:01:33.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:01:38.037 [21669] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:01:42.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-05-31 21:01:42.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421354,ok=421354,error=0, records=41
[INFO ] 2026-05-31 21:01:48.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:01:53.043 [21686] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:01:57.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-05-31 21:01:57.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421355,ok=421355,error=0, records=41
[INFO ] 2026-05-31 21:02:03.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:02:03.857 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21081/300s
[WARN ] 2026-05-31 21:02:08.048 [21702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:02:12.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-05-31 21:02:12.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421356,ok=421356,error=0, records=41
[INFO ] 2026-05-31 21:02:18.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:02:23.053 [21718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:02:27.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-05-31 21:02:27.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421357,ok=421357,error=0, records=41
[INFO ] 2026-05-31 21:02:33.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:02:34.936 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21079/300s
[INFO ] 2026-05-31 21:02:36.838 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21079/300s
[WARN ] 2026-05-31 21:02:37.558 [21718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:02:42.491 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 21:02:42.491 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421358,ok=421358,error=0, records=41
[INFO ] 2026-05-31 21:02:44.746 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21079/300s
[INFO ] 2026-05-31 21:02:48.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:02:52.562 [21718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:02:57.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 21:02:57.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421359,ok=421359,error=0, records=41
[INFO ] 2026-05-31 21:03:03.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:03:07.568 [21743] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:03:11.065 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17552/300s
[INFO ] 2026-05-31 21:03:11.067 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893256},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:03:11.236 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:03:11.236 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 21:03:11.236 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:03:11.236 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:03:11.236 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:03:11.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:03:11.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:03:11.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:03:11.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:03:11.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:03:11.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:03:12.502 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-05-31 21:03:12.502 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421360,ok=421360,error=0, records=41
[INFO ] 2026-05-31 21:03:18.860 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:03:22.574 [21760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:03:27.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-05-31 21:03:27.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421361,ok=421361,error=0, records=41
[INFO ] 2026-05-31 21:03:33.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 21:03:33.861 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 21:03:37.578 [21807] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:03:42.617 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 21:03:42.617 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421362,ok=421362,error=0, records=41
[INFO ] 2026-05-31 21:03:48.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:03:52.582 [21819] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:03:57.623 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-05-31 21:03:57.624 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421363,ok=421363,error=0, records=41
[INFO ] 2026-05-31 21:04:03.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:04:07.587 [21847] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:04:12.628 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-05-31 21:04:12.628 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421364,ok=421364,error=0, records=41
[INFO ] 2026-05-31 21:04:18.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:04:22.592 [21860] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:04:27.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-05-31 21:04:27.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421365,ok=421365,error=0, records=41
[INFO ] 2026-05-31 21:04:33.863 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:04:37.596 [21880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:04:42.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-05-31 21:04:42.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421366,ok=421366,error=0, records=41
[INFO ] 2026-05-31 21:04:42.690 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21070/300s
[INFO ] 2026-05-31 21:04:43.597 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21074/300s
[INFO ] 2026-05-31 21:04:48.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:04:52.600 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:04:57.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-05-31 21:04:57.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421367,ok=421367,error=0, records=41
[INFO ] 2026-05-31 21:05:00.451 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21083/300s
[INFO ] 2026-05-31 21:05:03.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:05:07.606 [21860] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:05:12.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-05-31 21:05:12.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421368,ok=421368,error=0, records=41
[INFO ] 2026-05-31 21:05:18.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:05:22.611 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:05:27.795 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-05-31 21:05:27.795 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421369,ok=421369,error=0, records=41
[INFO ] 2026-05-31 21:05:33.866 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:05:37.616 [21875] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:05:40.310 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21083/300s
[INFO ] 2026-05-31 21:05:42.020 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21070/300s
[INFO ] 2026-05-31 21:05:42.804 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 21:05:42.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421370,ok=421370,error=0, records=41
[INFO ] 2026-05-31 21:05:48.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:05:52.620 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:05:57.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-05-31 21:05:57.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421371,ok=421371,error=0, records=41
[INFO ] 2026-05-31 21:06:03.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:06:07.625 [21880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:06:11.238 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893176},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:06:11.405 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:06:11.405 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-05-31 21:06:11.405 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:06:11.405 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:06:11.405 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:06:11.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:06:11.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:06:11.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:06:11.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:06:11.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:06:11.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:06:12.818 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-05-31 21:06:12.818 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421372,ok=421372,error=0, records=41
[INFO ] 2026-05-31 21:06:18.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:06:22.630 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:06:26.722 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21079/300s
[INFO ] 2026-05-31 21:06:27.822 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10147, records=41
[INFO ] 2026-05-31 21:06:27.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421373,ok=421373,error=0, records=41
[INFO ] 2026-05-31 21:06:33.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:06:37.637 [21880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:06:42.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-05-31 21:06:42.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421374,ok=421374,error=0, records=41
[INFO ] 2026-05-31 21:06:48.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:06:52.642 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:06:57.833 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10150, records=41
[INFO ] 2026-05-31 21:06:57.833 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421375,ok=421375,error=0, records=41
[INFO ] 2026-05-31 21:07:03.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:07:03.870 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21082/300s
[WARN ] 2026-05-31 21:07:07.646 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:07:12.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-05-31 21:07:12.839 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421376,ok=421376,error=0, records=41
[INFO ] 2026-05-31 21:07:18.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:07:22.651 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:07:27.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-05-31 21:07:27.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421377,ok=421377,error=0, records=41
[INFO ] 2026-05-31 21:07:33.871 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:07:34.999 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21080/300s
[INFO ] 2026-05-31 21:07:36.900 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21080/300s
[WARN ] 2026-05-31 21:07:37.657 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:07:42.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-05-31 21:07:42.850 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421378,ok=421378,error=0, records=41
[INFO ] 2026-05-31 21:07:44.804 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21080/300s
[INFO ] 2026-05-31 21:07:48.871 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:07:52.662 [21880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:07:57.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-05-31 21:07:57.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421379,ok=421379,error=0, records=41
[INFO ] 2026-05-31 21:08:03.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:08:07.669 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:08:12.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 21:08:12.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421380,ok=421380,error=0, records=41
[INFO ] 2026-05-31 21:08:18.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:08:22.674 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:08:27.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-05-31 21:08:27.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421381,ok=421381,error=0, records=41
[INFO ] 2026-05-31 21:08:33.873 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:08:37.680 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:08:42.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 21:08:42.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421382,ok=421382,error=0, records=41
[INFO ] 2026-05-31 21:08:48.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:08:48.874 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 21:08:52.685 [21875] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:08:57.975 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 21:08:57.975 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421383,ok=421383,error=0, records=41
[INFO ] 2026-05-31 21:09:03.875 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:09:07.691 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:09:11.405 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17553/300s
[INFO ] 2026-05-31 21:09:11.407 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893096},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:09:11.578 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:09:11.578 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 21:09:11.578 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:09:11.578 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:09:11.578 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:09:11.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:09:11.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:09:11.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:09:11.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:09:11.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:09:11.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:09:12.980 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-05-31 21:09:12.980 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421384,ok=421384,error=0, records=41
[INFO ] 2026-05-31 21:09:18.876 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:09:22.695 [21875] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:09:27.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-05-31 21:09:27.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421385,ok=421385,error=0, records=41
[INFO ] 2026-05-31 21:09:33.876 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:09:37.700 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:09:43.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-05-31 21:09:43.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421386,ok=421386,error=0, records=41
[INFO ] 2026-05-31 21:09:43.000 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21071/300s
[INFO ] 2026-05-31 21:09:43.702 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21075/300s
[INFO ] 2026-05-31 21:09:48.877 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:09:52.705 [21860] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:09:58.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-05-31 21:09:58.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421387,ok=421387,error=0, records=41
[INFO ] 2026-05-31 21:10:00.454 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21084/300s
[INFO ] 2026-05-31 21:10:03.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:10:07.709 [21860] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:10:13.059 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-05-31 21:10:13.059 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421388,ok=421388,error=0, records=41
[INFO ] 2026-05-31 21:10:18.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:10:22.715 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:10:28.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 21:10:28.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421389,ok=421389,error=0, records=41
[INFO ] 2026-05-31 21:10:33.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:10:37.721 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:10:40.317 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21084/300s
[INFO ] 2026-05-31 21:10:42.202 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21071/300s
[INFO ] 2026-05-31 21:10:43.071 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 21:10:43.071 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421390,ok=421390,error=0, records=41
[INFO ] 2026-05-31 21:10:48.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:10:52.727 [21860] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:10:58.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-05-31 21:10:58.077 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421391,ok=421391,error=0, records=41
[INFO ] 2026-05-31 21:11:03.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:11:07.731 [21860] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:11:13.084 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-05-31 21:11:13.084 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421392,ok=421392,error=0, records=41
[INFO ] 2026-05-31 21:11:18.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:11:22.736 [21860] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:11:26.775 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21080/300s
[INFO ] 2026-05-31 21:11:28.090 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-05-31 21:11:28.090 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421393,ok=421393,error=0, records=41
[INFO ] 2026-05-31 21:11:33.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:11:37.741 [21880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:11:43.095 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-05-31 21:11:43.095 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421394,ok=421394,error=0, records=41
[INFO ] 2026-05-31 21:11:48.882 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:11:52.746 [21875] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:11:58.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-05-31 21:11:58.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421395,ok=421395,error=0, records=41
[INFO ] 2026-05-31 21:12:03.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:12:03.883 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21083/300s
[WARN ] 2026-05-31 21:12:07.752 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:12:11.580 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20893020},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:12:11.754 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:12:11.754 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 21:12:11.754 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:12:11.754 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:12:11.754 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:12:11.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:12:11.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:12:11.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:12:11.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:12:11.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:12:11.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:12:13.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 21:12:13.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421396,ok=421396,error=0, records=41
[INFO ] 2026-05-31 21:12:18.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:12:22.757 [21880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:12:28.122 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-05-31 21:12:28.122 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421397,ok=421397,error=0, records=41
[INFO ] 2026-05-31 21:12:33.884 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:12:35.052 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21081/300s
[INFO ] 2026-05-31 21:12:36.954 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21081/300s
[WARN ] 2026-05-31 21:12:37.761 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:12:43.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-05-31 21:12:43.131 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421398,ok=421398,error=0, records=41
[INFO ] 2026-05-31 21:12:44.861 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21081/300s
[INFO ] 2026-05-31 21:12:48.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:12:52.765 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:12:58.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-05-31 21:12:58.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421399,ok=421399,error=0, records=41
[INFO ] 2026-05-31 21:13:03.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:13:07.770 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:13:13.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-05-31 21:13:13.144 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421400,ok=421400,error=0, records=41
[INFO ] 2026-05-31 21:13:18.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:13:22.774 [21860] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:13:28.149 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-05-31 21:13:28.150 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421401,ok=421401,error=0, records=41
[INFO ] 2026-05-31 21:13:33.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 21:13:33.887 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 21:13:37.780 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:13:43.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 21:13:43.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421402,ok=421402,error=0, records=41
[INFO ] 2026-05-31 21:13:48.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:13:52.784 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:13:58.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-05-31 21:13:58.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421403,ok=421403,error=0, records=41
[INFO ] 2026-05-31 21:14:03.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:14:07.790 [21880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:14:13.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-05-31 21:14:13.166 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421404,ok=421404,error=0, records=41
[INFO ] 2026-05-31 21:14:18.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:14:22.796 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:14:28.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-05-31 21:14:28.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421405,ok=421405,error=0, records=41
[INFO ] 2026-05-31 21:14:33.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:14:37.800 [21848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:14:43.177 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-05-31 21:14:43.177 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421406,ok=421406,error=0, records=41
[INFO ] 2026-05-31 21:14:43.177 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21072/300s
[INFO ] 2026-05-31 21:14:43.802 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21076/300s
[INFO ] 2026-05-31 21:14:48.890 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:14:52.805 [21880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:14:58.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 21:14:58.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421407,ok=421407,error=0, records=41
[INFO ] 2026-05-31 21:15:00.458 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21085/300s
[INFO ] 2026-05-31 21:15:03.890 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:15:07.812 [22440] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:15:11.754 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17554/300s
[INFO ] 2026-05-31 21:15:11.756 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892920},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:15:11.909 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:15:11.910 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-05-31 21:15:11.910 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:15:11.910 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:15:11.910 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:15:11.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:15:11.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:15:11.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:15:11.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:15:11.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:15:11.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:15:13.200 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 21:15:13.200 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421408,ok=421408,error=0, records=41
[INFO ] 2026-05-31 21:15:18.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:15:22.817 [22461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:15:28.206 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-05-31 21:15:28.206 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421409,ok=421409,error=0, records=41
[INFO ] 2026-05-31 21:15:33.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:15:37.821 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:15:40.323 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21085/300s
[INFO ] 2026-05-31 21:15:42.381 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21072/300s
[INFO ] 2026-05-31 21:15:43.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-05-31 21:15:43.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421410,ok=421410,error=0, records=41
[INFO ] 2026-05-31 21:15:48.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:15:52.826 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:15:58.216 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-05-31 21:15:58.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421411,ok=421411,error=0, records=41
[INFO ] 2026-05-31 21:16:03.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:16:07.830 [22456] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:16:13.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 21:16:13.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421412,ok=421412,error=0, records=41
[INFO ] 2026-05-31 21:16:18.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:16:22.835 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:16:26.832 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21081/300s
[INFO ] 2026-05-31 21:16:28.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 21:16:28.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421413,ok=421413,error=0, records=41
[INFO ] 2026-05-31 21:16:33.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:16:37.840 [22461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:16:43.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-05-31 21:16:43.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421414,ok=421414,error=0, records=41
[INFO ] 2026-05-31 21:16:48.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:16:52.846 [22456] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:16:58.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-05-31 21:16:58.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421415,ok=421415,error=0, records=41
[INFO ] 2026-05-31 21:17:03.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:17:03.896 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21084/300s
[WARN ] 2026-05-31 21:17:07.850 [22476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:17:13.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-05-31 21:17:13.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421416,ok=421416,error=0, records=41
[INFO ] 2026-05-31 21:17:18.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:17:22.855 [22518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:17:28.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-05-31 21:17:28.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421417,ok=421417,error=0, records=41
[INFO ] 2026-05-31 21:17:33.897 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:17:35.120 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21082/300s
[INFO ] 2026-05-31 21:17:37.021 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21082/300s
[WARN ] 2026-05-31 21:17:37.860 [22518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:17:43.255 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 21:17:43.255 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421418,ok=421418,error=0, records=41
[INFO ] 2026-05-31 21:17:44.927 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21082/300s
[INFO ] 2026-05-31 21:17:48.897 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:17:52.866 [22518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:17:58.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-05-31 21:17:58.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421419,ok=421419,error=0, records=41
[INFO ] 2026-05-31 21:18:03.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:18:07.871 [22518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:18:11.911 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892848},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:18:12.080 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:18:12.080 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-05-31 21:18:12.080 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:18:12.080 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:18:12.080 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:18:12.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:18:12.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:18:12.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:18:12.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:18:12.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:18:12.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:18:13.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-05-31 21:18:13.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421420,ok=421420,error=0, records=41
[INFO ] 2026-05-31 21:18:18.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:18:22.875 [22611] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:18:28.319 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 21:18:28.319 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421421,ok=421421,error=0, records=41
[INFO ] 2026-05-31 21:18:33.899 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:18:37.880 [22611] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:18:43.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 21:18:43.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421422,ok=421422,error=0, records=41
[INFO ] 2026-05-31 21:18:48.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:18:52.886 [22664] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:18:58.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-05-31 21:18:58.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421423,ok=421423,error=0, records=41
[INFO ] 2026-05-31 21:19:03.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:19:07.892 [22680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:19:13.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-05-31 21:19:13.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421424,ok=421424,error=0, records=41
[INFO ] 2026-05-31 21:19:18.901 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:19:22.897 [22680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:19:28.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-05-31 21:19:28.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421425,ok=421425,error=0, records=41
[INFO ] 2026-05-31 21:19:33.901 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:19:37.902 [22680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:19:43.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-05-31 21:19:43.355 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421426,ok=421426,error=0, records=41
[INFO ] 2026-05-31 21:19:43.355 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21073/300s
[INFO ] 2026-05-31 21:19:43.904 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21077/300s
[INFO ] 2026-05-31 21:19:48.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:19:52.908 [22708] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:19:58.362 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-05-31 21:19:58.362 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421427,ok=421427,error=0, records=41
[INFO ] 2026-05-31 21:20:00.461 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21086/300s
[INFO ] 2026-05-31 21:20:03.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:20:07.914 [22697] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:20:13.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-05-31 21:20:13.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421428,ok=421428,error=0, records=41
[INFO ] 2026-05-31 21:20:18.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:20:22.919 [22765] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:20:28.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-05-31 21:20:28.417 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421429,ok=421429,error=0, records=41
[INFO ] 2026-05-31 21:20:33.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:20:37.925 [22770] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:20:40.330 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21086/300s
[INFO ] 2026-05-31 21:20:42.563 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21073/300s
[INFO ] 2026-05-31 21:20:43.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-05-31 21:20:43.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421430,ok=421430,error=0, records=41
[INFO ] 2026-05-31 21:20:48.904 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:20:52.931 [22784] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:20:58.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-05-31 21:20:58.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421431,ok=421431,error=0, records=41
[INFO ] 2026-05-31 21:21:03.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:21:07.936 [22808] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:21:12.080 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17555/300s
[INFO ] 2026-05-31 21:21:12.082 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892752},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:21:12.263 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:21:12.263 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-05-31 21:21:12.263 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:21:12.263 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:21:12.263 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:21:12.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:21:12.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:21:12.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:21:12.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:21:12.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:21:12.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:21:13.436 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-05-31 21:21:13.436 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421432,ok=421432,error=0, records=41
[INFO ] 2026-05-31 21:21:18.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:21:22.942 [22826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:21:26.884 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21082/300s
[INFO ] 2026-05-31 21:21:28.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-05-31 21:21:28.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421433,ok=421433,error=0, records=41
[INFO ] 2026-05-31 21:21:33.906 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:21:37.947 [22843] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:21:43.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-05-31 21:21:43.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421434,ok=421434,error=0, records=41
[INFO ] 2026-05-31 21:21:48.906 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:21:52.952 [22770] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:21:58.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-05-31 21:21:58.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421435,ok=421435,error=0, records=41
[INFO ] 2026-05-31 21:22:03.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:22:03.907 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21085/300s
[WARN ] 2026-05-31 21:22:07.957 [22814] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:22:13.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-05-31 21:22:13.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421436,ok=421436,error=0, records=41
[INFO ] 2026-05-31 21:22:18.908 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:22:22.961 [22859] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:22:28.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-05-31 21:22:28.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421437,ok=421437,error=0, records=41
[INFO ] 2026-05-31 21:22:33.908 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:22:35.168 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21083/300s
[INFO ] 2026-05-31 21:22:37.069 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21083/300s
[WARN ] 2026-05-31 21:22:37.965 [22901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:22:43.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-05-31 21:22:43.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421438,ok=421438,error=0, records=41
[INFO ] 2026-05-31 21:22:44.975 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21083/300s
[INFO ] 2026-05-31 21:22:48.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:22:52.969 [22915] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:22:58.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 21:22:58.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421439,ok=421439,error=0, records=41
[INFO ] 2026-05-31 21:23:03.910 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:23:07.975 [22887] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:23:13.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 21:23:13.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421440,ok=421440,error=0, records=41
[INFO ] 2026-05-31 21:23:18.910 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:23:22.981 [22943] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:23:28.487 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-05-31 21:23:28.487 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421441,ok=421441,error=0, records=41
[INFO ] 2026-05-31 21:23:33.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 21:23:33.911 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 21:23:37.986 [22814] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:23:43.493 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-05-31 21:23:43.493 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421442,ok=421442,error=0, records=41
[INFO ] 2026-05-31 21:23:48.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:23:48.911 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 21:23:52.991 [22915] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:23:58.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-05-31 21:23:58.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421443,ok=421443,error=0, records=41
[INFO ] 2026-05-31 21:24:03.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:24:07.995 [22814] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:24:12.265 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892676},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:24:12.418 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:24:12.418 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 21:24:12.418 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:24:12.418 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:24:12.418 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:24:12.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:24:12.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:24:12.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:24:12.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:24:12.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:24:12.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:24:13.504 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-05-31 21:24:13.505 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421444,ok=421444,error=0, records=41
[INFO ] 2026-05-31 21:24:18.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:24:23.000 [22814] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:24:28.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-05-31 21:24:28.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421445,ok=421445,error=0, records=41
[INFO ] 2026-05-31 21:24:33.914 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:24:38.005 [22915] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:24:43.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-05-31 21:24:43.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421446,ok=421446,error=0, records=41
[INFO ] 2026-05-31 21:24:43.514 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21074/300s
[INFO ] 2026-05-31 21:24:44.007 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21078/300s
[INFO ] 2026-05-31 21:24:48.914 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:24:53.010 [22814] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:24:58.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 21:24:58.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421447,ok=421447,error=0, records=41
[INFO ] 2026-05-31 21:25:00.464 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21087/300s
[INFO ] 2026-05-31 21:25:03.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:25:08.015 [23041] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:25:13.525 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-05-31 21:25:13.525 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421448,ok=421448,error=0, records=41
[INFO ] 2026-05-31 21:25:18.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:25:23.020 [23041] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:25:28.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-05-31 21:25:28.532 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421449,ok=421449,error=0, records=41
[INFO ] 2026-05-31 21:25:33.916 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:25:38.024 [22985] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:25:40.336 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21087/300s
[INFO ] 2026-05-31 21:25:42.741 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21074/300s
[INFO ] 2026-05-31 21:25:43.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-05-31 21:25:43.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421450,ok=421450,error=0, records=41
[INFO ] 2026-05-31 21:25:48.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:25:53.029 [23083] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:25:58.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-05-31 21:25:58.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421451,ok=421451,error=0, records=41
[INFO ] 2026-05-31 21:26:03.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:26:08.034 [23097] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:26:13.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 21:26:13.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421452,ok=421452,error=0, records=41
[INFO ] 2026-05-31 21:26:18.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:26:23.039 [22943] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:26:26.933 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21083/300s
[INFO ] 2026-05-31 21:26:28.562 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-05-31 21:26:28.562 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421453,ok=421453,error=0, records=41
[INFO ] 2026-05-31 21:26:33.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:26:38.043 [23112] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:26:43.567 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-05-31 21:26:43.567 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421454,ok=421454,error=0, records=41
[INFO ] 2026-05-31 21:26:48.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:26:53.049 [23146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:26:58.572 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-05-31 21:26:58.572 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421455,ok=421455,error=0, records=41
[INFO ] 2026-05-31 21:27:03.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:27:03.920 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21086/300s
[WARN ] 2026-05-31 21:27:07.554 [23161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:27:12.418 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17556/300s
[INFO ] 2026-05-31 21:27:12.420 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892596},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:27:12.591 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:27:12.591 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 21:27:12.591 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:27:12.591 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:27:12.591 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:27:12.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:27:12.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:27:12.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:27:12.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:27:12.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:27:12.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:27:13.579 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-05-31 21:27:13.579 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421456,ok=421456,error=0, records=41
[INFO ] 2026-05-31 21:27:18.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:27:22.560 [23162] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:27:28.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 21:27:28.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421457,ok=421457,error=0, records=41
[INFO ] 2026-05-31 21:27:33.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:27:35.192 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21084/300s
[INFO ] 2026-05-31 21:27:37.094 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21084/300s
[WARN ] 2026-05-31 21:27:37.564 [23184] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:27:43.593 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-05-31 21:27:43.593 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421458,ok=421458,error=0, records=41
[INFO ] 2026-05-31 21:27:44.999 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21084/300s
[INFO ] 2026-05-31 21:27:48.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:27:52.567 [23184] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:27:58.598 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 21:27:58.599 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421459,ok=421459,error=0, records=41
[INFO ] 2026-05-31 21:28:03.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:28:07.572 [23232] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:28:13.607 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-05-31 21:28:13.607 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421460,ok=421460,error=0, records=41
[INFO ] 2026-05-31 21:28:18.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:28:22.576 [23255] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:28:28.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-05-31 21:28:28.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421461,ok=421461,error=0, records=41
[INFO ] 2026-05-31 21:28:33.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:28:37.581 [23248] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:28:43.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-05-31 21:28:43.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421462,ok=421462,error=0, records=41
[INFO ] 2026-05-31 21:28:48.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:28:52.585 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:28:58.624 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-05-31 21:28:58.624 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421463,ok=421463,error=0, records=41
[INFO ] 2026-05-31 21:29:03.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:29:07.590 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:29:13.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-05-31 21:29:13.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421464,ok=421464,error=0, records=41
[INFO ] 2026-05-31 21:29:18.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:29:22.595 [23308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:29:28.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-05-31 21:29:28.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421465,ok=421465,error=0, records=41
[INFO ] 2026-05-31 21:29:33.926 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:29:37.600 [23318] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:29:43.641 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 21:29:43.641 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421466,ok=421466,error=0, records=41
[INFO ] 2026-05-31 21:29:43.641 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21075/300s
[INFO ] 2026-05-31 21:29:44.101 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21079/300s
[INFO ] 2026-05-31 21:29:48.926 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:29:52.605 [23333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:29:58.646 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-05-31 21:29:58.646 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421467,ok=421467,error=0, records=41
[INFO ] 2026-05-31 21:30:00.467 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21088/300s
[INFO ] 2026-05-31 21:30:03.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:30:07.611 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:30:12.593 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892516},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:30:12.761 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:30:12.761 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-05-31 21:30:12.761 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:30:12.761 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:30:12.761 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:30:12.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:30:12.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:30:12.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:30:12.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:30:12.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:30:12.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:30:13.651 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-05-31 21:30:13.651 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421468,ok=421468,error=0, records=41
[INFO ] 2026-05-31 21:30:18.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:30:22.616 [23318] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:30:28.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-05-31 21:30:28.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421469,ok=421469,error=0, records=41
[INFO ] 2026-05-31 21:30:33.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:30:37.621 [23308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:30:40.342 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21088/300s
[INFO ] 2026-05-31 21:30:42.919 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21075/300s
[INFO ] 2026-05-31 21:30:43.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-05-31 21:30:43.663 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421470,ok=421470,error=0, records=41
[INFO ] 2026-05-31 21:30:48.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:30:52.627 [23318] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:30:58.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-05-31 21:30:58.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421471,ok=421471,error=0, records=41
[INFO ] 2026-05-31 21:31:03.929 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:31:07.633 [23333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:31:13.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 21:31:13.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421472,ok=421472,error=0, records=41
[INFO ] 2026-05-31 21:31:18.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:31:22.638 [23333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:31:26.988 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21084/300s
[INFO ] 2026-05-31 21:31:28.681 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 21:31:28.681 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421473,ok=421473,error=0, records=41
[INFO ] 2026-05-31 21:31:33.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:31:37.643 [23308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:31:43.686 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-05-31 21:31:43.686 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421474,ok=421474,error=0, records=41
[INFO ] 2026-05-31 21:31:48.931 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:31:52.649 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:31:58.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-05-31 21:31:58.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421475,ok=421475,error=0, records=41
[INFO ] 2026-05-31 21:32:03.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:32:03.932 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21087/300s
[WARN ] 2026-05-31 21:32:07.655 [23333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:32:13.699 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-05-31 21:32:13.699 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421476,ok=421476,error=0, records=41
[INFO ] 2026-05-31 21:32:18.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:32:22.660 [23318] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:32:28.704 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-05-31 21:32:28.704 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421477,ok=421477,error=0, records=41
[INFO ] 2026-05-31 21:32:33.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:32:35.240 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21085/300s
[INFO ] 2026-05-31 21:32:37.142 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21085/300s
[WARN ] 2026-05-31 21:32:37.664 [23308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:32:43.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-05-31 21:32:43.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421478,ok=421478,error=0, records=41
[INFO ] 2026-05-31 21:32:45.049 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21085/300s
[INFO ] 2026-05-31 21:32:48.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:32:52.669 [23308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:32:58.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 21:32:58.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421479,ok=421479,error=0, records=41
[INFO ] 2026-05-31 21:33:03.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:33:07.674 [23318] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:33:12.761 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17557/300s
[INFO ] 2026-05-31 21:33:12.763 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892436},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:33:12.916 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:33:12.916 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-05-31 21:33:13.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 21:33:13.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421480,ok=421480,error=0, records=41
[INFO ] 2026-05-31 21:33:18.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:33:22.679 [23323] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:33:28.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-05-31 21:33:28.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421481,ok=421481,error=0, records=41
[INFO ] 2026-05-31 21:33:33.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 21:33:33.935 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 21:33:37.685 [23333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:33:43.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-05-31 21:33:43.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421482,ok=421482,error=0, records=41
[INFO ] 2026-05-31 21:33:48.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:33:52.690 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:33:58.740 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-05-31 21:33:58.740 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421483,ok=421483,error=0, records=41
[INFO ] 2026-05-31 21:34:03.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:34:07.696 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:34:13.745 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-05-31 21:34:13.745 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421484,ok=421484,error=0, records=41
[INFO ] 2026-05-31 21:34:18.937 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:34:22.702 [23333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:34:28.750 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-05-31 21:34:28.750 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421485,ok=421485,error=0, records=41
[INFO ] 2026-05-31 21:34:33.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:34:37.709 [23318] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:34:43.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-05-31 21:34:43.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421486,ok=421486,error=0, records=41
[INFO ] 2026-05-31 21:34:43.756 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21076/300s
[INFO ] 2026-05-31 21:34:44.211 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21080/300s
[INFO ] 2026-05-31 21:34:48.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:34:52.714 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:34:58.761 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-05-31 21:34:58.761 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421487,ok=421487,error=0, records=41
[INFO ] 2026-05-31 21:35:00.470 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21089/300s
[INFO ] 2026-05-31 21:35:03.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:35:07.721 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:35:13.769 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-05-31 21:35:13.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421488,ok=421488,error=0, records=41
[WARN ] 2026-05-31 21:35:17.725 [23266] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17991/stat), No such file or directory
[INFO ] 2026-05-31 21:35:18.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:35:22.726 [23308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:35:28.774 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-05-31 21:35:28.774 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421489,ok=421489,error=0, records=41
[WARN ] 2026-05-31 21:35:32.730 [23333] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17991/stat), No such file or directory
[INFO ] 2026-05-31 21:35:33.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:35:37.732 [23323] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:35:40.348 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21089/300s
[INFO ] 2026-05-31 21:35:43.099 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21076/300s
[INFO ] 2026-05-31 21:35:43.779 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-05-31 21:35:43.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421490,ok=421490,error=0, records=41
[WARN ] 2026-05-31 21:35:47.736 [23323] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17991/stat), No such file or directory
[WARN ] 2026-05-31 21:35:47.736 [23323] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/13145/stat), No such file or directory
[WARN ] 2026-05-31 21:35:47.737 [23323] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/13111/stat), No such file or directory
[INFO ] 2026-05-31 21:35:48.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:35:52.737 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:35:58.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-05-31 21:35:58.783 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421491,ok=421491,error=0, records=41
[INFO ] 2026-05-31 21:36:03.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:36:07.743 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:36:12.918 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892340},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:36:13.084 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:36:13.084 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 21:36:13.084 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:36:13.084 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:36:13.084 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:36:13.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:36:13.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:36:13.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:36:13.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:36:13.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:36:13.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:36:13.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-05-31 21:36:13.789 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421492,ok=421492,error=0, records=41
[INFO ] 2026-05-31 21:36:18.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:36:22.748 [23318] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:36:27.038 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21085/300s
[INFO ] 2026-05-31 21:36:28.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 21:36:28.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421493,ok=421493,error=0, records=41
[INFO ] 2026-05-31 21:36:33.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:36:37.753 [23318] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:36:43.799 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 21:36:43.799 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421494,ok=421494,error=0, records=41
[INFO ] 2026-05-31 21:36:48.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:36:52.757 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:36:58.805 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-05-31 21:36:58.805 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421495,ok=421495,error=0, records=41
[INFO ] 2026-05-31 21:37:03.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:37:03.943 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21088/300s
[WARN ] 2026-05-31 21:37:07.762 [23308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:37:13.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-05-31 21:37:13.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421496,ok=421496,error=0, records=41
[INFO ] 2026-05-31 21:37:18.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:37:22.767 [23308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:37:28.816 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 21:37:28.816 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421497,ok=421497,error=0, records=41
[INFO ] 2026-05-31 21:37:33.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:37:35.279 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21086/300s
[INFO ] 2026-05-31 21:37:37.180 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21086/300s
[WARN ] 2026-05-31 21:37:37.772 [23333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:37:43.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-05-31 21:37:43.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421498,ok=421498,error=0, records=41
[INFO ] 2026-05-31 21:37:45.084 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21086/300s
[INFO ] 2026-05-31 21:37:48.945 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:37:52.777 [23333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:37:58.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 21:37:58.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421499,ok=421499,error=0, records=41
[INFO ] 2026-05-31 21:38:03.945 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:38:07.783 [23318] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:38:13.833 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-05-31 21:38:13.833 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421500,ok=421500,error=0, records=41
[INFO ] 2026-05-31 21:38:18.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:38:22.788 [23308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:38:28.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 21:38:28.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421501,ok=421501,error=0, records=41
[INFO ] 2026-05-31 21:38:33.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:38:37.793 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:38:43.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 21:38:43.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421502,ok=421502,error=0, records=41
[INFO ] 2026-05-31 21:38:48.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:38:48.947 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 21:38:52.798 [23323] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:38:58.852 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-05-31 21:38:58.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421503,ok=421503,error=0, records=41
[INFO ] 2026-05-31 21:39:03.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:39:07.803 [23266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:39:13.084 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17558/300s
[INFO ] 2026-05-31 21:39:13.085 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892264},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:39:13.258 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:39:13.258 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 21:39:13.258 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:39:13.258 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:39:13.258 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:39:13.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:39:13.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:39:13.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:39:13.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:39:13.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:39:13.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:39:13.857 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-05-31 21:39:13.857 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421504,ok=421504,error=0, records=41
[INFO ] 2026-05-31 21:39:18.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:39:22.808 [23308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:39:28.862 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-05-31 21:39:28.862 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421505,ok=421505,error=0, records=41
[INFO ] 2026-05-31 21:39:33.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:39:37.816 [23939] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:39:43.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-05-31 21:39:43.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421506,ok=421506,error=0, records=41
[INFO ] 2026-05-31 21:39:43.867 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21077/300s
[INFO ] 2026-05-31 21:39:44.318 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21081/300s
[INFO ] 2026-05-31 21:39:48.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:39:52.821 [23924] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:39:58.876 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-05-31 21:39:58.876 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421507,ok=421507,error=0, records=41
[INFO ] 2026-05-31 21:40:00.472 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21090/300s
[INFO ] 2026-05-31 21:40:03.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:40:07.827 [23978] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:40:13.884 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-05-31 21:40:13.884 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421508,ok=421508,error=0, records=41
[INFO ] 2026-05-31 21:40:18.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:40:22.832 [23924] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:40:28.890 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 21:40:28.890 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421509,ok=421509,error=0, records=41
[INFO ] 2026-05-31 21:40:33.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:40:37.837 [23944] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:40:40.354 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21090/300s
[INFO ] 2026-05-31 21:40:43.269 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21077/300s
[INFO ] 2026-05-31 21:40:43.956 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 21:40:43.956 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421510,ok=421510,error=0, records=41
[INFO ] 2026-05-31 21:40:48.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:40:52.842 [23992] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:40:58.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-05-31 21:40:58.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421511,ok=421511,error=0, records=41
[INFO ] 2026-05-31 21:41:03.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:41:07.846 [23992] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:41:13.967 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-05-31 21:41:13.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421512,ok=421512,error=0, records=41
[INFO ] 2026-05-31 21:41:18.955 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:41:22.850 [23913] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:41:27.089 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21086/300s
[INFO ] 2026-05-31 21:41:28.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-05-31 21:41:28.973 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421513,ok=421513,error=0, records=41
[INFO ] 2026-05-31 21:41:33.955 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:41:37.856 [23913] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:41:43.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-05-31 21:41:43.979 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421514,ok=421514,error=0, records=41
[INFO ] 2026-05-31 21:41:48.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:41:52.863 [24061] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:41:58.983 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-05-31 21:41:58.983 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421515,ok=421515,error=0, records=41
[INFO ] 2026-05-31 21:42:03.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:42:03.956 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21089/300s
[WARN ] 2026-05-31 21:42:07.869 [24076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:42:13.260 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892180},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:42:13.435 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:42:13.435 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 21:42:13.435 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:42:13.436 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:42:13.436 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:42:13.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:42:13.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:42:13.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:42:13.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:42:13.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:42:13.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:42:13.990 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-05-31 21:42:13.990 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421516,ok=421516,error=0, records=41
[INFO ] 2026-05-31 21:42:18.957 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:42:22.874 [24076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:42:28.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 21:42:28.996 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421517,ok=421517,error=0, records=41
[INFO ] 2026-05-31 21:42:33.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:42:35.314 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21087/300s
[INFO ] 2026-05-31 21:42:37.216 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21087/300s
[WARN ] 2026-05-31 21:42:37.881 [24016] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:42:44.001 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 21:42:44.001 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421518,ok=421518,error=0, records=41
[INFO ] 2026-05-31 21:42:45.124 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21087/300s
[INFO ] 2026-05-31 21:42:48.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:42:52.886 [24119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:42:59.009 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 21:42:59.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421519,ok=421519,error=0, records=41
[INFO ] 2026-05-31 21:43:03.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:43:07.891 [24157] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:43:14.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 21:43:14.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421520,ok=421520,error=0, records=41
[INFO ] 2026-05-31 21:43:18.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:43:22.896 [24168] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:43:29.025 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-05-31 21:43:29.025 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421521,ok=421521,error=0, records=41
[INFO ] 2026-05-31 21:43:33.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 21:43:33.960 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 21:43:37.901 [24169] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:43:44.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-05-31 21:43:44.032 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421522,ok=421522,error=0, records=41
[INFO ] 2026-05-31 21:43:48.961 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:43:52.906 [24185] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:43:59.037 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-05-31 21:43:59.037 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421523,ok=421523,error=0, records=41
[INFO ] 2026-05-31 21:44:03.961 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:44:07.911 [24214] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:44:14.122 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-05-31 21:44:14.122 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421524,ok=421524,error=0, records=41
[INFO ] 2026-05-31 21:44:18.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:44:22.917 [24242] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:44:29.127 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-05-31 21:44:29.127 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421525,ok=421525,error=0, records=41
[INFO ] 2026-05-31 21:44:33.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:44:37.922 [24247] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:44:44.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 21:44:44.138 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421526,ok=421526,error=0, records=41
[INFO ] 2026-05-31 21:44:44.138 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21078/300s
[INFO ] 2026-05-31 21:44:44.425 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21082/300s
[INFO ] 2026-05-31 21:44:48.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:44:52.929 [24270] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:44:59.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 21:44:59.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421527,ok=421527,error=0, records=41
[INFO ] 2026-05-31 21:45:00.475 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21091/300s
[INFO ] 2026-05-31 21:45:03.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:45:07.934 [24270] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:45:13.436 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17559/300s
[INFO ] 2026-05-31 21:45:13.437 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892104},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:45:13.596 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:45:13.596 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 21:45:13.596 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:45:13.596 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:45:13.596 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:45:13.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:45:13.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:45:13.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:45:13.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:45:13.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:45:13.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:45:14.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-05-31 21:45:14.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421528,ok=421528,error=0, records=41
[INFO ] 2026-05-31 21:45:18.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:45:22.939 [24305] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:45:29.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-05-31 21:45:29.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421529,ok=421529,error=0, records=41
[INFO ] 2026-05-31 21:45:33.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:45:37.946 [24305] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:45:40.360 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21091/300s
[INFO ] 2026-05-31 21:45:43.447 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21078/300s
[INFO ] 2026-05-31 21:45:44.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-05-31 21:45:44.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421530,ok=421530,error=0, records=41
[INFO ] 2026-05-31 21:45:48.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:45:52.951 [24339] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:45:59.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10397, records=41
[INFO ] 2026-05-31 21:45:59.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421531,ok=421531,error=0, records=41
[INFO ] 2026-05-31 21:46:03.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:46:07.956 [24297] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:46:14.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-05-31 21:46:14.170 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421532,ok=421532,error=0, records=41
[INFO ] 2026-05-31 21:46:18.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:46:22.961 [24332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:46:27.138 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21087/300s
[INFO ] 2026-05-31 21:46:29.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-05-31 21:46:29.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421533,ok=421533,error=0, records=41
[INFO ] 2026-05-31 21:46:33.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:46:37.966 [24354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:46:44.184 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 21:46:44.184 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421534,ok=421534,error=0, records=41
[INFO ] 2026-05-31 21:46:48.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:46:52.970 [24321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:46:59.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 21:46:59.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421535,ok=421535,error=0, records=41
[INFO ] 2026-05-31 21:47:03.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:47:03.967 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21090/300s
[WARN ] 2026-05-31 21:47:07.974 [24297] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:47:14.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-05-31 21:47:14.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421536,ok=421536,error=0, records=41
[INFO ] 2026-05-31 21:47:18.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:47:22.980 [24321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:47:29.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-05-31 21:47:29.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421537,ok=421537,error=0, records=41
[INFO ] 2026-05-31 21:47:33.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:47:35.341 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21088/300s
[INFO ] 2026-05-31 21:47:37.243 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21088/300s
[WARN ] 2026-05-31 21:47:37.985 [24423] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:47:44.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-05-31 21:47:44.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421538,ok=421538,error=0, records=41
[INFO ] 2026-05-31 21:47:45.150 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21088/300s
[INFO ] 2026-05-31 21:47:48.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:47:52.990 [24437] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:47:59.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 21:47:59.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421539,ok=421539,error=0, records=41
[INFO ] 2026-05-31 21:48:03.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:48:07.995 [24423] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:48:13.598 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20892024},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:48:13.760 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:48:13.760 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 21:48:13.760 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:48:13.760 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:48:13.760 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:48:13.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:48:13.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:48:13.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:48:13.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:48:13.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:48:13.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:48:14.221 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-05-31 21:48:14.221 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421540,ok=421540,error=0, records=41
[INFO ] 2026-05-31 21:48:18.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:48:23.000 [24480] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:48:29.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-05-31 21:48:29.226 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421541,ok=421541,error=0, records=41
[INFO ] 2026-05-31 21:48:33.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:48:38.004 [24423] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:48:44.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 21:48:44.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421542,ok=421542,error=0, records=41
[INFO ] 2026-05-31 21:48:48.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:48:53.008 [24494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:48:59.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 21:48:59.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421543,ok=421543,error=0, records=41
[INFO ] 2026-05-31 21:49:03.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:49:08.014 [24494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:49:14.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-05-31 21:49:14.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421544,ok=421544,error=0, records=41
[INFO ] 2026-05-31 21:49:18.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:49:23.019 [24480] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:49:29.255 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-05-31 21:49:29.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421545,ok=421545,error=0, records=41
[INFO ] 2026-05-31 21:49:33.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:49:38.025 [24536] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:49:44.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-05-31 21:49:44.262 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421546,ok=421546,error=0, records=41
[INFO ] 2026-05-31 21:49:44.263 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21079/300s
[INFO ] 2026-05-31 21:49:44.526 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21083/300s
[INFO ] 2026-05-31 21:49:48.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:49:53.029 [24522] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:49:59.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-05-31 21:49:59.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421547,ok=421547,error=0, records=41
[INFO ] 2026-05-31 21:50:00.478 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21092/300s
[INFO ] 2026-05-31 21:50:03.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:50:08.036 [24522] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:50:14.272 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-05-31 21:50:14.272 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421548,ok=421548,error=0, records=41
[INFO ] 2026-05-31 21:50:18.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:50:23.042 [24368] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:50:29.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 21:50:29.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421549,ok=421549,error=0, records=41
[INFO ] 2026-05-31 21:50:33.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:50:38.046 [24620] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:50:40.366 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21092/300s
[INFO ] 2026-05-31 21:50:43.627 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21079/300s
[INFO ] 2026-05-31 21:50:44.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-05-31 21:50:44.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421550,ok=421550,error=0, records=41
[INFO ] 2026-05-31 21:50:48.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:50:53.051 [24584] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:50:59.289 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-05-31 21:50:59.289 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421551,ok=421551,error=0, records=41
[INFO ] 2026-05-31 21:51:03.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:51:07.555 [24653] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:51:13.760 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17560/300s
[INFO ] 2026-05-31 21:51:13.762 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891948},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:51:13.922 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:51:13.922 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 21:51:13.922 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:51:13.922 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:51:13.922 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:51:13.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:51:13.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:51:13.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:51:13.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:51:13.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:51:13.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:51:14.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-05-31 21:51:14.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421552,ok=421552,error=0, records=41
[INFO ] 2026-05-31 21:51:18.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:51:22.560 [24659] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:51:27.190 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21088/300s
[INFO ] 2026-05-31 21:51:29.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-05-31 21:51:29.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421553,ok=421553,error=0, records=41
[INFO ] 2026-05-31 21:51:33.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:51:37.565 [24679] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:51:44.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-05-31 21:51:44.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421554,ok=421554,error=0, records=41
[INFO ] 2026-05-31 21:51:48.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:51:52.570 [24654] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:51:59.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-05-31 21:51:59.391 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421555,ok=421555,error=0, records=41
[INFO ] 2026-05-31 21:52:03.979 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:52:03.979 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21091/300s
[WARN ] 2026-05-31 21:52:07.576 [24730] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:52:14.396 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-05-31 21:52:14.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421556,ok=421556,error=0, records=41
[INFO ] 2026-05-31 21:52:18.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:52:22.582 [24718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:52:29.405 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-05-31 21:52:29.405 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421557,ok=421557,error=0, records=41
[INFO ] 2026-05-31 21:52:33.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:52:35.382 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21089/300s
[INFO ] 2026-05-31 21:52:37.283 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21089/300s
[WARN ] 2026-05-31 21:52:37.587 [24758] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:52:44.410 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-05-31 21:52:44.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421558,ok=421558,error=0, records=41
[INFO ] 2026-05-31 21:52:45.191 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21089/300s
[INFO ] 2026-05-31 21:52:48.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:52:52.593 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:52:59.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-05-31 21:52:59.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421559,ok=421559,error=0, records=41
[INFO ] 2026-05-31 21:53:03.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:53:07.598 [24794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:53:14.421 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-05-31 21:53:14.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421560,ok=421560,error=0, records=41
[INFO ] 2026-05-31 21:53:18.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:53:22.603 [24764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:53:29.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-05-31 21:53:29.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421561,ok=421561,error=0, records=41
[INFO ] 2026-05-31 21:53:33.983 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 21:53:33.983 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 21:53:37.608 [24769] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:53:44.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-05-31 21:53:44.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421562,ok=421562,error=0, records=41
[INFO ] 2026-05-31 21:53:48.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:53:48.984 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 21:53:52.614 [24775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:53:59.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-05-31 21:53:59.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421563,ok=421563,error=0, records=41
[INFO ] 2026-05-31 21:54:03.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=24.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:54:07.621 [24794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:54:13.924 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891852},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:54:14.091 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:54:14.091 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 21:54:14.091 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:54:14.091 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:54:14.091 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:54:14.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:54:14.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:54:14.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:54:14.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:54:14.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:54:14.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:54:14.443 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-05-31 21:54:14.443 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421564,ok=421564,error=0, records=41
[INFO ] 2026-05-31 21:54:18.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:54:22.626 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:54:29.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-05-31 21:54:29.448 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421565,ok=421565,error=0, records=41
[INFO ] 2026-05-31 21:54:33.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:54:37.631 [24764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:54:44.453 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-05-31 21:54:44.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421566,ok=421566,error=0, records=41
[INFO ] 2026-05-31 21:54:44.453 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21080/300s
[INFO ] 2026-05-31 21:54:44.633 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21084/300s
[INFO ] 2026-05-31 21:54:48.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:54:52.636 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:54:59.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 21:54:59.462 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421567,ok=421567,error=0, records=41
[INFO ] 2026-05-31 21:55:00.481 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21093/300s
[INFO ] 2026-05-31 21:55:03.988 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:55:07.641 [24769] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:55:14.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-05-31 21:55:14.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421568,ok=421568,error=0, records=41
[INFO ] 2026-05-31 21:55:18.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:55:22.648 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:55:29.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-05-31 21:55:29.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421569,ok=421569,error=0, records=41
[INFO ] 2026-05-31 21:55:33.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:55:37.653 [24769] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:55:40.372 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21093/300s
[INFO ] 2026-05-31 21:55:43.810 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21080/300s
[INFO ] 2026-05-31 21:55:44.576 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-05-31 21:55:44.576 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421570,ok=421570,error=0, records=41
[INFO ] 2026-05-31 21:55:48.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:55:52.658 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:55:59.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 21:55:59.581 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421571,ok=421571,error=0, records=41
[INFO ] 2026-05-31 21:56:03.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:56:07.663 [24775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:56:14.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 21:56:14.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421572,ok=421572,error=0, records=41
[INFO ] 2026-05-31 21:56:18.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:56:22.669 [24775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:56:27.245 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21089/300s
[INFO ] 2026-05-31 21:56:29.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10150, records=41
[INFO ] 2026-05-31 21:56:29.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421573,ok=421573,error=0, records=41
[INFO ] 2026-05-31 21:56:33.992 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:56:37.675 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:56:44.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10149, records=41
[INFO ] 2026-05-31 21:56:44.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421574,ok=421574,error=0, records=41
[INFO ] 2026-05-31 21:56:48.992 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:56:52.681 [24769] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:56:59.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10142, records=41
[INFO ] 2026-05-31 21:56:59.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421575,ok=421575,error=0, records=41
[INFO ] 2026-05-31 21:57:03.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:57:03.993 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21092/300s
[WARN ] 2026-05-31 21:57:07.686 [24794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:57:14.091 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17561/300s
[INFO ] 2026-05-31 21:57:14.093 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891756},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 21:57:14.271 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 21:57:14.271 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-05-31 21:57:14.272 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 21:57:14.272 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 21:57:14.272 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 21:57:14.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:57:14.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 21:57:14.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:57:14.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 21:57:14.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:57:14.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 21:57:14.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-05-31 21:57:14.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421576,ok=421576,error=0, records=41
[INFO ] 2026-05-31 21:57:18.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:57:22.692 [24794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:57:29.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-05-31 21:57:29.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421577,ok=421577,error=0, records=41
[INFO ] 2026-05-31 21:57:33.994 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 21:57:35.441 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21090/300s
[INFO ] 2026-05-31 21:57:37.343 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21090/300s
[WARN ] 2026-05-31 21:57:37.698 [24764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:57:44.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-05-31 21:57:44.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421578,ok=421578,error=0, records=41
[INFO ] 2026-05-31 21:57:45.251 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21090/300s
[INFO ] 2026-05-31 21:57:48.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:57:52.702 [24794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:57:59.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-05-31 21:57:59.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421579,ok=421579,error=0, records=41
[INFO ] 2026-05-31 21:58:03.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:58:07.708 [24775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:58:14.640 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-05-31 21:58:14.640 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421580,ok=421580,error=0, records=41
[INFO ] 2026-05-31 21:58:18.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:58:22.713 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:58:29.646 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-05-31 21:58:29.646 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421581,ok=421581,error=0, records=41
[INFO ] 2026-05-31 21:58:33.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:58:37.718 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:58:44.651 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-05-31 21:58:44.651 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421582,ok=421582,error=0, records=41
[INFO ] 2026-05-31 21:58:48.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:58:52.722 [24775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:58:59.657 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-05-31 21:58:59.657 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421583,ok=421583,error=0, records=41
[INFO ] 2026-05-31 21:59:03.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:59:07.726 [24775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:59:14.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-05-31 21:59:14.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421584,ok=421584,error=0, records=41
[INFO ] 2026-05-31 21:59:18.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:59:22.731 [24794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:59:29.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 21:59:29.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421585,ok=421585,error=0, records=41
[INFO ] 2026-05-31 21:59:33.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:59:37.736 [24775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:59:44.678 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-05-31 21:59:44.678 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421586,ok=421586,error=0, records=41
[INFO ] 2026-05-31 21:59:44.678 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21081/300s
[INFO ] 2026-05-31 21:59:44.738 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21085/300s
[INFO ] 2026-05-31 21:59:49.000 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 21:59:52.742 [24764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 21:59:59.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 21:59:59.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421587,ok=421587,error=0, records=41
[INFO ] 2026-05-31 22:00:00.485 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21094/300s
[INFO ] 2026-05-31 22:00:04.000 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:00:07.747 [24775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:00:14.273 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891672},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:00:14.431 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:00:14.431 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-05-31 22:00:14.431 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:00:14.431 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:00:14.431 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:00:14.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:00:14.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:00:14.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:00:14.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:00:14.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:00:14.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:00:14.688 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-05-31 22:00:14.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421588,ok=421588,error=0, records=41
[INFO ] 2026-05-31 22:00:19.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:00:22.752 [24764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:00:29.694 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-05-31 22:00:29.694 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421589,ok=421589,error=0, records=41
[INFO ] 2026-05-31 22:00:34.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:00:37.756 [24764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:00:40.378 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21094/300s
[INFO ] 2026-05-31 22:00:43.991 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21081/300s
[INFO ] 2026-05-31 22:00:44.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-05-31 22:00:44.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421590,ok=421590,error=0, records=41
[INFO ] 2026-05-31 22:00:49.002 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:00:52.761 [24764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:00:59.708 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 22:00:59.708 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421591,ok=421591,error=0, records=41
[INFO ] 2026-05-31 22:01:04.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:01:07.767 [24769] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:01:14.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 22:01:14.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421592,ok=421592,error=0, records=41
[INFO ] 2026-05-31 22:01:19.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:01:22.772 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:01:27.300 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21090/300s
[INFO ] 2026-05-31 22:01:29.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-05-31 22:01:29.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421593,ok=421593,error=0, records=41
[INFO ] 2026-05-31 22:01:34.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:01:37.776 [24794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:01:44.730 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 22:01:44.730 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421594,ok=421594,error=0, records=41
[INFO ] 2026-05-31 22:01:49.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:01:52.781 [24764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:01:59.734 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-05-31 22:01:59.734 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421595,ok=421595,error=0, records=41
[INFO ] 2026-05-31 22:02:04.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:02:04.005 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21093/300s
[WARN ] 2026-05-31 22:02:07.787 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:02:14.740 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-05-31 22:02:14.740 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421596,ok=421596,error=0, records=41
[INFO ] 2026-05-31 22:02:19.006 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:02:22.794 [24764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:02:29.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-05-31 22:02:29.748 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421597,ok=421597,error=0, records=41
[INFO ] 2026-05-31 22:02:34.006 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:02:35.508 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21091/300s
[INFO ] 2026-05-31 22:02:37.409 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21091/300s
[WARN ] 2026-05-31 22:02:37.800 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:02:44.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-05-31 22:02:44.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421598,ok=421598,error=0, records=41
[INFO ] 2026-05-31 22:02:45.314 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21091/300s
[INFO ] 2026-05-31 22:02:49.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:02:52.806 [24764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:02:59.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-05-31 22:02:59.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421599,ok=421599,error=0, records=41
[INFO ] 2026-05-31 22:03:04.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:03:07.812 [24764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:03:14.431 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17562/300s
[INFO ] 2026-05-31 22:03:14.433 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891596},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:03:14.602 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:03:14.603 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 22:03:14.603 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:03:14.603 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:03:14.603 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:03:14.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:03:14.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:03:14.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:03:14.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:03:14.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:03:14.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:03:14.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10365, records=41
[INFO ] 2026-05-31 22:03:14.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421600,ok=421600,error=0, records=41
[INFO ] 2026-05-31 22:03:19.008 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:03:22.817 [25334] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:03:29.768 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10396, records=41
[INFO ] 2026-05-31 22:03:29.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421601,ok=421601,error=0, records=41
[INFO ] 2026-05-31 22:03:34.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 22:03:34.009 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 22:03:37.822 [24775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:03:44.775 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-05-31 22:03:44.775 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421602,ok=421602,error=0, records=41
[INFO ] 2026-05-31 22:03:49.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:03:52.827 [25393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:03:59.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-05-31 22:03:59.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421603,ok=421603,error=0, records=41
[INFO ] 2026-05-31 22:04:04.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:04:07.832 [25393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:04:14.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-05-31 22:04:14.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421604,ok=421604,error=0, records=41
[INFO ] 2026-05-31 22:04:19.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:04:22.837 [25379] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:04:29.792 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 22:04:29.792 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421605,ok=421605,error=0, records=41
[INFO ] 2026-05-31 22:04:34.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:04:37.841 [25379] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:04:44.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-05-31 22:04:44.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421606,ok=421606,error=0, records=41
[INFO ] 2026-05-31 22:04:44.797 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21082/300s
[INFO ] 2026-05-31 22:04:44.844 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21086/300s
[INFO ] 2026-05-31 22:04:49.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:04:52.847 [25408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:04:59.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-05-31 22:04:59.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421607,ok=421607,error=0, records=41
[INFO ] 2026-05-31 22:05:00.488 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21095/300s
[INFO ] 2026-05-31 22:05:04.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:05:07.852 [25365] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:05:14.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-05-31 22:05:14.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421608,ok=421608,error=0, records=41
[INFO ] 2026-05-31 22:05:19.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:05:22.856 [25446] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:05:29.813 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-05-31 22:05:29.813 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421609,ok=421609,error=0, records=41
[INFO ] 2026-05-31 22:05:34.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:05:37.862 [25446] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:05:40.384 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21095/300s
[INFO ] 2026-05-31 22:05:44.173 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21082/300s
[INFO ] 2026-05-31 22:05:44.818 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-05-31 22:05:44.818 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421610,ok=421610,error=0, records=41
[INFO ] 2026-05-31 22:05:49.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:05:52.866 [25474] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:05:59.848 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-05-31 22:05:59.848 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421611,ok=421611,error=0, records=41
[INFO ] 2026-05-31 22:06:04.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:06:07.871 [25502] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:06:14.604 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891520},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:06:14.771 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:06:14.772 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 22:06:14.772 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:06:14.772 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:06:14.772 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:06:14.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-05-31 22:06:14.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421612,ok=421612,error=0, records=41
[INFO ] 2026-05-31 22:06:14.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:06:14.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:06:14.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:06:14.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:06:14.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:06:14.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:06:19.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:06:22.876 [25538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:06:27.353 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21091/300s
[INFO ] 2026-05-31 22:06:29.858 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-05-31 22:06:29.858 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421613,ok=421613,error=0, records=41
[INFO ] 2026-05-31 22:06:34.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:06:37.881 [25548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:06:44.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-05-31 22:06:44.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421614,ok=421614,error=0, records=41
[INFO ] 2026-05-31 22:06:49.017 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:06:52.887 [25549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:06:59.871 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-05-31 22:06:59.871 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421615,ok=421615,error=0, records=41
[INFO ] 2026-05-31 22:07:04.017 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:07:04.018 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21094/300s
[WARN ] 2026-05-31 22:07:07.893 [25571] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:07:14.879 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-05-31 22:07:14.879 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421616,ok=421616,error=0, records=41
[INFO ] 2026-05-31 22:07:19.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:07:22.899 [25576] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:07:29.884 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-05-31 22:07:29.884 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421617,ok=421617,error=0, records=41
[INFO ] 2026-05-31 22:07:34.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:07:35.560 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21092/300s
[INFO ] 2026-05-31 22:07:37.462 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21092/300s
[WARN ] 2026-05-31 22:07:37.904 [25618] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:07:44.890 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-05-31 22:07:44.890 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421618,ok=421618,error=0, records=41
[INFO ] 2026-05-31 22:07:45.367 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21092/300s
[INFO ] 2026-05-31 22:07:49.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:07:52.910 [25634] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:07:59.896 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-05-31 22:07:59.896 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421619,ok=421619,error=0, records=41
[INFO ] 2026-05-31 22:08:04.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:08:07.915 [25646] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:08:14.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-05-31 22:08:14.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421620,ok=421620,error=0, records=41
[INFO ] 2026-05-31 22:08:19.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:08:22.921 [25645] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:08:29.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-05-31 22:08:29.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421621,ok=421621,error=0, records=41
[INFO ] 2026-05-31 22:08:34.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:08:37.926 [25645] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:08:44.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 22:08:44.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421622,ok=421622,error=0, records=41
[INFO ] 2026-05-31 22:08:49.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:08:49.022 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 22:08:52.931 [25689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:08:59.918 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 22:08:59.918 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421623,ok=421623,error=0, records=41
[INFO ] 2026-05-31 22:09:04.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:09:07.936 [25696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:09:14.772 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17563/300s
[INFO ] 2026-05-31 22:09:14.774 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891440},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:09:14.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-05-31 22:09:14.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421624,ok=421624,error=0, records=41
[INFO ] 2026-05-31 22:09:14.966 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:09:14.967 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 22:09:14.967 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:09:14.967 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:09:14.967 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:09:15.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:09:15.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:09:15.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:09:15.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:09:15.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:09:15.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:09:19.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:09:22.941 [25679] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:09:29.928 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 22:09:29.928 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421625,ok=421625,error=0, records=41
[INFO ] 2026-05-31 22:09:34.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:09:37.946 [25745] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:09:44.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-05-31 22:09:44.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421626,ok=421626,error=0, records=41
[INFO ] 2026-05-31 22:09:44.937 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21083/300s
[INFO ] 2026-05-31 22:09:44.949 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21087/300s
[INFO ] 2026-05-31 22:09:49.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:09:52.952 [25761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:09:59.942 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 22:09:59.942 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421627,ok=421627,error=0, records=41
[INFO ] 2026-05-31 22:10:00.491 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21096/300s
[INFO ] 2026-05-31 22:10:04.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:10:07.957 [25781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:10:14.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-05-31 22:10:14.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421628,ok=421628,error=0, records=41
[INFO ] 2026-05-31 22:10:19.027 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:10:22.962 [25781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:10:29.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-05-31 22:10:29.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421629,ok=421629,error=0, records=41
[INFO ] 2026-05-31 22:10:34.027 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:10:37.966 [25751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:10:40.391 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21096/300s
[INFO ] 2026-05-31 22:10:44.358 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21083/300s
[INFO ] 2026-05-31 22:10:44.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 22:10:44.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421630,ok=421630,error=0, records=41
[INFO ] 2026-05-31 22:10:49.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:10:52.971 [25751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:10:59.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 22:10:59.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421631,ok=421631,error=0, records=41
[INFO ] 2026-05-31 22:11:04.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:11:07.975 [25823] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:11:14.971 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-05-31 22:11:14.971 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421632,ok=421632,error=0, records=41
[INFO ] 2026-05-31 22:11:19.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:11:22.981 [25851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:11:27.408 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21092/300s
[INFO ] 2026-05-31 22:11:29.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-05-31 22:11:29.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421633,ok=421633,error=0, records=41
[INFO ] 2026-05-31 22:11:34.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:11:37.985 [25865] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:11:44.982 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-05-31 22:11:44.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421634,ok=421634,error=0, records=41
[INFO ] 2026-05-31 22:11:49.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:11:52.991 [25879] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:11:59.989 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-05-31 22:11:59.989 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421635,ok=421635,error=0, records=41
[INFO ] 2026-05-31 22:12:04.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:12:04.031 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21095/300s
[WARN ] 2026-05-31 22:12:07.995 [25751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:12:14.968 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891368},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:12:14.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-05-31 22:12:14.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421636,ok=421636,error=0, records=41
[INFO ] 2026-05-31 22:12:15.121 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:12:15.122 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 22:12:15.122 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:12:15.122 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:12:15.122 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:12:15.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:12:15.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:12:15.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:12:15.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:12:15.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:12:15.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:12:19.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:12:23.000 [25851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:12:30.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 22:12:30.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421637,ok=421637,error=0, records=41
[INFO ] 2026-05-31 22:12:34.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:12:35.613 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21093/300s
[INFO ] 2026-05-31 22:12:37.515 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21093/300s
[WARN ] 2026-05-31 22:12:38.005 [25851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:12:45.014 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 22:12:45.014 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421638,ok=421638,error=0, records=41
[INFO ] 2026-05-31 22:12:45.419 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21093/300s
[INFO ] 2026-05-31 22:12:49.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:12:53.010 [25751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:13:00.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 22:13:00.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421639,ok=421639,error=0, records=41
[INFO ] 2026-05-31 22:13:04.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:13:08.016 [25696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:13:15.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-05-31 22:13:15.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421640,ok=421640,error=0, records=41
[INFO ] 2026-05-31 22:13:19.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:13:23.021 [25908] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:13:30.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-05-31 22:13:30.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421641,ok=421641,error=0, records=41
[INFO ] 2026-05-31 22:13:34.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 22:13:34.034 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 22:13:38.026 [25936] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:13:45.034 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-05-31 22:13:45.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421642,ok=421642,error=0, records=41
[INFO ] 2026-05-31 22:13:49.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:13:53.032 [25851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:14:00.039 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-05-31 22:14:00.039 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421643,ok=421643,error=0, records=41
[INFO ] 2026-05-31 22:14:04.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:14:08.036 [25964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:14:15.111 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-05-31 22:14:15.111 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421644,ok=421644,error=0, records=41
[INFO ] 2026-05-31 22:14:19.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:14:23.041 [25978] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:14:30.116 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-05-31 22:14:30.116 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421645,ok=421645,error=0, records=41
[INFO ] 2026-05-31 22:14:34.037 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:14:38.047 [25964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:14:45.050 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21088/300s
[INFO ] 2026-05-31 22:14:45.121 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-05-31 22:14:45.121 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421646,ok=421646,error=0, records=41
[INFO ] 2026-05-31 22:14:45.121 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21084/300s
[INFO ] 2026-05-31 22:14:49.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:14:53.053 [26048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:15:00.127 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-05-31 22:15:00.127 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421647,ok=421647,error=0, records=41
[INFO ] 2026-05-31 22:15:00.494 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21097/300s
[INFO ] 2026-05-31 22:15:04.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:15:07.557 [26072] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:15:15.122 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17564/300s
[INFO ] 2026-05-31 22:15:15.123 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891292},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:15:15.134 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-05-31 22:15:15.134 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421648,ok=421648,error=0, records=41
[INFO ] 2026-05-31 22:15:15.296 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:15:15.296 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 22:15:19.039 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:15:22.563 [26092] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:15:30.141 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 22:15:30.141 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421649,ok=421649,error=0, records=41
[INFO ] 2026-05-31 22:15:34.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:15:37.567 [26104] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:15:40.397 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21097/300s
[INFO ] 2026-05-31 22:15:44.540 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21084/300s
[INFO ] 2026-05-31 22:15:45.146 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-05-31 22:15:45.146 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421650,ok=421650,error=0, records=41
[INFO ] 2026-05-31 22:15:49.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:15:52.571 [26129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:16:00.151 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-05-31 22:16:00.151 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421651,ok=421651,error=0, records=41
[INFO ] 2026-05-31 22:16:04.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:16:07.577 [26116] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:16:15.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-05-31 22:16:15.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421652,ok=421652,error=0, records=41
[INFO ] 2026-05-31 22:16:19.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:16:22.582 [26155] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:16:27.462 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21093/300s
[INFO ] 2026-05-31 22:16:30.162 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-05-31 22:16:30.162 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421653,ok=421653,error=0, records=41
[INFO ] 2026-05-31 22:16:34.042 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:16:37.588 [26193] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:16:45.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-05-31 22:16:45.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421654,ok=421654,error=0, records=41
[INFO ] 2026-05-31 22:16:49.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:16:52.593 [26155] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:17:00.173 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-05-31 22:17:00.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421655,ok=421655,error=0, records=41
[INFO ] 2026-05-31 22:17:04.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:17:04.043 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21096/300s
[WARN ] 2026-05-31 22:17:07.598 [26169] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:17:15.179 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-05-31 22:17:15.179 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421656,ok=421656,error=0, records=41
[INFO ] 2026-05-31 22:17:19.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:17:22.603 [26187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:17:30.184 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-05-31 22:17:30.184 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421657,ok=421657,error=0, records=41
[INFO ] 2026-05-31 22:17:34.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:17:35.668 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21094/300s
[INFO ] 2026-05-31 22:17:37.570 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21094/300s
[WARN ] 2026-05-31 22:17:37.609 [26217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:17:45.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 22:17:45.189 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421658,ok=421658,error=0, records=41
[INFO ] 2026-05-31 22:17:45.476 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21094/300s
[INFO ] 2026-05-31 22:17:49.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:17:52.614 [26169] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:18:00.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-05-31 22:18:00.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421659,ok=421659,error=0, records=41
[INFO ] 2026-05-31 22:18:04.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:18:07.619 [26217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:18:15.200 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-05-31 22:18:15.200 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421660,ok=421660,error=0, records=41
[INFO ] 2026-05-31 22:18:15.298 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891216},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:18:15.451 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:18:15.452 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-05-31 22:18:15.452 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:18:15.452 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:18:15.452 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:18:15.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:18:15.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:18:15.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:18:15.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:18:15.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:18:15.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:18:19.046 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:18:22.624 [26217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:18:30.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 22:18:30.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421661,ok=421661,error=0, records=41
[INFO ] 2026-05-31 22:18:34.047 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:18:37.629 [26222] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:18:45.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 22:18:45.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421662,ok=421662,error=0, records=41
[INFO ] 2026-05-31 22:18:49.047 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:18:52.635 [26188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:19:00.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-05-31 22:19:00.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421663,ok=421663,error=0, records=41
[INFO ] 2026-05-31 22:19:04.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:19:07.640 [26188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:19:15.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-05-31 22:19:15.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421664,ok=421664,error=0, records=41
[INFO ] 2026-05-31 22:19:19.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:19:22.645 [26187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:19:30.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-05-31 22:19:30.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421665,ok=421665,error=0, records=41
[INFO ] 2026-05-31 22:19:34.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:19:37.651 [26188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:19:45.153 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21089/300s
[INFO ] 2026-05-31 22:19:45.228 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-05-31 22:19:45.228 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421666,ok=421666,error=0, records=41
[INFO ] 2026-05-31 22:19:45.228 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21085/300s
[INFO ] 2026-05-31 22:19:49.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:19:52.656 [26187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:20:00.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-05-31 22:20:00.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421667,ok=421667,error=0, records=41
[INFO ] 2026-05-31 22:20:00.497 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21098/300s
[INFO ] 2026-05-31 22:20:04.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:20:07.662 [26187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:20:15.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-05-31 22:20:15.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421668,ok=421668,error=0, records=41
[INFO ] 2026-05-31 22:20:19.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:20:22.667 [26222] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:20:30.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-05-31 22:20:30.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421669,ok=421669,error=0, records=41
[INFO ] 2026-05-31 22:20:34.051 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:20:37.672 [26217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:20:40.403 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21098/300s
[INFO ] 2026-05-31 22:20:44.716 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21085/300s
[INFO ] 2026-05-31 22:20:45.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 22:20:45.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421670,ok=421670,error=0, records=41
[INFO ] 2026-05-31 22:20:49.052 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:20:52.676 [26217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:21:00.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 22:21:00.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421671,ok=421671,error=0, records=41
[INFO ] 2026-05-31 22:21:04.052 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:21:07.681 [26188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:21:15.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-05-31 22:21:15.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421672,ok=421672,error=0, records=41
[INFO ] 2026-05-31 22:21:15.452 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17565/300s
[INFO ] 2026-05-31 22:21:15.453 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891136},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:21:15.628 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:21:15.628 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 22:21:15.628 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:21:15.628 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:21:15.628 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:21:15.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:21:15.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:21:15.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:21:15.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:21:15.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:21:15.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:21:19.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:21:22.686 [26187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:21:27.511 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21094/300s
[INFO ] 2026-05-31 22:21:30.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-05-31 22:21:30.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421673,ok=421673,error=0, records=41
[INFO ] 2026-05-31 22:21:34.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:21:37.691 [26217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:21:45.276 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 22:21:45.276 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421674,ok=421674,error=0, records=41
[INFO ] 2026-05-31 22:21:49.054 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:21:52.696 [26217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:22:00.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 22:22:00.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421675,ok=421675,error=0, records=41
[INFO ] 2026-05-31 22:22:04.055 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:22:04.055 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21097/300s
[WARN ] 2026-05-31 22:22:07.702 [26187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:22:15.308 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-05-31 22:22:15.308 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421676,ok=421676,error=0, records=41
[INFO ] 2026-05-31 22:22:19.055 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:22:22.707 [26222] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:22:30.313 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 22:22:30.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421677,ok=421677,error=0, records=41
[INFO ] 2026-05-31 22:22:34.056 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:22:35.695 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21095/300s
[INFO ] 2026-05-31 22:22:37.596 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21095/300s
[WARN ] 2026-05-31 22:22:37.712 [26187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:22:45.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-05-31 22:22:45.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421678,ok=421678,error=0, records=41
[INFO ] 2026-05-31 22:22:45.500 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21095/300s
[INFO ] 2026-05-31 22:22:49.056 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:22:52.718 [26217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:23:00.323 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-05-31 22:23:00.323 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421679,ok=421679,error=0, records=41
[INFO ] 2026-05-31 22:23:04.057 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:23:07.724 [26222] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:23:15.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 22:23:15.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421680,ok=421680,error=0, records=41
[INFO ] 2026-05-31 22:23:19.057 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:23:22.729 [26222] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:23:30.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 22:23:30.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421681,ok=421681,error=0, records=41
[INFO ] 2026-05-31 22:23:34.058 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 22:23:34.058 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 22:23:37.735 [26169] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:23:45.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 22:23:45.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421682,ok=421682,error=0, records=41
[INFO ] 2026-05-31 22:23:49.058 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:23:49.058 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 22:23:52.741 [26169] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:24:00.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-05-31 22:24:00.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421683,ok=421683,error=0, records=41
[INFO ] 2026-05-31 22:24:04.060 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:24:07.747 [26169] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:24:15.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10151, records=41
[INFO ] 2026-05-31 22:24:15.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421684,ok=421684,error=0, records=41
[INFO ] 2026-05-31 22:24:15.630 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20891064},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:24:15.792 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:24:15.792 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-05-31 22:24:15.792 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:24:15.792 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:24:15.792 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:24:15.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:24:15.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:24:15.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:24:15.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:24:15.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:24:15.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:24:19.060 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:24:22.752 [26222] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:24:30.356 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10109, records=41
[INFO ] 2026-05-31 22:24:30.356 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421685,ok=421685,error=0, records=41
[INFO ] 2026-05-31 22:24:34.061 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:24:37.758 [26187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:24:45.260 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21090/300s
[INFO ] 2026-05-31 22:24:45.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10081, records=41
[INFO ] 2026-05-31 22:24:45.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421686,ok=421686,error=0, records=41
[INFO ] 2026-05-31 22:24:45.361 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21086/300s
[INFO ] 2026-05-31 22:24:49.061 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:24:52.763 [26222] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:25:00.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10088, records=41
[INFO ] 2026-05-31 22:25:00.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421687,ok=421687,error=0, records=41
[INFO ] 2026-05-31 22:25:00.500 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21099/300s
[INFO ] 2026-05-31 22:25:04.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:25:07.769 [26187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:25:15.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-05-31 22:25:15.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421688,ok=421688,error=0, records=41
[INFO ] 2026-05-31 22:25:19.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:25:22.774 [26217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:25:30.377 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-05-31 22:25:30.377 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421689,ok=421689,error=0, records=41
[INFO ] 2026-05-31 22:25:34.063 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:25:37.781 [26217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:25:40.409 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21099/300s
[INFO ] 2026-05-31 22:25:44.891 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21086/300s
[INFO ] 2026-05-31 22:25:45.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-05-31 22:25:45.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421690,ok=421690,error=0, records=41
[INFO ] 2026-05-31 22:25:49.064 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:25:52.786 [26187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:26:00.389 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-05-31 22:26:00.389 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421691,ok=421691,error=0, records=41
[INFO ] 2026-05-31 22:26:04.064 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:26:07.791 [26188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:26:15.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-05-31 22:26:15.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421692,ok=421692,error=0, records=41
[INFO ] 2026-05-31 22:26:19.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:26:22.797 [26222] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:26:27.557 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21095/300s
[INFO ] 2026-05-31 22:26:30.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-05-31 22:26:30.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421693,ok=421693,error=0, records=41
[INFO ] 2026-05-31 22:26:34.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:26:37.802 [26187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:26:45.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-05-31 22:26:45.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421694,ok=421694,error=0, records=41
[INFO ] 2026-05-31 22:26:49.066 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:26:52.808 [26731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:27:00.413 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-05-31 22:27:00.413 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421695,ok=421695,error=0, records=41
[INFO ] 2026-05-31 22:27:04.066 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:27:04.066 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21098/300s
[WARN ] 2026-05-31 22:27:07.812 [26756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:27:15.418 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-05-31 22:27:15.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421696,ok=421696,error=0, records=41
[INFO ] 2026-05-31 22:27:15.792 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17566/300s
[INFO ] 2026-05-31 22:27:15.794 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890984},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:27:15.962 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:27:15.962 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-05-31 22:27:15.962 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:27:15.962 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:27:15.962 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:27:16.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:27:16.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:27:16.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:27:16.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:27:16.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:27:16.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:27:19.067 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:27:22.817 [26761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:27:30.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-05-31 22:27:30.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421697,ok=421697,error=0, records=41
[INFO ] 2026-05-31 22:27:34.068 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:27:35.708 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21096/300s
[INFO ] 2026-05-31 22:27:37.609 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21096/300s
[WARN ] 2026-05-31 22:27:37.822 [26791] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:27:45.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 22:27:45.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421698,ok=421698,error=0, records=41
[INFO ] 2026-05-31 22:27:45.516 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21096/300s
[INFO ] 2026-05-31 22:27:49.068 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:27:52.826 [26188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:28:00.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 22:28:00.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421699,ok=421699,error=0, records=41
[INFO ] 2026-05-31 22:28:04.069 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:28:07.831 [26761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:28:15.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 22:28:15.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421700,ok=421700,error=0, records=41
[INFO ] 2026-05-31 22:28:19.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:28:22.837 [26761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:28:30.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-05-31 22:28:30.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421701,ok=421701,error=0, records=41
[INFO ] 2026-05-31 22:28:34.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:28:37.842 [26746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:28:45.494 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-05-31 22:28:45.494 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421702,ok=421702,error=0, records=41
[INFO ] 2026-05-31 22:28:49.071 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:28:52.847 [26746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:29:00.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-05-31 22:29:00.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421703,ok=421703,error=0, records=41
[INFO ] 2026-05-31 22:29:04.071 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:29:07.853 [26833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:29:15.511 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 22:29:15.511 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421704,ok=421704,error=0, records=41
[INFO ] 2026-05-31 22:29:19.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:29:22.859 [26871] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:29:30.519 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-05-31 22:29:30.519 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421705,ok=421705,error=0, records=41
[INFO ] 2026-05-31 22:29:34.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:29:37.864 [26871] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:29:45.367 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21091/300s
[INFO ] 2026-05-31 22:29:45.524 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-05-31 22:29:45.525 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421706,ok=421706,error=0, records=41
[INFO ] 2026-05-31 22:29:45.525 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21087/300s
[INFO ] 2026-05-31 22:29:49.073 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:29:52.870 [26871] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:30:00.503 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21100/300s
[INFO ] 2026-05-31 22:30:00.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-05-31 22:30:00.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421707,ok=421707,error=0, records=41
[INFO ] 2026-05-31 22:30:04.073 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:30:07.875 [26932] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:30:15.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-05-31 22:30:15.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421708,ok=421708,error=0, records=41
[INFO ] 2026-05-31 22:30:15.963 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890904},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:30:16.120 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:30:16.120 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 22:30:16.120 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:30:16.120 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:30:16.120 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:30:16.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:30:16.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:30:16.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:30:16.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:30:16.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:30:16.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:30:19.074 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:30:22.882 [26833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:30:30.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 22:30:30.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421709,ok=421709,error=0, records=41
[INFO ] 2026-05-31 22:30:34.074 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:30:37.888 [26965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:30:40.415 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21100/300s
[INFO ] 2026-05-31 22:30:45.068 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21087/300s
[INFO ] 2026-05-31 22:30:45.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-05-31 22:30:45.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421710,ok=421710,error=0, records=41
[INFO ] 2026-05-31 22:30:49.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:30:52.893 [26976] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:31:00.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 22:31:00.552 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421711,ok=421711,error=0, records=41
[INFO ] 2026-05-31 22:31:04.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:31:07.899 [26982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:31:15.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-05-31 22:31:15.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421712,ok=421712,error=0, records=41
[INFO ] 2026-05-31 22:31:19.076 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:31:22.905 [26949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:31:27.602 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21096/300s
[INFO ] 2026-05-31 22:31:30.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 22:31:30.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421713,ok=421713,error=0, records=41
[INFO ] 2026-05-31 22:31:34.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:31:37.910 [27036] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:31:45.568 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 22:31:45.568 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421714,ok=421714,error=0, records=41
[INFO ] 2026-05-31 22:31:49.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:31:52.915 [27052] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:32:00.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 22:32:00.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421715,ok=421715,error=0, records=41
[INFO ] 2026-05-31 22:32:04.078 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:32:04.078 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21099/300s
[WARN ] 2026-05-31 22:32:07.922 [27069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:32:15.580 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 22:32:15.580 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421716,ok=421716,error=0, records=41
[INFO ] 2026-05-31 22:32:19.078 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:32:22.927 [27031] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:32:30.585 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 22:32:30.585 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421717,ok=421717,error=0, records=41
[INFO ] 2026-05-31 22:32:34.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:32:35.720 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21097/300s
[INFO ] 2026-05-31 22:32:37.621 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21097/300s
[WARN ] 2026-05-31 22:32:37.933 [27092] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:32:45.525 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21097/300s
[INFO ] 2026-05-31 22:32:45.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-05-31 22:32:45.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421718,ok=421718,error=0, records=41
[INFO ] 2026-05-31 22:32:49.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:32:52.939 [27115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:33:00.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-05-31 22:33:00.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421719,ok=421719,error=0, records=41
[INFO ] 2026-05-31 22:33:04.080 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:33:07.945 [27136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:33:15.599 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-05-31 22:33:15.599 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421720,ok=421720,error=0, records=41
[INFO ] 2026-05-31 22:33:16.121 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17567/300s
[INFO ] 2026-05-31 22:33:16.122 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890824},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:33:16.291 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:33:16.292 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 22:33:16.292 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:33:16.292 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:33:16.292 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:33:16.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:33:16.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:33:16.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:33:16.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:33:16.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:33:16.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:33:19.081 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:33:22.951 [27130] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:33:30.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 22:33:30.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421721,ok=421721,error=0, records=41
[INFO ] 2026-05-31 22:33:34.081 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 22:33:34.081 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 22:33:37.956 [27087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:33:45.610 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-05-31 22:33:45.610 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421722,ok=421722,error=0, records=41
[INFO ] 2026-05-31 22:33:49.082 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:33:52.962 [27087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:34:00.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-05-31 22:34:00.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421723,ok=421723,error=0, records=41
[INFO ] 2026-05-31 22:34:04.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:34:07.966 [27176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:34:15.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 22:34:15.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421724,ok=421724,error=0, records=41
[INFO ] 2026-05-31 22:34:19.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:34:22.970 [27205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:34:30.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-05-31 22:34:30.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421725,ok=421725,error=0, records=41
[INFO ] 2026-05-31 22:34:34.084 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:34:37.976 [27205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:34:45.478 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21092/300s
[INFO ] 2026-05-31 22:34:45.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-05-31 22:34:45.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421726,ok=421726,error=0, records=41
[INFO ] 2026-05-31 22:34:45.639 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21088/300s
[INFO ] 2026-05-31 22:34:49.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:34:52.980 [27087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:35:00.506 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21101/300s
[INFO ] 2026-05-31 22:35:00.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-05-31 22:35:00.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421727,ok=421727,error=0, records=41
[INFO ] 2026-05-31 22:35:04.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:35:07.986 [27205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:35:15.648 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-05-31 22:35:15.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421728,ok=421728,error=0, records=41
[INFO ] 2026-05-31 22:35:19.086 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:35:22.990 [27265] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:35:30.659 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-05-31 22:35:30.659 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421729,ok=421729,error=0, records=41
[INFO ] 2026-05-31 22:35:34.086 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:35:37.995 [27265] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:35:40.421 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21101/300s
[INFO ] 2026-05-31 22:35:45.244 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21088/300s
[INFO ] 2026-05-31 22:35:45.663 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-05-31 22:35:45.663 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421730,ok=421730,error=0, records=41
[INFO ] 2026-05-31 22:35:49.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:35:53.000 [27292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:36:00.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-05-31 22:36:00.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421731,ok=421731,error=0, records=41
[INFO ] 2026-05-31 22:36:04.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:36:08.005 [27306] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:36:15.673 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-05-31 22:36:15.673 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421732,ok=421732,error=0, records=41
[INFO ] 2026-05-31 22:36:16.293 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890740},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:36:16.456 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:36:16.456 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 22:36:16.456 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:36:16.456 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:36:16.456 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:36:16.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:36:16.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:36:16.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:36:16.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:36:16.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:36:16.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:36:19.088 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:36:23.009 [27232] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:36:27.654 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21097/300s
[INFO ] 2026-05-31 22:36:30.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-05-31 22:36:30.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421733,ok=421733,error=0, records=41
[INFO ] 2026-05-31 22:36:34.088 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:36:38.014 [27247] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:36:45.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-05-31 22:36:45.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421734,ok=421734,error=0, records=41
[INFO ] 2026-05-31 22:36:49.089 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:36:53.019 [27350] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:37:00.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-05-31 22:37:00.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421735,ok=421735,error=0, records=41
[INFO ] 2026-05-31 22:37:04.089 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:37:04.090 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21100/300s
[WARN ] 2026-05-31 22:37:08.024 [27306] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:37:15.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-05-31 22:37:15.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421736,ok=421736,error=0, records=41
[INFO ] 2026-05-31 22:37:19.090 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:37:23.030 [27364] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:37:30.717 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-05-31 22:37:30.717 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421737,ok=421737,error=0, records=41
[INFO ] 2026-05-31 22:37:34.091 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:37:35.747 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21098/300s
[INFO ] 2026-05-31 22:37:37.649 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21098/300s
[WARN ] 2026-05-31 22:37:38.035 [27322] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:37:45.555 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21098/300s
[INFO ] 2026-05-31 22:37:45.722 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 22:37:45.722 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421738,ok=421738,error=0, records=41
[INFO ] 2026-05-31 22:37:49.091 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:37:53.041 [27408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:38:00.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-05-31 22:38:00.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421739,ok=421739,error=0, records=41
[INFO ] 2026-05-31 22:38:04.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:38:08.047 [27364] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:38:15.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 22:38:15.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421740,ok=421740,error=0, records=41
[INFO ] 2026-05-31 22:38:19.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:38:23.052 [27418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:38:30.746 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-05-31 22:38:30.746 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421741,ok=421741,error=0, records=41
[INFO ] 2026-05-31 22:38:34.093 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:38:37.557 [27364] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:38:45.750 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-05-31 22:38:45.750 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421742,ok=421742,error=0, records=41
[INFO ] 2026-05-31 22:38:49.094 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:38:49.094 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 22:38:52.563 [27430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:39:00.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-05-31 22:39:00.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421743,ok=421743,error=0, records=41
[INFO ] 2026-05-31 22:39:04.095 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:39:07.569 [27497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:39:15.761 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-05-31 22:39:15.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421744,ok=421744,error=0, records=41
[INFO ] 2026-05-31 22:39:16.456 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17568/300s
[INFO ] 2026-05-31 22:39:16.458 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890652},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:39:16.663 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:39:16.663 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-05-31 22:39:16.663 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:39:16.663 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:39:16.663 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:39:16.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:39:16.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:39:16.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:39:16.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:39:16.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:39:16.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:39:19.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:39:22.574 [27496] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:39:30.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-05-31 22:39:30.767 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421745,ok=421745,error=0, records=41
[INFO ] 2026-05-31 22:39:34.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:39:37.578 [27364] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:39:45.580 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21093/300s
[INFO ] 2026-05-31 22:39:45.772 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 22:39:45.772 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421746,ok=421746,error=0, records=41
[INFO ] 2026-05-31 22:39:45.772 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21089/300s
[INFO ] 2026-05-31 22:39:49.097 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:39:52.582 [27532] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:40:00.509 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21102/300s
[INFO ] 2026-05-31 22:40:00.778 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-05-31 22:40:00.778 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421747,ok=421747,error=0, records=41
[INFO ] 2026-05-31 22:40:04.097 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:40:07.587 [27557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:40:15.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-05-31 22:40:15.783 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421748,ok=421748,error=0, records=41
[INFO ] 2026-05-31 22:40:19.098 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:40:22.592 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:40:30.787 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-05-31 22:40:30.787 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421749,ok=421749,error=0, records=41
[INFO ] 2026-05-31 22:40:34.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:40:37.600 [27572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:40:40.428 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21102/300s
[INFO ] 2026-05-31 22:40:45.421 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21089/300s
[INFO ] 2026-05-31 22:40:45.823 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 22:40:45.823 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421750,ok=421750,error=0, records=41
[INFO ] 2026-05-31 22:40:49.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:40:52.605 [27601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:41:00.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-05-31 22:41:00.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421751,ok=421751,error=0, records=41
[INFO ] 2026-05-31 22:41:04.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:41:07.610 [27601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:41:15.836 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-05-31 22:41:15.836 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421752,ok=421752,error=0, records=41
[INFO ] 2026-05-31 22:41:19.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:41:22.615 [27591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:41:27.705 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21098/300s
[INFO ] 2026-05-31 22:41:30.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-05-31 22:41:30.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421753,ok=421753,error=0, records=41
[INFO ] 2026-05-31 22:41:34.101 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:41:37.620 [27591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:41:45.849 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-05-31 22:41:45.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421754,ok=421754,error=0, records=41
[INFO ] 2026-05-31 22:41:49.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:41:52.625 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:42:00.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-05-31 22:42:00.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421755,ok=421755,error=0, records=41
[INFO ] 2026-05-31 22:42:04.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:42:04.102 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21101/300s
[WARN ] 2026-05-31 22:42:07.630 [27601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:42:15.862 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 22:42:15.862 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421756,ok=421756,error=0, records=41
[INFO ] 2026-05-31 22:42:16.665 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890580},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:42:16.826 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:42:16.826 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 22:42:16.826 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:42:16.826 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:42:16.826 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:42:16.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:42:16.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:42:16.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:42:16.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:42:16.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:42:16.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:42:19.103 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:42:22.636 [27601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:42:30.868 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 22:42:30.868 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421757,ok=421757,error=0, records=41
[INFO ] 2026-05-31 22:42:34.103 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:42:35.793 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21099/300s
[WARN ] 2026-05-31 22:42:37.642 [27557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:42:37.694 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21099/300s
[INFO ] 2026-05-31 22:42:45.598 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21099/300s
[INFO ] 2026-05-31 22:42:45.873 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-05-31 22:42:45.873 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421758,ok=421758,error=0, records=41
[INFO ] 2026-05-31 22:42:49.104 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:42:52.647 [27557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:43:00.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-05-31 22:43:00.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421759,ok=421759,error=0, records=41
[INFO ] 2026-05-31 22:43:04.105 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:43:07.652 [27591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:43:15.884 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-05-31 22:43:15.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421760,ok=421760,error=0, records=41
[INFO ] 2026-05-31 22:43:19.105 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:43:22.658 [27572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:43:30.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-05-31 22:43:30.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421761,ok=421761,error=0, records=41
[INFO ] 2026-05-31 22:43:34.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 22:43:34.106 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 22:43:37.663 [27601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:43:45.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-05-31 22:43:45.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421762,ok=421762,error=0, records=41
[INFO ] 2026-05-31 22:43:49.107 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:43:52.669 [27572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:44:00.928 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 22:44:00.929 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421763,ok=421763,error=0, records=41
[INFO ] 2026-05-31 22:44:04.107 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:44:07.674 [27557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:44:15.934 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-05-31 22:44:15.934 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421764,ok=421764,error=0, records=41
[INFO ] 2026-05-31 22:44:19.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:44:22.679 [27601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:44:30.939 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 22:44:30.939 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421765,ok=421765,error=0, records=41
[INFO ] 2026-05-31 22:44:34.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:44:37.684 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:44:45.687 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21094/300s
[INFO ] 2026-05-31 22:44:45.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-05-31 22:44:45.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421766,ok=421766,error=0, records=41
[INFO ] 2026-05-31 22:44:45.943 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21090/300s
[INFO ] 2026-05-31 22:44:49.109 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:44:52.690 [27572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:45:00.513 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21103/300s
[INFO ] 2026-05-31 22:45:00.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-05-31 22:45:00.950 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421767,ok=421767,error=0, records=41
[INFO ] 2026-05-31 22:45:04.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:45:07.696 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:45:15.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-05-31 22:45:15.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421768,ok=421768,error=0, records=41
[INFO ] 2026-05-31 22:45:16.826 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17569/300s
[INFO ] 2026-05-31 22:45:16.827 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890500},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:45:16.992 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:45:16.993 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 22:45:16.993 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:45:16.993 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:45:16.993 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:45:17.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:45:17.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:45:17.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:45:17.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:45:17.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:45:17.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:45:19.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:45:22.701 [27601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:45:30.962 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-05-31 22:45:30.962 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421769,ok=421769,error=0, records=41
[INFO ] 2026-05-31 22:45:34.111 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:45:37.705 [27557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:45:40.434 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21103/300s
[INFO ] 2026-05-31 22:45:45.603 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21090/300s
[INFO ] 2026-05-31 22:45:45.967 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-05-31 22:45:45.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421770,ok=421770,error=0, records=41
[INFO ] 2026-05-31 22:45:49.111 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:45:52.711 [27601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:46:00.974 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 22:46:00.974 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421771,ok=421771,error=0, records=41
[INFO ] 2026-05-31 22:46:04.112 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:46:07.715 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:46:15.980 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-05-31 22:46:15.980 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421772,ok=421772,error=0, records=41
[INFO ] 2026-05-31 22:46:19.113 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:46:22.720 [27557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:46:27.757 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21099/300s
[INFO ] 2026-05-31 22:46:30.986 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 22:46:30.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421773,ok=421773,error=0, records=41
[INFO ] 2026-05-31 22:46:34.113 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:46:37.726 [27557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:46:45.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-05-31 22:46:45.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421774,ok=421774,error=0, records=41
[INFO ] 2026-05-31 22:46:49.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:46:52.732 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:47:00.997 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-05-31 22:47:00.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421775,ok=421775,error=0, records=41
[INFO ] 2026-05-31 22:47:04.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:47:04.114 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21102/300s
[WARN ] 2026-05-31 22:47:07.737 [27601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:47:16.005 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-05-31 22:47:16.005 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421776,ok=421776,error=0, records=41
[INFO ] 2026-05-31 22:47:19.115 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:47:22.743 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:47:31.011 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-05-31 22:47:31.011 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421777,ok=421777,error=0, records=41
[INFO ] 2026-05-31 22:47:34.116 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:47:35.838 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21100/300s
[INFO ] 2026-05-31 22:47:37.740 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21100/300s
[WARN ] 2026-05-31 22:47:37.750 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:47:45.647 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21100/300s
[INFO ] 2026-05-31 22:47:46.017 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 22:47:46.017 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421778,ok=421778,error=0, records=41
[INFO ] 2026-05-31 22:47:49.116 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:47:52.755 [27557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:48:01.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 22:48:01.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421779,ok=421779,error=0, records=41
[INFO ] 2026-05-31 22:48:04.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:48:07.760 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:48:16.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-05-31 22:48:16.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421780,ok=421780,error=0, records=41
[INFO ] 2026-05-31 22:48:16.995 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890420},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:48:17.151 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:48:17.151 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-05-31 22:48:17.151 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:48:17.151 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:48:17.151 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:48:17.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:48:17.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:48:17.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:48:17.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:48:17.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:48:17.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:48:19.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:48:22.764 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:48:31.034 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-05-31 22:48:31.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421781,ok=421781,error=0, records=41
[INFO ] 2026-05-31 22:48:34.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:48:37.770 [27601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:48:46.039 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-05-31 22:48:46.039 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421782,ok=421782,error=0, records=41
[INFO ] 2026-05-31 22:48:49.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:48:52.775 [27591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:49:01.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-05-31 22:49:01.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421783,ok=421783,error=0, records=41
[INFO ] 2026-05-31 22:49:04.119 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:49:07.781 [27572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:49:16.051 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 22:49:16.051 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421784,ok=421784,error=0, records=41
[INFO ] 2026-05-31 22:49:19.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:49:22.786 [27572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:49:31.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-05-31 22:49:31.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421785,ok=421785,error=0, records=41
[INFO ] 2026-05-31 22:49:34.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:49:37.790 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:49:45.793 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21095/300s
[INFO ] 2026-05-31 22:49:46.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-05-31 22:49:46.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421786,ok=421786,error=0, records=41
[INFO ] 2026-05-31 22:49:46.107 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21091/300s
[INFO ] 2026-05-31 22:49:49.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:49:52.796 [27572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:50:00.516 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21104/300s
[INFO ] 2026-05-31 22:50:01.116 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-05-31 22:50:01.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421787,ok=421787,error=0, records=41
[INFO ] 2026-05-31 22:50:04.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:50:07.801 [27591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:50:16.123 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-05-31 22:50:16.123 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421788,ok=421788,error=0, records=41
[INFO ] 2026-05-31 22:50:19.122 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:50:22.806 [27591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:50:31.129 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-05-31 22:50:31.129 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421789,ok=421789,error=0, records=41
[INFO ] 2026-05-31 22:50:34.122 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:50:37.812 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:50:40.440 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21104/300s
[INFO ] 2026-05-31 22:50:45.781 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21091/300s
[INFO ] 2026-05-31 22:50:46.208 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-05-31 22:50:46.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421790,ok=421790,error=0, records=41
[INFO ] 2026-05-31 22:50:49.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:50:52.818 [27544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:51:01.212 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-05-31 22:51:01.212 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421791,ok=421791,error=0, records=41
[INFO ] 2026-05-31 22:51:04.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:51:07.823 [28140] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:51:16.218 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-05-31 22:51:16.218 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421792,ok=421792,error=0, records=41
[INFO ] 2026-05-31 22:51:17.151 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17570/300s
[INFO ] 2026-05-31 22:51:17.153 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890336},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:51:17.325 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:51:17.325 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 22:51:17.325 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:51:17.325 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:51:17.325 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:51:17.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:51:17.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:51:17.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:51:17.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:51:17.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:51:17.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:51:19.124 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:51:22.828 [28176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:51:27.805 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21100/300s
[INFO ] 2026-05-31 22:51:31.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-05-31 22:51:31.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421793,ok=421793,error=0, records=41
[INFO ] 2026-05-31 22:51:34.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:51:37.833 [28157] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:51:46.252 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-05-31 22:51:46.252 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421794,ok=421794,error=0, records=41
[INFO ] 2026-05-31 22:51:49.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:51:52.838 [28218] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:52:01.257 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-05-31 22:52:01.257 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421795,ok=421795,error=0, records=41
[INFO ] 2026-05-31 22:52:04.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:52:04.126 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21103/300s
[WARN ] 2026-05-31 22:52:07.844 [28191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:52:16.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-05-31 22:52:16.262 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421796,ok=421796,error=0, records=41
[INFO ] 2026-05-31 22:52:19.127 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:52:22.849 [28242] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:52:31.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 22:52:31.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421797,ok=421797,error=0, records=41
[INFO ] 2026-05-31 22:52:34.127 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:52:35.875 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21101/300s
[INFO ] 2026-05-31 22:52:37.777 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21101/300s
[WARN ] 2026-05-31 22:52:37.856 [28228] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:52:45.681 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21101/300s
[INFO ] 2026-05-31 22:52:46.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-05-31 22:52:46.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421798,ok=421798,error=0, records=41
[INFO ] 2026-05-31 22:52:49.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:52:52.863 [28162] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:53:01.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 22:53:01.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421799,ok=421799,error=0, records=41
[INFO ] 2026-05-31 22:53:04.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:53:07.868 [28157] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:53:16.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-05-31 22:53:16.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421800,ok=421800,error=0, records=41
[INFO ] 2026-05-31 22:53:19.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:53:22.872 [28300] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:53:31.291 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-05-31 22:53:31.291 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421801,ok=421801,error=0, records=41
[INFO ] 2026-05-31 22:53:34.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 22:53:34.129 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 22:53:37.877 [28320] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:53:46.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-05-31 22:53:46.296 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421802,ok=421802,error=0, records=41
[INFO ] 2026-05-31 22:53:49.130 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:53:49.130 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 22:53:52.882 [28331] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:54:01.301 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-05-31 22:54:01.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421803,ok=421803,error=0, records=41
[INFO ] 2026-05-31 22:54:04.131 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:54:07.889 [28353] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:54:16.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-05-31 22:54:16.306 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421804,ok=421804,error=0, records=41
[INFO ] 2026-05-31 22:54:17.326 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890260},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:54:17.488 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:54:17.489 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 22:54:17.490 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:54:17.490 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:54:17.490 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:54:17.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:54:17.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:54:17.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:54:17.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:54:17.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:54:17.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:54:19.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:54:22.893 [28162] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:54:31.312 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-05-31 22:54:31.312 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421805,ok=421805,error=0, records=41
[INFO ] 2026-05-31 22:54:34.133 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:54:37.898 [28366] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:54:45.901 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21096/300s
[INFO ] 2026-05-31 22:54:46.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-05-31 22:54:46.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421806,ok=421806,error=0, records=41
[INFO ] 2026-05-31 22:54:46.316 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21092/300s
[INFO ] 2026-05-31 22:54:49.133 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:54:52.905 [28399] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:55:00.519 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21105/300s
[INFO ] 2026-05-31 22:55:01.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-05-31 22:55:01.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421807,ok=421807,error=0, records=41
[INFO ] 2026-05-31 22:55:04.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:55:07.910 [28416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:55:16.327 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-05-31 22:55:16.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421808,ok=421808,error=0, records=41
[INFO ] 2026-05-31 22:55:19.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:55:22.915 [28427] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:55:31.332 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-05-31 22:55:31.332 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421809,ok=421809,error=0, records=41
[INFO ] 2026-05-31 22:55:34.135 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:55:37.920 [28422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:55:40.446 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21105/300s
[INFO ] 2026-05-31 22:55:45.959 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21092/300s
[INFO ] 2026-05-31 22:55:46.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 22:55:46.337 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421810,ok=421810,error=0, records=41
[INFO ] 2026-05-31 22:55:49.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:55:52.925 [28443] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:56:01.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 22:56:01.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421811,ok=421811,error=0, records=41
[INFO ] 2026-05-31 22:56:04.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:56:07.931 [28472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:56:16.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 22:56:16.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421812,ok=421812,error=0, records=41
[INFO ] 2026-05-31 22:56:19.137 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:56:22.937 [28484] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:56:27.858 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21101/300s
[INFO ] 2026-05-31 22:56:31.373 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-05-31 22:56:31.373 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421813,ok=421813,error=0, records=41
[INFO ] 2026-05-31 22:56:34.137 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:56:37.943 [28507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:56:46.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-05-31 22:56:46.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421814,ok=421814,error=0, records=41
[INFO ] 2026-05-31 22:56:49.138 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:56:52.950 [28534] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:57:01.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-05-31 22:57:01.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421815,ok=421815,error=0, records=41
[INFO ] 2026-05-31 22:57:04.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:57:04.139 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21104/300s
[WARN ] 2026-05-31 22:57:07.955 [28518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:57:16.482 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-05-31 22:57:16.482 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421816,ok=421816,error=0, records=41
[INFO ] 2026-05-31 22:57:17.490 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17571/300s
[INFO ] 2026-05-31 22:57:17.492 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890172},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 22:57:17.671 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 22:57:17.671 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 22:57:17.671 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 22:57:17.671 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 22:57:17.671 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 22:57:17.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:57:17.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 22:57:17.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:57:17.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 22:57:17.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:57:17.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 22:57:19.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:57:22.960 [28519] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:57:31.487 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-05-31 22:57:31.487 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421817,ok=421817,error=0, records=41
[INFO ] 2026-05-31 22:57:34.140 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 22:57:35.916 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21102/300s
[INFO ] 2026-05-31 22:57:37.818 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21102/300s
[WARN ] 2026-05-31 22:57:37.964 [28519] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:57:45.725 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21102/300s
[INFO ] 2026-05-31 22:57:46.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-05-31 22:57:46.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421818,ok=421818,error=0, records=41
[INFO ] 2026-05-31 22:57:49.140 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:57:52.969 [28580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:58:01.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-05-31 22:58:01.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421819,ok=421819,error=0, records=41
[INFO ] 2026-05-31 22:58:04.141 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:58:07.973 [28608] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:58:16.505 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 22:58:16.505 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421820,ok=421820,error=0, records=41
[INFO ] 2026-05-31 22:58:19.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:58:22.979 [28501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:58:31.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-05-31 22:58:31.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421821,ok=421821,error=0, records=41
[INFO ] 2026-05-31 22:58:34.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:58:37.984 [28622] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:58:46.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-05-31 22:58:46.517 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421822,ok=421822,error=0, records=41
[INFO ] 2026-05-31 22:58:49.143 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:58:52.989 [28622] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:59:01.521 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-05-31 22:59:01.521 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421823,ok=421823,error=0, records=41
[INFO ] 2026-05-31 22:59:04.143 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:59:07.994 [28663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:59:16.534 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-05-31 22:59:16.534 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421824,ok=421824,error=0, records=41
[INFO ] 2026-05-31 22:59:19.144 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:59:22.999 [28501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:59:31.541 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-05-31 22:59:31.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421825,ok=421825,error=0, records=41
[INFO ] 2026-05-31 22:59:34.145 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:59:38.004 [28692] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 22:59:46.006 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21097/300s
[INFO ] 2026-05-31 22:59:46.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-05-31 22:59:46.545 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421826,ok=421826,error=0, records=41
[INFO ] 2026-05-31 22:59:46.545 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21093/300s
[INFO ] 2026-05-31 22:59:49.145 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 22:59:53.009 [28663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:00:00.522 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21106/300s
[INFO ] 2026-05-31 23:00:01.570 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 23:00:01.570 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421827,ok=421827,error=0, records=41
[INFO ] 2026-05-31 23:00:04.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:00:08.015 [28725] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:00:16.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-05-31 23:00:16.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421828,ok=421828,error=0, records=41
[INFO ] 2026-05-31 23:00:17.673 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890084},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:00:17.847 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:00:17.847 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 23:00:17.848 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:00:17.848 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:00:17.848 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:00:17.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:00:17.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:00:17.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:00:17.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:00:17.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:00:17.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:00:19.147 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:00:23.020 [28519] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:00:31.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-05-31 23:00:31.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421829,ok=421829,error=0, records=41
[INFO ] 2026-05-31 23:00:34.147 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:00:38.025 [28706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:00:40.453 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21106/300s
[INFO ] 2026-05-31 23:00:46.139 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21093/300s
[INFO ] 2026-05-31 23:00:46.591 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-05-31 23:00:46.591 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421830,ok=421830,error=0, records=41
[INFO ] 2026-05-31 23:00:49.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:00:53.030 [28739] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:01:01.597 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-05-31 23:01:01.597 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421831,ok=421831,error=0, records=41
[INFO ] 2026-05-31 23:01:04.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:01:08.035 [28739] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:01:16.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-05-31 23:01:16.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421832,ok=421832,error=0, records=41
[INFO ] 2026-05-31 23:01:19.149 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:01:23.041 [28794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:01:27.909 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21102/300s
[INFO ] 2026-05-31 23:01:31.623 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-05-31 23:01:31.623 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421833,ok=421833,error=0, records=41
[INFO ] 2026-05-31 23:01:34.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:01:38.049 [28831] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:01:46.629 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-05-31 23:01:46.629 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421834,ok=421834,error=0, records=41
[INFO ] 2026-05-31 23:01:49.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:01:52.555 [28848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:02:01.636 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 23:02:01.636 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421835,ok=421835,error=0, records=41
[INFO ] 2026-05-31 23:02:04.151 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:02:04.151 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21105/300s
[WARN ] 2026-05-31 23:02:07.560 [28828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:02:16.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-05-31 23:02:16.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421836,ok=421836,error=0, records=41
[INFO ] 2026-05-31 23:02:19.152 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:02:22.565 [28879] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:02:31.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-05-31 23:02:31.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421837,ok=421837,error=0, records=41
[INFO ] 2026-05-31 23:02:34.152 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:02:35.963 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21103/300s
[WARN ] 2026-05-31 23:02:37.569 [28880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:02:37.864 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21103/300s
[INFO ] 2026-05-31 23:02:45.770 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21103/300s
[INFO ] 2026-05-31 23:02:46.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-05-31 23:02:46.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421838,ok=421838,error=0, records=41
[INFO ] 2026-05-31 23:02:49.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:02:52.574 [28883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:03:01.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 23:03:01.745 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421839,ok=421839,error=0, records=41
[INFO ] 2026-05-31 23:03:04.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:03:07.579 [28935] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:03:16.750 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-05-31 23:03:16.750 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421840,ok=421840,error=0, records=41
[INFO ] 2026-05-31 23:03:17.848 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17572/300s
[INFO ] 2026-05-31 23:03:17.849 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20890004},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:03:18.034 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:03:18.034 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 23:03:18.034 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:03:18.034 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:03:18.035 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:03:18.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:03:18.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:03:18.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:03:18.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:03:18.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:03:18.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:03:19.154 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:03:22.584 [28928] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:03:31.755 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-05-31 23:03:31.755 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421841,ok=421841,error=0, records=41
[INFO ] 2026-05-31 23:03:34.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 23:03:34.155 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 23:03:37.589 [28952] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:03:46.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-05-31 23:03:46.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421842,ok=421842,error=0, records=41
[INFO ] 2026-05-31 23:03:49.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:03:52.594 [28952] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:04:01.766 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-05-31 23:04:01.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421843,ok=421843,error=0, records=41
[INFO ] 2026-05-31 23:04:04.156 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:04:07.600 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:04:16.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-05-31 23:04:16.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421844,ok=421844,error=0, records=41
[INFO ] 2026-05-31 23:04:19.156 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:04:22.605 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:04:31.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-05-31 23:04:31.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421845,ok=421845,error=0, records=41
[INFO ] 2026-05-31 23:04:34.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:04:37.610 [28978] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:04:46.113 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21098/300s
[INFO ] 2026-05-31 23:04:46.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-05-31 23:04:46.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421846,ok=421846,error=0, records=41
[INFO ] 2026-05-31 23:04:46.790 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21094/300s
[INFO ] 2026-05-31 23:04:49.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:04:52.616 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:05:00.526 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21107/300s
[INFO ] 2026-05-31 23:05:01.795 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 23:05:01.795 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421847,ok=421847,error=0, records=41
[INFO ] 2026-05-31 23:05:04.158 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:05:07.622 [29003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:05:16.800 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-05-31 23:05:16.800 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421848,ok=421848,error=0, records=41
[INFO ] 2026-05-31 23:05:19.159 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:05:22.628 [28974] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:05:31.805 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-05-31 23:05:31.805 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421849,ok=421849,error=0, records=41
[INFO ] 2026-05-31 23:05:34.159 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:05:37.633 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:05:40.459 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21107/300s
[INFO ] 2026-05-31 23:05:46.320 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21094/300s
[INFO ] 2026-05-31 23:05:46.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-05-31 23:05:46.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421850,ok=421850,error=0, records=41
[INFO ] 2026-05-31 23:05:49.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:05:52.639 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:06:01.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-05-31 23:06:01.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421851,ok=421851,error=0, records=41
[INFO ] 2026-05-31 23:06:04.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:06:07.644 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:06:16.849 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 23:06:16.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421852,ok=421852,error=0, records=41
[INFO ] 2026-05-31 23:06:18.036 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889924},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:06:18.200 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:06:18.200 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-05-31 23:06:18.200 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:06:18.200 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:06:18.200 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:06:18.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:06:18.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:06:18.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:06:18.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:06:18.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:06:18.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:06:19.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:06:22.649 [29003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:06:27.962 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21103/300s
[INFO ] 2026-05-31 23:06:31.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-05-31 23:06:31.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421853,ok=421853,error=0, records=41
[INFO ] 2026-05-31 23:06:34.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:06:37.655 [29003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:06:46.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-05-31 23:06:46.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421854,ok=421854,error=0, records=41
[INFO ] 2026-05-31 23:06:49.162 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:06:52.660 [28978] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:07:01.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-05-31 23:07:01.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421855,ok=421855,error=0, records=41
[INFO ] 2026-05-31 23:07:04.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:07:04.163 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21106/300s
[WARN ] 2026-05-31 23:07:07.665 [28978] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:07:16.872 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-05-31 23:07:16.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421856,ok=421856,error=0, records=41
[INFO ] 2026-05-31 23:07:19.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:07:22.670 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:07:31.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 23:07:31.877 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421857,ok=421857,error=0, records=41
[INFO ] 2026-05-31 23:07:34.164 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:07:36.007 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21104/300s
[WARN ] 2026-05-31 23:07:37.675 [29003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:07:37.909 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21104/300s
[INFO ] 2026-05-31 23:07:45.816 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21104/300s
[INFO ] 2026-05-31 23:07:46.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 23:07:46.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421858,ok=421858,error=0, records=41
[INFO ] 2026-05-31 23:07:49.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:07:52.680 [28978] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:08:01.894 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-05-31 23:08:01.894 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421859,ok=421859,error=0, records=41
[INFO ] 2026-05-31 23:08:04.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:08:07.685 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:08:16.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-05-31 23:08:16.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421860,ok=421860,error=0, records=41
[INFO ] 2026-05-31 23:08:19.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:08:22.690 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:08:31.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-05-31 23:08:31.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421861,ok=421861,error=0, records=41
[INFO ] 2026-05-31 23:08:34.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:08:37.696 [29003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:08:46.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-05-31 23:08:46.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421862,ok=421862,error=0, records=41
[INFO ] 2026-05-31 23:08:49.167 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:08:49.167 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 23:08:52.701 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:09:01.921 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-05-31 23:09:01.921 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421863,ok=421863,error=0, records=41
[INFO ] 2026-05-31 23:09:04.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:09:07.706 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:09:16.927 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 23:09:16.927 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421864,ok=421864,error=0, records=41
[INFO ] 2026-05-31 23:09:18.200 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17573/300s
[INFO ] 2026-05-31 23:09:18.202 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889848},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:09:18.355 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:09:18.355 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 23:09:18.356 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:09:18.356 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:09:18.356 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:09:18.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:09:18.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:09:18.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:09:18.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:09:18.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:09:18.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:09:19.169 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:09:22.711 [28978] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:09:31.933 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-05-31 23:09:31.933 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421865,ok=421865,error=0, records=41
[INFO ] 2026-05-31 23:09:34.169 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:09:37.716 [29003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:09:46.218 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21099/300s
[INFO ] 2026-05-31 23:09:46.939 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-05-31 23:09:46.939 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421866,ok=421866,error=0, records=41
[INFO ] 2026-05-31 23:09:46.940 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21095/300s
[INFO ] 2026-05-31 23:09:49.170 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:09:52.722 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:10:00.529 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21108/300s
[INFO ] 2026-05-31 23:10:01.952 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-05-31 23:10:01.952 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421867,ok=421867,error=0, records=41
[INFO ] 2026-05-31 23:10:04.171 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:10:07.727 [28974] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:10:16.957 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-05-31 23:10:16.957 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421868,ok=421868,error=0, records=41
[INFO ] 2026-05-31 23:10:19.171 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:10:22.733 [29003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:10:31.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-05-31 23:10:31.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421869,ok=421869,error=0, records=41
[INFO ] 2026-05-31 23:10:34.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:10:37.738 [28974] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:10:40.465 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21108/300s
[INFO ] 2026-05-31 23:10:46.500 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21095/300s
[INFO ] 2026-05-31 23:10:46.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-05-31 23:10:46.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421870,ok=421870,error=0, records=41
[INFO ] 2026-05-31 23:10:49.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:10:52.743 [28978] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:11:02.084 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-05-31 23:11:02.084 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421871,ok=421871,error=0, records=41
[INFO ] 2026-05-31 23:11:04.173 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:11:07.749 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:11:17.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-05-31 23:11:17.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421872,ok=421872,error=0, records=41
[INFO ] 2026-05-31 23:11:19.173 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:11:22.754 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:11:28.013 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21104/300s
[INFO ] 2026-05-31 23:11:32.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-05-31 23:11:32.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421873,ok=421873,error=0, records=41
[INFO ] 2026-05-31 23:11:34.174 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:11:37.760 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:11:47.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-05-31 23:11:47.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421874,ok=421874,error=0, records=41
[INFO ] 2026-05-31 23:11:49.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:11:52.764 [28974] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:12:02.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-05-31 23:12:02.131 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421875,ok=421875,error=0, records=41
[INFO ] 2026-05-31 23:12:04.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:12:04.175 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21107/300s
[WARN ] 2026-05-31 23:12:07.770 [28974] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:12:17.136 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-05-31 23:12:17.136 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421876,ok=421876,error=0, records=41
[INFO ] 2026-05-31 23:12:18.357 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889764},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:12:18.514 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:12:18.514 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 23:12:18.514 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:12:18.514 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:12:18.514 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:12:18.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:12:18.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:12:18.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:12:18.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:12:18.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:12:18.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:12:19.176 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:12:22.774 [28974] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:12:32.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-05-31 23:12:32.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421877,ok=421877,error=0, records=41
[INFO ] 2026-05-31 23:12:34.176 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:12:36.053 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21105/300s
[WARN ] 2026-05-31 23:12:37.780 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:12:37.954 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21105/300s
[INFO ] 2026-05-31 23:12:45.859 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21105/300s
[INFO ] 2026-05-31 23:12:47.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 23:12:47.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421878,ok=421878,error=0, records=41
[INFO ] 2026-05-31 23:12:49.177 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:12:52.785 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:13:02.184 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-05-31 23:13:02.184 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421879,ok=421879,error=0, records=41
[INFO ] 2026-05-31 23:13:04.178 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:13:07.791 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:13:17.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-05-31 23:13:17.189 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421880,ok=421880,error=0, records=41
[INFO ] 2026-05-31 23:13:19.178 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:13:22.797 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:13:32.195 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-05-31 23:13:32.195 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421881,ok=421881,error=0, records=41
[INFO ] 2026-05-31 23:13:34.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 23:13:34.179 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 23:13:37.803 [28974] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:13:47.201 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-05-31 23:13:47.201 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421882,ok=421882,error=0, records=41
[INFO ] 2026-05-31 23:13:49.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:13:52.809 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:14:02.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 23:14:02.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421883,ok=421883,error=0, records=41
[INFO ] 2026-05-31 23:14:04.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:14:07.814 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:14:17.242 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-05-31 23:14:17.242 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421884,ok=421884,error=0, records=41
[INFO ] 2026-05-31 23:14:19.181 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:14:22.820 [29533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:14:32.247 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-05-31 23:14:32.247 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421885,ok=421885,error=0, records=41
[INFO ] 2026-05-31 23:14:34.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:14:37.826 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:14:46.330 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21100/300s
[INFO ] 2026-05-31 23:14:47.252 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-05-31 23:14:47.252 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421886,ok=421886,error=0, records=41
[INFO ] 2026-05-31 23:14:47.252 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21096/300s
[INFO ] 2026-05-31 23:14:49.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:14:52.833 [29548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:15:00.532 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21109/300s
[INFO ] 2026-05-31 23:15:02.271 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-05-31 23:15:02.271 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421887,ok=421887,error=0, records=41
[INFO ] 2026-05-31 23:15:04.183 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:15:07.839 [29548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:15:17.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 23:15:17.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421888,ok=421888,error=0, records=41
[INFO ] 2026-05-31 23:15:18.514 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17574/300s
[INFO ] 2026-05-31 23:15:18.516 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889688},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:15:18.689 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:15:18.689 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-05-31 23:15:18.689 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:15:18.689 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:15:18.689 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:15:18.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:15:18.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:15:18.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:15:18.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:15:18.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:15:18.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:15:19.183 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:15:22.845 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:15:32.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-05-31 23:15:32.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421889,ok=421889,error=0, records=41
[INFO ] 2026-05-31 23:15:34.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:15:37.850 [28988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:15:40.471 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21109/300s
[INFO ] 2026-05-31 23:15:46.675 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21096/300s
[INFO ] 2026-05-31 23:15:47.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 23:15:47.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421890,ok=421890,error=0, records=41
[INFO ] 2026-05-31 23:15:49.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:15:52.856 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:16:02.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 23:16:02.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421891,ok=421891,error=0, records=41
[INFO ] 2026-05-31 23:16:04.185 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:16:07.861 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:16:17.313 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-05-31 23:16:17.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421892,ok=421892,error=0, records=41
[INFO ] 2026-05-31 23:16:19.186 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:16:22.866 [29627] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:16:28.067 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21105/300s
[INFO ] 2026-05-31 23:16:32.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-05-31 23:16:32.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421893,ok=421893,error=0, records=41
[INFO ] 2026-05-31 23:16:34.186 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:16:37.870 [29627] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:16:47.326 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 23:16:47.326 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421894,ok=421894,error=0, records=41
[INFO ] 2026-05-31 23:16:49.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:16:52.876 [28993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:17:02.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-05-31 23:17:02.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421895,ok=421895,error=0, records=41
[INFO ] 2026-05-31 23:17:04.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:17:04.187 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21108/300s
[WARN ] 2026-05-31 23:17:07.881 [29704] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:17:17.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-05-31 23:17:17.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421896,ok=421896,error=0, records=41
[INFO ] 2026-05-31 23:17:19.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:17:22.886 [29732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:17:32.345 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-05-31 23:17:32.345 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421897,ok=421897,error=0, records=41
[INFO ] 2026-05-31 23:17:34.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:17:36.090 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21106/300s
[WARN ] 2026-05-31 23:17:37.891 [29754] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:17:37.992 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21106/300s
[INFO ] 2026-05-31 23:17:45.899 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21106/300s
[INFO ] 2026-05-31 23:17:47.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-05-31 23:17:47.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421898,ok=421898,error=0, records=41
[INFO ] 2026-05-31 23:17:49.189 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:17:52.896 [29771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:18:02.356 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 23:18:02.356 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421899,ok=421899,error=0, records=41
[INFO ] 2026-05-31 23:18:04.190 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:18:07.902 [29765] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:18:17.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-05-31 23:18:17.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421900,ok=421900,error=0, records=41
[INFO ] 2026-05-31 23:18:18.691 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889612},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:18:18.830 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:18:18.830 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 23:18:18.830 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:18:18.830 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:18:18.830 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:18:18.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:18:18.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:18:18.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:18:18.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:18:18.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:18:18.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:18:19.190 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:18:22.909 [29765] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:18:32.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-05-31 23:18:32.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421901,ok=421901,error=0, records=41
[INFO ] 2026-05-31 23:18:34.191 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:18:37.915 [29814] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:18:47.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-05-31 23:18:47.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421902,ok=421902,error=0, records=41
[INFO ] 2026-05-31 23:18:49.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:18:52.921 [29781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:19:02.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-05-31 23:19:02.379 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421903,ok=421903,error=0, records=41
[INFO ] 2026-05-31 23:19:04.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:19:07.926 [29847] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:19:17.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-05-31 23:19:17.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421904,ok=421904,error=0, records=41
[INFO ] 2026-05-31 23:19:19.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:19:22.931 [29869] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:19:32.480 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-05-31 23:19:32.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421905,ok=421905,error=0, records=41
[INFO ] 2026-05-31 23:19:34.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:19:37.937 [29814] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:19:46.440 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21101/300s
[INFO ] 2026-05-31 23:19:47.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=13324, records=49
[INFO ] 2026-05-31 23:19:47.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421906,ok=421906,error=0, records=49
[INFO ] 2026-05-31 23:19:47.486 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21097/300s
[INFO ] 2026-05-31 23:19:49.194 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:19:52.942 [29895] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:20:00.535 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21110/300s
[INFO ] 2026-05-31 23:20:02.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-05-31 23:20:02.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421907,ok=421907,error=0, records=41
[INFO ] 2026-05-31 23:20:04.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:20:07.948 [29916] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:20:17.497 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-05-31 23:20:17.497 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421908,ok=421908,error=0, records=41
[INFO ] 2026-05-31 23:20:19.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:20:22.953 [29880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:20:32.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-05-31 23:20:32.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421909,ok=421909,error=0, records=41
[INFO ] 2026-05-31 23:20:34.196 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:20:37.958 [29880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:20:40.477 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21110/300s
[INFO ] 2026-05-31 23:20:46.856 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21097/300s
[INFO ] 2026-05-31 23:20:47.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-05-31 23:20:47.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421910,ok=421910,error=0, records=41
[INFO ] 2026-05-31 23:20:49.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:20:52.963 [29945] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:21:02.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-05-31 23:21:02.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421911,ok=421911,error=0, records=41
[INFO ] 2026-05-31 23:21:04.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:21:07.968 [29915] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:21:17.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 23:21:17.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421912,ok=421912,error=0, records=41
[INFO ] 2026-05-31 23:21:18.830 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17575/300s
[INFO ] 2026-05-31 23:21:18.831 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889532},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:21:18.999 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:21:19.000 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-05-31 23:21:19.000 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:21:19.000 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:21:19.000 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:21:19.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:21:19.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:21:19.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:21:19.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:21:19.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:21:19.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:21:19.198 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:21:22.972 [29915] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:21:28.118 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21106/300s
[INFO ] 2026-05-31 23:21:32.525 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-05-31 23:21:32.525 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421913,ok=421913,error=0, records=41
[INFO ] 2026-05-31 23:21:34.198 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:21:37.977 [29880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:21:47.531 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-05-31 23:21:47.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421914,ok=421914,error=0, records=41
[INFO ] 2026-05-31 23:21:49.199 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:21:52.982 [29915] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:22:02.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-05-31 23:22:02.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421915,ok=421915,error=0, records=41
[INFO ] 2026-05-31 23:22:04.200 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:22:04.200 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21109/300s
[WARN ] 2026-05-31 23:22:07.988 [30030] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:22:17.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 23:22:17.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421916,ok=421916,error=0, records=41
[INFO ] 2026-05-31 23:22:19.200 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:22:22.993 [30016] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:22:32.550 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-05-31 23:22:32.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421917,ok=421917,error=0, records=41
[INFO ] 2026-05-31 23:22:34.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:22:36.146 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21107/300s
[WARN ] 2026-05-31 23:22:37.999 [30016] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:22:38.047 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21107/300s
[INFO ] 2026-05-31 23:22:45.952 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21107/300s
[INFO ] 2026-05-31 23:22:47.562 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-05-31 23:22:47.562 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421918,ok=421918,error=0, records=41
[INFO ] 2026-05-31 23:22:49.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:22:53.005 [30016] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:23:02.568 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-05-31 23:23:02.568 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421919,ok=421919,error=0, records=41
[INFO ] 2026-05-31 23:23:04.202 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:23:08.010 [30086] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:23:17.573 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 23:23:17.573 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421920,ok=421920,error=0, records=41
[INFO ] 2026-05-31 23:23:19.203 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:23:23.016 [30072] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:23:32.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-05-31 23:23:32.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421921,ok=421921,error=0, records=41
[INFO ] 2026-05-31 23:23:34.203 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 23:23:34.203 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 23:23:38.020 [30030] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:23:47.647 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-05-31 23:23:47.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421922,ok=421922,error=0, records=41
[INFO ] 2026-05-31 23:23:49.204 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:23:49.204 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 23:23:53.026 [30114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:24:02.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 23:24:02.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421923,ok=421923,error=0, records=41
[INFO ] 2026-05-31 23:24:04.206 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:24:08.031 [30030] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:24:17.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-05-31 23:24:17.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421924,ok=421924,error=0, records=41
[INFO ] 2026-05-31 23:24:19.002 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889452},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:24:19.169 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:24:19.169 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 23:24:19.169 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:24:19.170 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:24:19.170 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:24:19.206 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:24:19.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:24:19.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:24:19.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:24:19.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:24:19.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:24:19.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-05-31 23:24:23.035 [30072] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:24:32.666 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11261, records=44
[INFO ] 2026-05-31 23:24:32.666 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421925,ok=421925,error=0, records=44
[INFO ] 2026-05-31 23:24:34.207 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:24:38.040 [30178] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:24:46.542 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21102/300s
[INFO ] 2026-05-31 23:24:47.671 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-05-31 23:24:47.671 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421926,ok=421926,error=0, records=41
[INFO ] 2026-05-31 23:24:47.671 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21098/300s
[INFO ] 2026-05-31 23:24:49.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:24:53.044 [30193] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:25:00.538 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21111/300s
[INFO ] 2026-05-31 23:25:02.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-05-31 23:25:02.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421927,ok=421927,error=0, records=41
[INFO ] 2026-05-31 23:25:04.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:25:08.050 [30205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:25:17.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-05-31 23:25:17.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421928,ok=421928,error=0, records=41
[INFO ] 2026-05-31 23:25:19.209 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:25:22.555 [30205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:25:32.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 23:25:32.691 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421929,ok=421929,error=0, records=41
[INFO ] 2026-05-31 23:25:34.210 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:25:37.561 [30178] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:25:40.483 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21111/300s
[INFO ] 2026-05-31 23:25:47.038 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21098/300s
[INFO ] 2026-05-31 23:25:47.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-05-31 23:25:47.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421930,ok=421930,error=0, records=41
[INFO ] 2026-05-31 23:25:49.210 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:25:52.567 [30261] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:26:02.703 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-05-31 23:26:02.703 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421931,ok=421931,error=0, records=41
[INFO ] 2026-05-31 23:26:04.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:26:07.571 [30260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:26:17.710 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-05-31 23:26:17.710 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421932,ok=421932,error=0, records=41
[INFO ] 2026-05-31 23:26:19.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:26:22.576 [30285] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:26:28.172 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21107/300s
[INFO ] 2026-05-31 23:26:32.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-05-31 23:26:32.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421933,ok=421933,error=0, records=41
[INFO ] 2026-05-31 23:26:34.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:26:37.581 [30240] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:26:47.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-05-31 23:26:47.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421934,ok=421934,error=0, records=41
[INFO ] 2026-05-31 23:26:49.213 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:26:52.585 [30240] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:27:02.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-05-31 23:27:02.726 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421935,ok=421935,error=0, records=41
[INFO ] 2026-05-31 23:27:04.213 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:27:04.213 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21110/300s
[WARN ] 2026-05-31 23:27:07.590 [30240] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:27:17.731 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-05-31 23:27:17.731 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421936,ok=421936,error=0, records=41
[INFO ] 2026-05-31 23:27:19.170 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17576/300s
[INFO ] 2026-05-31 23:27:19.171 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889376},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:27:19.214 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-05-31 23:27:19.349 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:27:19.349 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 23:27:19.349 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:27:19.349 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:27:19.349 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:27:19.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:27:19.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:27:19.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:27:19.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:27:19.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:27:19.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-05-31 23:27:22.594 [30356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:27:32.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-05-31 23:27:32.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421937,ok=421937,error=0, records=41
[INFO ] 2026-05-31 23:27:34.214 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:27:36.184 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21108/300s
[WARN ] 2026-05-31 23:27:37.599 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:27:38.086 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21108/300s
[INFO ] 2026-05-31 23:27:45.990 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21108/300s
[INFO ] 2026-05-31 23:27:47.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-05-31 23:27:47.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421938,ok=421938,error=0, records=41
[INFO ] 2026-05-31 23:27:49.215 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:27:52.605 [30372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:28:02.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-05-31 23:28:02.749 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421939,ok=421939,error=0, records=41
[INFO ] 2026-05-31 23:28:04.215 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:28:07.611 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:28:17.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-05-31 23:28:17.754 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421940,ok=421940,error=0, records=41
[INFO ] 2026-05-31 23:28:19.216 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:28:22.616 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:28:32.774 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-05-31 23:28:32.774 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421941,ok=421941,error=0, records=41
[INFO ] 2026-05-31 23:28:34.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:28:37.622 [30356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:28:47.779 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-05-31 23:28:47.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421942,ok=421942,error=0, records=41
[INFO ] 2026-05-31 23:28:49.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:28:52.629 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:29:02.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-05-31 23:29:02.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421943,ok=421943,error=0, records=41
[INFO ] 2026-05-31 23:29:04.218 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:29:07.635 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:29:17.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-05-31 23:29:17.789 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421944,ok=421944,error=0, records=41
[INFO ] 2026-05-31 23:29:19.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:29:22.641 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:29:32.848 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-05-31 23:29:32.848 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421945,ok=421945,error=0, records=41
[INFO ] 2026-05-31 23:29:34.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:29:37.648 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:29:46.651 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21103/300s
[INFO ] 2026-05-31 23:29:47.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-05-31 23:29:47.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421946,ok=421946,error=0, records=41
[INFO ] 2026-05-31 23:29:47.855 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21099/300s
[INFO ] 2026-05-31 23:29:49.220 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:29:52.654 [30372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:30:00.541 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21112/300s
[INFO ] 2026-05-31 23:30:02.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-05-31 23:30:02.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421947,ok=421947,error=0, records=41
[INFO ] 2026-05-31 23:30:04.220 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:30:07.662 [30356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:30:17.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-05-31 23:30:17.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421948,ok=421948,error=0, records=41
[INFO ] 2026-05-31 23:30:19.221 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:30:19.351 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889276},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:30:19.511 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:30:19.511 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 23:30:19.511 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:30:19.512 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:30:19.512 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:30:19.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:30:19.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:30:19.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:30:19.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:30:19.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:30:19.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-05-31 23:30:22.667 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:30:32.872 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-05-31 23:30:32.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421949,ok=421949,error=0, records=41
[INFO ] 2026-05-31 23:30:34.222 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:30:37.673 [30381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:30:40.489 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21112/300s
[INFO ] 2026-05-31 23:30:47.215 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21099/300s
[INFO ] 2026-05-31 23:30:47.876 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-05-31 23:30:47.876 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421950,ok=421950,error=0, records=41
[INFO ] 2026-05-31 23:30:49.222 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:30:52.680 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:31:02.882 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-05-31 23:31:02.882 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421951,ok=421951,error=0, records=41
[INFO ] 2026-05-31 23:31:04.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:31:07.686 [30356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:31:17.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-05-31 23:31:17.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421952,ok=421952,error=0, records=41
[INFO ] 2026-05-31 23:31:19.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:31:22.691 [30372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:31:28.224 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21108/300s
[INFO ] 2026-05-31 23:31:32.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-05-31 23:31:32.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421953,ok=421953,error=0, records=41
[INFO ] 2026-05-31 23:31:34.224 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:31:37.697 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:31:47.899 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-05-31 23:31:47.899 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421954,ok=421954,error=0, records=41
[INFO ] 2026-05-31 23:31:49.224 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:31:52.704 [30321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:32:02.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-05-31 23:32:02.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421955,ok=421955,error=0, records=41
[INFO ] 2026-05-31 23:32:04.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:32:04.225 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21111/300s
[WARN ] 2026-05-31 23:32:07.709 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:32:17.910 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-05-31 23:32:17.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421956,ok=421956,error=0, records=41
[INFO ] 2026-05-31 23:32:19.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:32:22.715 [30356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:32:32.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-05-31 23:32:32.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421957,ok=421957,error=0, records=41
[INFO ] 2026-05-31 23:32:34.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:32:36.222 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21109/300s
[WARN ] 2026-05-31 23:32:37.720 [30356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:32:38.122 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21109/300s
[INFO ] 2026-05-31 23:32:46.028 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21109/300s
[INFO ] 2026-05-31 23:32:47.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-05-31 23:32:47.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421958,ok=421958,error=0, records=41
[INFO ] 2026-05-31 23:32:49.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:32:52.727 [30321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:33:02.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-05-31 23:33:02.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421959,ok=421959,error=0, records=41
[INFO ] 2026-05-31 23:33:04.227 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:33:07.732 [30356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:33:17.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-05-31 23:33:17.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421960,ok=421960,error=0, records=41
[INFO ] 2026-05-31 23:33:19.228 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:33:19.512 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17577/300s
[INFO ] 2026-05-31 23:33:19.513 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889184},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:33:19.661 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:33:19.661 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-05-31 23:33:19.662 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:33:19.662 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:33:19.662 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:33:19.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:33:19.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:33:19.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:33:19.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:33:19.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:33:19.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-05-31 23:33:22.737 [30356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:33:32.944 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-05-31 23:33:32.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421961,ok=421961,error=0, records=41
[INFO ] 2026-05-31 23:33:34.228 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 23:33:34.228 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 23:33:37.742 [30372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:33:47.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 23:33:47.950 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421962,ok=421962,error=0, records=41
[INFO ] 2026-05-31 23:33:49.229 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:33:52.748 [30321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:34:02.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 23:34:02.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421963,ok=421963,error=0, records=41
[INFO ] 2026-05-31 23:34:04.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:34:07.752 [30372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:34:17.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 23:34:17.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421964,ok=421964,error=0, records=41
[INFO ] 2026-05-31 23:34:19.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:34:22.757 [30356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:34:32.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-05-31 23:34:32.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421965,ok=421965,error=0, records=41
[INFO ] 2026-05-31 23:34:34.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:34:37.761 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:34:46.763 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21104/300s
[INFO ] 2026-05-31 23:34:47.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-05-31 23:34:47.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421966,ok=421966,error=0, records=41
[INFO ] 2026-05-31 23:34:47.973 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21100/300s
[INFO ] 2026-05-31 23:34:49.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:34:52.766 [30356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:35:00.544 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21113/300s
[INFO ] 2026-05-31 23:35:02.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-05-31 23:35:02.978 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421967,ok=421967,error=0, records=41
[INFO ] 2026-05-31 23:35:04.232 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:35:07.771 [30321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:35:18.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-05-31 23:35:18.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421968,ok=421968,error=0, records=41
[INFO ] 2026-05-31 23:35:19.233 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:35:22.776 [30321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:35:33.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-05-31 23:35:33.054 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421969,ok=421969,error=0, records=41
[INFO ] 2026-05-31 23:35:34.233 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:35:37.780 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:35:40.495 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21113/300s
[INFO ] 2026-05-31 23:35:47.388 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21100/300s
[INFO ] 2026-05-31 23:35:48.061 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-05-31 23:35:48.061 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421970,ok=421970,error=0, records=41
[INFO ] 2026-05-31 23:35:49.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:35:52.785 [30372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:36:03.070 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-05-31 23:36:03.070 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421971,ok=421971,error=0, records=41
[INFO ] 2026-05-31 23:36:04.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:36:07.790 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:36:18.075 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-05-31 23:36:18.075 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421972,ok=421972,error=0, records=41
[INFO ] 2026-05-31 23:36:19.235 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:36:19.663 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889108},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:36:19.827 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:36:19.828 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 23:36:19.828 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:36:19.828 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:36:19.828 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:36:19.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:36:19.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:36:19.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:36:19.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:36:19.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:36:19.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-05-31 23:36:22.795 [30372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:36:28.275 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21109/300s
[INFO ] 2026-05-31 23:36:33.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-05-31 23:36:33.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421973,ok=421973,error=0, records=41
[INFO ] 2026-05-31 23:36:34.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:36:37.800 [30372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:36:48.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-05-31 23:36:48.089 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421974,ok=421974,error=0, records=41
[INFO ] 2026-05-31 23:36:49.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:36:52.805 [30321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:37:03.094 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-05-31 23:37:03.094 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421975,ok=421975,error=0, records=41
[INFO ] 2026-05-31 23:37:04.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:37:04.237 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21112/300s
[WARN ] 2026-05-31 23:37:07.811 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:37:18.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-05-31 23:37:18.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421976,ok=421976,error=0, records=41
[INFO ] 2026-05-31 23:37:19.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:37:22.816 [30321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:37:33.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-05-31 23:37:33.105 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421977,ok=421977,error=0, records=41
[INFO ] 2026-05-31 23:37:34.238 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:37:36.260 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21110/300s
[WARN ] 2026-05-31 23:37:37.822 [30372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:37:38.163 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21110/300s
[INFO ] 2026-05-31 23:37:46.066 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21110/300s
[INFO ] 2026-05-31 23:37:48.110 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-05-31 23:37:48.110 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421978,ok=421978,error=0, records=41
[INFO ] 2026-05-31 23:37:49.238 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:37:52.828 [30321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:38:03.115 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-05-31 23:38:03.115 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421979,ok=421979,error=0, records=41
[INFO ] 2026-05-31 23:38:04.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:38:07.834 [30931] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:38:18.120 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-05-31 23:38:18.120 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421980,ok=421980,error=0, records=41
[INFO ] 2026-05-31 23:38:19.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:38:22.839 [30917] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:38:33.126 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-05-31 23:38:33.126 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421981,ok=421981,error=0, records=41
[INFO ] 2026-05-31 23:38:34.240 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:38:37.844 [30372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:38:48.132 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-05-31 23:38:48.132 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421982,ok=421982,error=0, records=41
[INFO ] 2026-05-31 23:38:49.240 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:38:49.240 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 23:38:52.850 [30917] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:39:03.138 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-05-31 23:39:03.138 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421983,ok=421983,error=0, records=41
[INFO ] 2026-05-31 23:39:04.242 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:39:07.856 [30931] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:39:18.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-05-31 23:39:18.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421984,ok=421984,error=0, records=41
[INFO ] 2026-05-31 23:39:19.242 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:39:19.828 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17578/300s
[INFO ] 2026-05-31 23:39:19.829 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889028},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:39:19.985 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:39:19.985 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-05-31 23:39:19.985 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:39:19.985 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:39:19.985 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:39:20.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:39:20.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:39:20.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:39:20.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:39:20.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:39:20.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-05-31 23:39:22.861 [30931] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:39:33.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-05-31 23:39:33.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421985,ok=421985,error=0, records=41
[INFO ] 2026-05-31 23:39:34.243 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:39:37.867 [31022] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:39:46.870 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21105/300s
[INFO ] 2026-05-31 23:39:48.152 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-05-31 23:39:48.152 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421986,ok=421986,error=0, records=41
[INFO ] 2026-05-31 23:39:48.152 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21101/300s
[INFO ] 2026-05-31 23:39:49.243 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:39:52.872 [31008] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:40:00.547 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21114/300s
[INFO ] 2026-05-31 23:40:03.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-05-31 23:40:03.172 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421987,ok=421987,error=0, records=41
[INFO ] 2026-05-31 23:40:04.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:40:07.879 [31008] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:40:18.177 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-05-31 23:40:18.177 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421988,ok=421988,error=0, records=41
[INFO ] 2026-05-31 23:40:19.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:40:22.884 [31087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:40:33.182 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-05-31 23:40:33.182 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421989,ok=421989,error=0, records=41
[INFO ] 2026-05-31 23:40:34.245 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:40:37.891 [31088] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:40:40.501 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21114/300s
[INFO ] 2026-05-31 23:40:47.561 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21101/300s
[INFO ] 2026-05-31 23:40:48.188 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-05-31 23:40:48.188 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421990,ok=421990,error=0, records=41
[INFO ] 2026-05-31 23:40:49.245 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:40:52.898 [31122] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:41:03.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-05-31 23:41:03.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421991,ok=421991,error=0, records=41
[INFO ] 2026-05-31 23:41:04.246 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:41:07.904 [31103] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:41:18.198 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-05-31 23:41:18.198 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421992,ok=421992,error=0, records=41
[INFO ] 2026-05-31 23:41:19.246 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:41:22.909 [31104] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:41:28.316 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21110/300s
[INFO ] 2026-05-31 23:41:33.203 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-05-31 23:41:33.203 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421993,ok=421993,error=0, records=41
[INFO ] 2026-05-31 23:41:34.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:41:37.915 [31172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:41:48.208 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-05-31 23:41:48.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421994,ok=421994,error=0, records=41
[INFO ] 2026-05-31 23:41:49.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:41:52.920 [31104] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:42:03.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-05-31 23:42:03.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421995,ok=421995,error=0, records=41
[INFO ] 2026-05-31 23:42:04.248 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:42:04.248 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21113/300s
[WARN ] 2026-05-31 23:42:07.926 [31194] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:42:18.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-05-31 23:42:18.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421996,ok=421996,error=0, records=41
[INFO ] 2026-05-31 23:42:19.248 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:42:19.987 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888948},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:42:20.138 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:42:20.138 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-05-31 23:42:20.138 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:42:20.138 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:42:20.138 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:42:20.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:42:20.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:42:20.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:42:20.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:42:20.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:42:20.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-05-31 23:42:22.932 [31104] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:42:33.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-05-31 23:42:33.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421997,ok=421997,error=0, records=41
[INFO ] 2026-05-31 23:42:34.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:42:36.354 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21111/300s
[WARN ] 2026-05-31 23:42:37.937 [31234] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:42:38.255 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21111/300s
[INFO ] 2026-05-31 23:42:46.151 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21111/300s
[INFO ] 2026-05-31 23:42:48.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-05-31 23:42:48.262 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421998,ok=421998,error=0, records=41
[INFO ] 2026-05-31 23:42:49.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:42:52.943 [31241] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:43:03.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-05-31 23:43:03.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=421999,ok=421999,error=0, records=41
[INFO ] 2026-05-31 23:43:04.250 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:43:07.949 [31246] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:43:18.276 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-05-31 23:43:18.276 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422000,ok=422000,error=0, records=41
[INFO ] 2026-05-31 23:43:19.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:43:22.954 [31241] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:43:33.282 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-05-31 23:43:33.282 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422001,ok=422001,error=0, records=41
[INFO ] 2026-05-31 23:43:34.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 23:43:34.251 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 23:43:37.959 [31282] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:43:48.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-05-31 23:43:48.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422002,ok=422002,error=0, records=41
[INFO ] 2026-05-31 23:43:49.252 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:43:52.963 [31310] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:44:03.294 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-05-31 23:44:03.294 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422003,ok=422003,error=0, records=41
[INFO ] 2026-05-31 23:44:04.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:44:07.969 [31246] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:44:18.299 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-05-31 23:44:18.299 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422004,ok=422004,error=0, records=41
[INFO ] 2026-05-31 23:44:19.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:44:22.974 [31241] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:44:33.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-05-31 23:44:33.306 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422005,ok=422005,error=0, records=41
[INFO ] 2026-05-31 23:44:34.254 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:44:37.978 [31338] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:44:46.982 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21106/300s
[INFO ] 2026-05-31 23:44:48.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-05-31 23:44:48.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422006,ok=422006,error=0, records=41
[INFO ] 2026-05-31 23:44:48.311 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21102/300s
[INFO ] 2026-05-31 23:44:49.254 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:44:52.985 [31352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:45:00.550 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21115/300s
[INFO ] 2026-05-31 23:45:03.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-05-31 23:45:03.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422007,ok=422007,error=0, records=41
[INFO ] 2026-05-31 23:45:04.255 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:45:07.990 [31246] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:45:18.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-05-31 23:45:18.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422008,ok=422008,error=0, records=41
[INFO ] 2026-05-31 23:45:19.256 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:45:20.139 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17579/300s
[INFO ] 2026-05-31 23:45:20.140 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888872},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:45:20.314 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:45:20.314 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-05-31 23:45:20.314 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:45:20.314 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:45:20.314 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:45:20.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:45:20.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:45:20.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:45:20.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:45:20.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:45:20.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-05-31 23:45:22.995 [31310] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:45:33.327 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-05-31 23:45:33.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422009,ok=422009,error=0, records=41
[INFO ] 2026-05-31 23:45:34.256 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:45:38.000 [31352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:45:40.506 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21115/300s
[INFO ] 2026-05-31 23:45:47.740 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21102/300s
[INFO ] 2026-05-31 23:45:48.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-05-31 23:45:48.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422010,ok=422010,error=0, records=41
[INFO ] 2026-05-31 23:45:49.257 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:45:53.006 [31246] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:46:03.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-05-31 23:46:03.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422011,ok=422011,error=0, records=41
[INFO ] 2026-05-31 23:46:04.258 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:46:08.011 [31435] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:46:18.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-05-31 23:46:18.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422012,ok=422012,error=0, records=41
[INFO ] 2026-05-31 23:46:19.258 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:46:23.016 [31352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:46:28.367 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21111/300s
[INFO ] 2026-05-31 23:46:33.354 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-05-31 23:46:33.354 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422013,ok=422013,error=0, records=41
[INFO ] 2026-05-31 23:46:34.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:46:38.021 [31407] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:46:48.359 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-05-31 23:46:48.359 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422014,ok=422014,error=0, records=41
[INFO ] 2026-05-31 23:46:49.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:46:53.026 [31421] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:47:03.366 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-05-31 23:47:03.366 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422015,ok=422015,error=0, records=41
[INFO ] 2026-05-31 23:47:04.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:47:04.260 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21114/300s
[WARN ] 2026-05-31 23:47:08.031 [31421] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:47:18.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-05-31 23:47:18.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422016,ok=422016,error=0, records=41
[INFO ] 2026-05-31 23:47:19.261 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:47:23.036 [31421] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:47:33.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-05-31 23:47:33.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422017,ok=422017,error=0, records=41
[INFO ] 2026-05-31 23:47:34.261 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:47:36.406 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21112/300s
[WARN ] 2026-05-31 23:47:38.041 [31506] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:47:38.308 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21112/300s
[INFO ] 2026-05-31 23:47:46.196 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21112/300s
[INFO ] 2026-05-31 23:47:48.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-05-31 23:47:48.384 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422018,ok=422018,error=0, records=41
[INFO ] 2026-05-31 23:47:49.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:47:53.046 [31521] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:48:03.388 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-05-31 23:48:03.388 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422019,ok=422019,error=0, records=41
[INFO ] 2026-05-31 23:48:04.263 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:48:08.050 [31554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:48:18.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-05-31 23:48:18.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422020,ok=422020,error=0, records=41
[INFO ] 2026-05-31 23:48:19.263 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:48:20.316 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888796},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:48:20.493 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:48:20.493 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 23:48:20.493 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:48:20.493 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:48:20.493 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:48:20.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:48:20.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:48:20.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:48:20.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:48:20.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:48:20.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-05-31 23:48:22.556 [31574] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:48:33.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-05-31 23:48:33.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422021,ok=422021,error=0, records=41
[INFO ] 2026-05-31 23:48:34.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:48:37.561 [31594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:48:48.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-05-31 23:48:48.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422022,ok=422022,error=0, records=41
[INFO ] 2026-05-31 23:48:49.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:48:52.565 [31595] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:49:03.411 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-05-31 23:49:03.411 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422023,ok=422023,error=0, records=41
[INFO ] 2026-05-31 23:49:04.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:49:07.570 [31594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:49:18.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-05-31 23:49:18.416 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422024,ok=422024,error=0, records=41
[INFO ] 2026-05-31 23:49:19.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:49:22.575 [31632] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:49:33.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-05-31 23:49:33.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422025,ok=422025,error=0, records=41
[INFO ] 2026-05-31 23:49:34.266 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:49:37.579 [31632] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:49:47.082 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21107/300s
[INFO ] 2026-05-31 23:49:48.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-05-31 23:49:48.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422026,ok=422026,error=0, records=41
[INFO ] 2026-05-31 23:49:48.426 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21103/300s
[INFO ] 2026-05-31 23:49:49.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:49:52.584 [31677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:50:00.553 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21116/300s
[INFO ] 2026-05-31 23:50:03.434 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-05-31 23:50:03.434 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422027,ok=422027,error=0, records=41
[INFO ] 2026-05-31 23:50:04.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:50:07.589 [31706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:50:18.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-05-31 23:50:18.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422028,ok=422028,error=0, records=41
[INFO ] 2026-05-31 23:50:19.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:50:22.594 [31706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:50:33.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-05-31 23:50:33.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422029,ok=422029,error=0, records=41
[INFO ] 2026-05-31 23:50:34.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:50:37.599 [31718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:50:40.513 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21116/300s
[INFO ] 2026-05-31 23:50:47.921 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21103/300s
[INFO ] 2026-05-31 23:50:48.451 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-05-31 23:50:48.451 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422030,ok=422030,error=0, records=41
[INFO ] 2026-05-31 23:50:49.269 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:50:52.606 [31733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:51:03.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-05-31 23:51:03.456 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422031,ok=422031,error=0, records=41
[INFO ] 2026-05-31 23:51:04.269 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:51:07.610 [31691] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:51:18.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-05-31 23:51:18.462 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422032,ok=422032,error=0, records=41
[INFO ] 2026-05-31 23:51:19.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:51:20.494 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17580/300s
[INFO ] 2026-05-31 23:51:20.495 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888716},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:51:20.672 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:51:20.672 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-05-31 23:51:20.673 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:51:20.673 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:51:20.673 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:51:20.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:51:20.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:51:20.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:51:20.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:51:20.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:51:20.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-05-31 23:51:22.615 [31706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:51:28.420 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21112/300s
[INFO ] 2026-05-31 23:51:33.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-05-31 23:51:33.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422033,ok=422033,error=0, records=41
[INFO ] 2026-05-31 23:51:34.271 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:51:37.620 [31718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:51:48.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-05-31 23:51:48.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422034,ok=422034,error=0, records=41
[INFO ] 2026-05-31 23:51:49.271 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:51:52.625 [31733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:52:03.482 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-05-31 23:52:03.482 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422035,ok=422035,error=0, records=41
[INFO ] 2026-05-31 23:52:04.272 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:52:04.272 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21115/300s
[WARN ] 2026-05-31 23:52:07.631 [31701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:52:18.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-05-31 23:52:18.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422036,ok=422036,error=0, records=41
[INFO ] 2026-05-31 23:52:19.273 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:52:22.635 [31701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:52:33.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-05-31 23:52:33.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422037,ok=422037,error=0, records=41
[INFO ] 2026-05-31 23:52:34.273 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:52:36.463 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21113/300s
[WARN ] 2026-05-31 23:52:37.640 [31718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:52:38.365 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21113/300s
[INFO ] 2026-05-31 23:52:46.248 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21113/300s
[INFO ] 2026-05-31 23:52:48.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-05-31 23:52:48.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422038,ok=422038,error=0, records=41
[INFO ] 2026-05-31 23:52:49.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:52:52.645 [31718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:53:03.606 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-05-31 23:53:03.606 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422039,ok=422039,error=0, records=41
[INFO ] 2026-05-31 23:53:04.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:53:07.651 [31733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:53:18.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-05-31 23:53:18.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422040,ok=422040,error=0, records=41
[INFO ] 2026-05-31 23:53:19.275 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:53:22.655 [31691] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:53:33.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-05-31 23:53:33.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422041,ok=422041,error=0, records=41
[INFO ] 2026-05-31 23:53:34.275 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-05-31 23:53:34.275 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-05-31 23:53:37.661 [31733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:53:48.623 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-05-31 23:53:48.623 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422042,ok=422042,error=0, records=41
[INFO ] 2026-05-31 23:53:49.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:53:49.276 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-05-31 23:53:52.667 [31701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:54:03.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-05-31 23:54:03.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422043,ok=422043,error=0, records=41
[INFO ] 2026-05-31 23:54:04.277 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:54:07.672 [31706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:54:18.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-05-31 23:54:18.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422044,ok=422044,error=0, records=41
[INFO ] 2026-05-31 23:54:19.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:54:20.675 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889256},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:54:21.083 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:54:21.083 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[WARN ] 2026-05-31 23:54:22.677 [31733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:54:33.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-05-31 23:54:33.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422045,ok=422045,error=0, records=41
[INFO ] 2026-05-31 23:54:34.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:54:37.682 [31701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:54:47.185 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21108/300s
[INFO ] 2026-05-31 23:54:48.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-05-31 23:54:48.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422046,ok=422046,error=0, records=41
[INFO ] 2026-05-31 23:54:48.650 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21104/300s
[INFO ] 2026-05-31 23:54:49.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:54:52.687 [31733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:55:00.556 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21117/300s
[INFO ] 2026-05-31 23:55:03.655 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-05-31 23:55:03.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422047,ok=422047,error=0, records=41
[INFO ] 2026-05-31 23:55:04.280 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:55:07.693 [31733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:55:18.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-05-31 23:55:18.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422048,ok=422048,error=0, records=41
[INFO ] 2026-05-31 23:55:19.280 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:55:22.697 [31701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:55:33.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-05-31 23:55:33.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422049,ok=422049,error=0, records=41
[INFO ] 2026-05-31 23:55:34.281 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:55:37.701 [31706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:55:40.518 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21117/300s
[INFO ] 2026-05-31 23:55:48.097 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21104/300s
[INFO ] 2026-05-31 23:55:48.671 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-05-31 23:55:48.671 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422050,ok=422050,error=0, records=41
[INFO ] 2026-05-31 23:55:49.281 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:55:52.706 [31733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:56:03.678 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-05-31 23:56:03.678 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422051,ok=422051,error=0, records=41
[INFO ] 2026-05-31 23:56:04.282 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:56:07.711 [31701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:56:18.684 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-05-31 23:56:18.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422052,ok=422052,error=0, records=41
[INFO ] 2026-05-31 23:56:19.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:56:22.716 [31718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:56:28.469 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21113/300s
[INFO ] 2026-05-31 23:56:33.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-05-31 23:56:33.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422053,ok=422053,error=0, records=41
[INFO ] 2026-05-31 23:56:34.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:56:37.721 [31718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:56:48.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-05-31 23:56:48.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422054,ok=422054,error=0, records=41
[INFO ] 2026-05-31 23:56:49.284 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:56:52.726 [31701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:57:03.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-05-31 23:57:03.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422055,ok=422055,error=0, records=41
[INFO ] 2026-05-31 23:57:04.284 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:57:04.284 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21116/300s
[WARN ] 2026-05-31 23:57:07.732 [31691] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:57:18.707 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-05-31 23:57:18.707 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422056,ok=422056,error=0, records=41
[INFO ] 2026-05-31 23:57:19.285 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:57:21.084 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17581/300s
[INFO ] 2026-05-31 23:57:21.085 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889176},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-05-31 23:57:21.245 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-05-31 23:57:21.245 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-05-31 23:57:21.245 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-05-31 23:57:21.245 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-05-31 23:57:21.245 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-05-31 23:57:21.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:57:21.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-05-31 23:57:21.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:57:21.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-05-31 23:57:21.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-05-31 23:57:21.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-05-31 23:57:22.737 [31718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:57:33.713 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-05-31 23:57:33.713 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422057,ok=422057,error=0, records=41
[INFO ] 2026-05-31 23:57:34.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-05-31 23:57:36.508 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21114/300s
[WARN ] 2026-05-31 23:57:37.742 [31691] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:57:38.410 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21114/300s
[INFO ] 2026-05-31 23:57:46.286 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21114/300s
[INFO ] 2026-05-31 23:57:48.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-05-31 23:57:48.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422058,ok=422058,error=0, records=41
[INFO ] 2026-05-31 23:57:49.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:57:52.747 [31701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:58:03.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-05-31 23:58:03.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422059,ok=422059,error=0, records=41
[INFO ] 2026-05-31 23:58:04.287 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:58:07.752 [31701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:58:18.730 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-05-31 23:58:18.731 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422060,ok=422060,error=0, records=41
[INFO ] 2026-05-31 23:58:19.287 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:58:22.758 [31706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:58:33.735 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-05-31 23:58:33.735 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422061,ok=422061,error=0, records=41
[INFO ] 2026-05-31 23:58:34.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:58:37.763 [31718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:58:48.746 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-05-31 23:58:48.746 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422062,ok=422062,error=0, records=41
[INFO ] 2026-05-31 23:58:49.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:58:52.769 [31701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:59:03.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-05-31 23:59:03.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422063,ok=422063,error=0, records=41
[INFO ] 2026-05-31 23:59:04.289 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:59:07.774 [31691] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:59:18.766 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-05-31 23:59:18.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422064,ok=422064,error=0, records=41
[INFO ] 2026-05-31 23:59:19.289 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:59:22.779 [31706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:59:33.772 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-05-31 23:59:33.772 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422065,ok=422065,error=0, records=41
[INFO ] 2026-05-31 23:59:34.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:59:37.785 [31733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-05-31 23:59:47.288 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21109/300s
[INFO ] 2026-05-31 23:59:48.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-05-31 23:59:48.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422066,ok=422066,error=0, records=41
[INFO ] 2026-05-31 23:59:48.777 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21105/300s
[INFO ] 2026-05-31 23:59:49.291 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-05-31 23:59:52.790 [31691] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:00:00.559 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21118/300s
[INFO ] 2026-06-01 00:00:03.811 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10385, records=41
[INFO ] 2026-06-01 00:00:03.811 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422067,ok=422067,error=0, records=41
[INFO ] 2026-06-01 00:00:04.296 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:00:07.824 [31691] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:00:18.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10399, records=41
[INFO ] 2026-06-01 00:00:18.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422068,ok=422068,error=0, records=41
[INFO ] 2026-06-01 00:00:19.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:00:21.246 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889840},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:00:21.411 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:00:21.411 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 00:00:21.411 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:00:21.411 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:00:21.411 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:00:21.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:00:21.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:00:21.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:00:21.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:00:21.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:00:21.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 00:00:22.829 [32276] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:00:33.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10406, records=41
[INFO ] 2026-06-01 00:00:33.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422069,ok=422069,error=0, records=41
[INFO ] 2026-06-01 00:00:34.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:00:37.835 [32305] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:00:40.525 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21118/300s
[INFO ] 2026-06-01 00:00:48.285 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21105/300s
[INFO ] 2026-06-01 00:00:48.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10384, records=41
[INFO ] 2026-06-01 00:00:48.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422070,ok=422070,error=0, records=41
[INFO ] 2026-06-01 00:00:49.298 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:00:52.840 [31706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:01:03.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 00:01:03.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422071,ok=422071,error=0, records=41
[INFO ] 2026-06-01 00:01:04.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:01:07.845 [31706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:01:18.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 00:01:18.839 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422072,ok=422072,error=0, records=41
[INFO ] 2026-06-01 00:01:19.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:01:22.851 [32355] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:01:28.529 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21114/300s
[INFO ] 2026-06-01 00:01:33.851 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 00:01:33.852 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422073,ok=422073,error=0, records=41
[INFO ] 2026-06-01 00:01:34.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:01:37.856 [32369] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:01:48.858 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 00:01:48.858 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422074,ok=422074,error=0, records=41
[INFO ] 2026-06-01 00:01:49.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:01:52.862 [32369] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:02:03.929 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-01 00:02:03.929 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422075,ok=422075,error=0, records=41
[INFO ] 2026-06-01 00:02:04.301 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:02:04.301 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21117/300s
[WARN ] 2026-06-01 00:02:07.867 [32355] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:02:18.934 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10130, records=41
[INFO ] 2026-06-01 00:02:18.934 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422076,ok=422076,error=0, records=41
[INFO ] 2026-06-01 00:02:19.301 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:02:22.873 [32305] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:02:33.941 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10107, records=41
[INFO ] 2026-06-01 00:02:33.941 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422077,ok=422077,error=0, records=41
[INFO ] 2026-06-01 00:02:34.302 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:02:36.566 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21115/300s
[WARN ] 2026-06-01 00:02:37.878 [32432] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:02:38.467 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21115/300s
[INFO ] 2026-06-01 00:02:46.309 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21115/300s
[INFO ] 2026-06-01 00:02:48.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10112, records=41
[INFO ] 2026-06-01 00:02:48.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422078,ok=422078,error=0, records=41
[INFO ] 2026-06-01 00:02:49.303 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:02:52.883 [32432] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:03:03.952 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-01 00:03:03.952 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422079,ok=422079,error=0, records=41
[INFO ] 2026-06-01 00:03:04.303 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:03:07.889 [32453] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:03:18.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 00:03:18.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422080,ok=422080,error=0, records=41
[INFO ] 2026-06-01 00:03:19.304 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:03:21.412 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17582/300s
[INFO ] 2026-06-01 00:03:21.413 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889760},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:03:21.560 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:03:21.560 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 00:03:21.560 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:03:21.560 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:03:21.560 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:03:21.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:03:21.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:03:21.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:03:21.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:03:21.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:03:21.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 00:03:22.894 [32475] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:03:33.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 00:03:33.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422081,ok=422081,error=0, records=41
[INFO ] 2026-06-01 00:03:34.304 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 00:03:34.304 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 00:03:37.900 [32476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:03:48.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 00:03:48.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422082,ok=422082,error=0, records=41
[INFO ] 2026-06-01 00:03:49.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:03:52.905 [32510] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:04:03.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-01 00:04:03.979 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422083,ok=422083,error=0, records=41
[INFO ] 2026-06-01 00:04:04.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:04:07.911 [32500] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:04:18.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 00:04:18.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422084,ok=422084,error=0, records=41
[INFO ] 2026-06-01 00:04:19.306 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:04:22.917 [32542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:04:33.990 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 00:04:33.990 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422085,ok=422085,error=0, records=41
[INFO ] 2026-06-01 00:04:34.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:04:37.921 [32560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:04:47.424 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21110/300s
[INFO ] 2026-06-01 00:04:48.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 00:04:48.996 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422086,ok=422086,error=0, records=41
[INFO ] 2026-06-01 00:04:48.996 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21106/300s
[INFO ] 2026-06-01 00:04:49.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:04:52.927 [32583] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:05:00.562 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21119/300s
[INFO ] 2026-06-01 00:05:04.001 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 00:05:04.001 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422087,ok=422087,error=0, records=41
[INFO ] 2026-06-01 00:05:04.308 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:05:07.933 [32594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:05:19.006 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 00:05:19.006 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422088,ok=422088,error=0, records=41
[INFO ] 2026-06-01 00:05:19.308 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:05:22.939 [32601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:05:34.076 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 00:05:34.076 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422089,ok=422089,error=0, records=41
[INFO ] 2026-06-01 00:05:34.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:05:37.944 [32612] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:05:40.530 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21119/300s
[INFO ] 2026-06-01 00:05:48.458 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21106/300s
[INFO ] 2026-06-01 00:05:49.081 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 00:05:49.081 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422090,ok=422090,error=0, records=41
[INFO ] 2026-06-01 00:05:49.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:05:52.950 [32640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:06:04.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 00:06:04.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422091,ok=422091,error=0, records=41
[INFO ] 2026-06-01 00:06:04.310 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:06:07.955 [32576] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:06:19.094 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 00:06:19.094 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422092,ok=422092,error=0, records=41
[INFO ] 2026-06-01 00:06:19.310 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:06:21.561 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889688},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:06:21.725 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:06:21.725 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 00:06:21.725 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:06:21.725 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:06:21.725 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:06:21.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:06:21.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:06:21.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:06:21.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:06:21.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:06:21.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 00:06:22.960 [32674] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:06:28.575 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21115/300s
[INFO ] 2026-06-01 00:06:34.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 00:06:34.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422093,ok=422093,error=0, records=41
[INFO ] 2026-06-01 00:06:34.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:06:37.964 [32646] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:06:49.104 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 00:06:49.104 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422094,ok=422094,error=0, records=41
[INFO ] 2026-06-01 00:06:49.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:06:52.970 [32674] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:07:04.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 00:07:04.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422095,ok=422095,error=0, records=41
[INFO ] 2026-06-01 00:07:04.312 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:07:04.312 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21118/300s
[WARN ] 2026-06-01 00:07:07.975 [32576] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:07:19.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 00:07:19.114 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422096,ok=422096,error=0, records=41
[INFO ] 2026-06-01 00:07:19.313 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:07:22.980 [32703] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:07:34.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 00:07:34.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422097,ok=422097,error=0, records=41
[INFO ] 2026-06-01 00:07:34.313 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:07:36.588 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21116/300s
[WARN ] 2026-06-01 00:07:37.985 [32732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:07:38.489 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21116/300s
[INFO ] 2026-06-01 00:07:46.309 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21116/300s
[INFO ] 2026-06-01 00:07:49.126 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 00:07:49.126 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422098,ok=422098,error=0, records=41
[INFO ] 2026-06-01 00:07:49.314 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:07:52.990 [32760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:08:04.141 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 00:08:04.141 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422099,ok=422099,error=0, records=41
[INFO ] 2026-06-01 00:08:04.314 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:08:07.995 [306  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:08:19.146 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 00:08:19.146 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422100,ok=422100,error=0, records=41
[INFO ] 2026-06-01 00:08:19.315 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:08:23.000 [32645] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:08:34.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 00:08:34.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422101,ok=422101,error=0, records=41
[INFO ] 2026-06-01 00:08:34.316 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:08:38.005 [320  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:08:49.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 00:08:49.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422102,ok=422102,error=0, records=41
[INFO ] 2026-06-01 00:08:49.316 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:08:49.316 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 00:08:53.010 [348  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:09:04.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 00:09:04.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422103,ok=422103,error=0, records=41
[INFO ] 2026-06-01 00:09:04.318 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:09:08.016 [32674] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:09:19.173 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 00:09:19.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422104,ok=422104,error=0, records=41
[INFO ] 2026-06-01 00:09:19.318 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:09:21.725 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17583/300s
[INFO ] 2026-06-01 00:09:21.727 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889600},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:09:21.879 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:09:21.879 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 00:09:21.879 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:09:21.879 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:09:21.879 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:09:21.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:09:21.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:09:21.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:09:21.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:09:21.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:09:21.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 00:09:23.021 [334  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:09:34.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 00:09:34.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422105,ok=422105,error=0, records=41
[INFO ] 2026-06-01 00:09:34.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:09:38.026 [32732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:09:47.528 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21111/300s
[INFO ] 2026-06-01 00:09:49.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 00:09:49.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422106,ok=422106,error=0, records=41
[INFO ] 2026-06-01 00:09:49.187 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21107/300s
[INFO ] 2026-06-01 00:09:49.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:09:53.031 [334  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:10:00.565 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21120/300s
[INFO ] 2026-06-01 00:10:04.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 00:10:04.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422107,ok=422107,error=0, records=41
[INFO ] 2026-06-01 00:10:04.320 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:10:08.035 [393  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:10:19.198 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 00:10:19.198 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422108,ok=422108,error=0, records=41
[INFO ] 2026-06-01 00:10:19.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:10:23.040 [363  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:10:34.203 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 00:10:34.203 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422109,ok=422109,error=0, records=41
[INFO ] 2026-06-01 00:10:34.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:10:38.045 [443  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:10:40.536 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21120/300s
[INFO ] 2026-06-01 00:10:48.636 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21107/300s
[INFO ] 2026-06-01 00:10:49.208 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 00:10:49.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422110,ok=422110,error=0, records=41
[INFO ] 2026-06-01 00:10:49.322 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:10:53.049 [363  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:11:04.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 00:11:04.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422111,ok=422111,error=0, records=41
[INFO ] 2026-06-01 00:11:04.322 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:11:07.554 [478  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:11:19.220 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 00:11:19.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422112,ok=422112,error=0, records=41
[INFO ] 2026-06-01 00:11:19.323 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:11:22.559 [363  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:11:28.628 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21116/300s
[INFO ] 2026-06-01 00:11:34.225 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 00:11:34.225 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422113,ok=422113,error=0, records=41
[INFO ] 2026-06-01 00:11:34.323 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:11:37.564 [505  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:11:49.230 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 00:11:49.230 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422114,ok=422114,error=0, records=41
[INFO ] 2026-06-01 00:11:49.324 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:11:52.569 [563  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:12:04.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 00:12:04.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422115,ok=422115,error=0, records=41
[INFO ] 2026-06-01 00:12:04.325 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:12:04.325 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21119/300s
[WARN ] 2026-06-01 00:12:07.576 [575  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:12:19.241 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 00:12:19.241 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422116,ok=422116,error=0, records=41
[INFO ] 2026-06-01 00:12:19.325 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:12:21.881 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889520},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:12:22.039 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:12:22.039 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 00:12:22.039 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:12:22.039 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:12:22.039 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:12:22.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:12:22.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:12:22.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:12:22.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:12:22.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:12:22.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 00:12:22.581 [599  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:12:34.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 00:12:34.247 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422117,ok=422117,error=0, records=41
[INFO ] 2026-06-01 00:12:34.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:12:36.645 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21117/300s
[WARN ] 2026-06-01 00:12:37.587 [609  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:12:38.547 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21117/300s
[INFO ] 2026-06-01 00:12:46.354 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21117/300s
[INFO ] 2026-06-01 00:12:49.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 00:12:49.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422118,ok=422118,error=0, records=41
[INFO ] 2026-06-01 00:12:49.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:12:52.592 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:13:04.257 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 00:13:04.257 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422119,ok=422119,error=0, records=41
[INFO ] 2026-06-01 00:13:04.327 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:13:07.597 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:13:19.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 00:13:19.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422120,ok=422120,error=0, records=41
[INFO ] 2026-06-01 00:13:19.328 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:13:22.602 [627  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:13:34.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 00:13:34.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422121,ok=422121,error=0, records=41
[INFO ] 2026-06-01 00:13:34.328 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 00:13:34.328 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 00:13:37.607 [621  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:13:49.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 00:13:49.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422122,ok=422122,error=0, records=41
[INFO ] 2026-06-01 00:13:49.329 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:13:52.611 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:14:04.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 00:14:04.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422123,ok=422123,error=0, records=41
[INFO ] 2026-06-01 00:14:04.330 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:14:07.616 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:14:19.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 00:14:19.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422124,ok=422124,error=0, records=41
[INFO ] 2026-06-01 00:14:19.330 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:14:22.622 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:14:34.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 00:14:34.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422125,ok=422125,error=0, records=41
[INFO ] 2026-06-01 00:14:34.331 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:14:37.629 [621  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:14:47.632 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21112/300s
[INFO ] 2026-06-01 00:14:49.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 00:14:49.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422126,ok=422126,error=0, records=41
[INFO ] 2026-06-01 00:14:49.300 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21108/300s
[INFO ] 2026-06-01 00:14:49.332 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:14:52.634 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:15:00.568 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21121/300s
[INFO ] 2026-06-01 00:15:04.304 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 00:15:04.304 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422127,ok=422127,error=0, records=41
[INFO ] 2026-06-01 00:15:04.332 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:15:07.639 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:15:19.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-01 00:15:19.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422128,ok=422128,error=0, records=41
[INFO ] 2026-06-01 00:15:19.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:15:22.039 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17584/300s
[INFO ] 2026-06-01 00:15:22.040 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889440},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:15:22.199 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:15:22.200 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 00:15:22.200 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:15:22.200 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:15:22.200 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:15:22.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:15:22.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:15:22.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:15:22.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:15:22.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:15:22.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 00:15:22.645 [627  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:15:34.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 00:15:34.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422129,ok=422129,error=0, records=41
[INFO ] 2026-06-01 00:15:34.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:15:37.651 [621  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:15:40.543 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21121/300s
[INFO ] 2026-06-01 00:15:48.813 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21108/300s
[INFO ] 2026-06-01 00:15:49.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 00:15:49.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422130,ok=422130,error=0, records=41
[INFO ] 2026-06-01 00:15:49.334 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:15:52.656 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:16:04.329 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 00:16:04.329 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422131,ok=422131,error=0, records=41
[INFO ] 2026-06-01 00:16:04.334 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:16:07.662 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:16:19.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 00:16:19.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422132,ok=422132,error=0, records=41
[INFO ] 2026-06-01 00:16:19.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 00:16:22.667 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:16:28.681 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21117/300s
[INFO ] 2026-06-01 00:16:34.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:16:34.340 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 00:16:34.341 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422133,ok=422133,error=0, records=41
[WARN ] 2026-06-01 00:16:37.671 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:16:49.336 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:16:49.346 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 00:16:49.346 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422134,ok=422134,error=0, records=41
[WARN ] 2026-06-01 00:16:52.675 [627  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:17:04.336 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:17:04.337 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21120/300s
[INFO ] 2026-06-01 00:17:04.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 00:17:04.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422135,ok=422135,error=0, records=41
[WARN ] 2026-06-01 00:17:07.681 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:17:19.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:17:19.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-01 00:17:19.355 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422136,ok=422136,error=0, records=41
[WARN ] 2026-06-01 00:17:22.686 [576  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:17:34.338 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:17:34.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 00:17:34.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422137,ok=422137,error=0, records=41
[INFO ] 2026-06-01 00:17:36.698 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21118/300s
[WARN ] 2026-06-01 00:17:37.692 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:17:38.600 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21118/300s
[INFO ] 2026-06-01 00:17:46.406 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21118/300s
[INFO ] 2026-06-01 00:17:49.338 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:17:49.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 00:17:49.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422138,ok=422138,error=0, records=41
[WARN ] 2026-06-01 00:17:52.696 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:18:04.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:18:04.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 00:18:04.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422139,ok=422139,error=0, records=41
[WARN ] 2026-06-01 00:18:07.701 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:18:19.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:18:19.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 00:18:19.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422140,ok=422140,error=0, records=41
[INFO ] 2026-06-01 00:18:22.201 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889364},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:18:22.361 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:18:22.361 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 00:18:22.361 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:18:22.361 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:18:22.361 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:18:22.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:18:22.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:18:22.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:18:22.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:18:22.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:18:22.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 00:18:22.706 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:18:34.340 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:18:34.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 00:18:34.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422141,ok=422141,error=0, records=41
[WARN ] 2026-06-01 00:18:37.711 [576  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:18:49.341 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:18:49.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 00:18:49.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422142,ok=422142,error=0, records=41
[WARN ] 2026-06-01 00:18:52.715 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:19:04.341 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:19:04.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 00:19:04.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422143,ok=422143,error=0, records=41
[WARN ] 2026-06-01 00:19:07.720 [621  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:19:19.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:19:19.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 00:19:19.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422144,ok=422144,error=0, records=41
[WARN ] 2026-06-01 00:19:22.725 [621  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:19:34.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:19:34.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 00:19:34.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422145,ok=422145,error=0, records=41
[WARN ] 2026-06-01 00:19:37.731 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:19:47.734 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21113/300s
[INFO ] 2026-06-01 00:19:49.343 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:19:49.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 00:19:49.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422146,ok=422146,error=0, records=41
[INFO ] 2026-06-01 00:19:49.481 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21109/300s
[WARN ] 2026-06-01 00:19:52.736 [627  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:20:00.571 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21122/300s
[INFO ] 2026-06-01 00:20:04.343 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:20:04.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 00:20:04.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422147,ok=422147,error=0, records=41
[WARN ] 2026-06-01 00:20:07.741 [627  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:20:19.344 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:20:19.494 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 00:20:19.494 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422148,ok=422148,error=0, records=41
[WARN ] 2026-06-01 00:20:22.746 [621  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:20:34.345 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:20:34.499 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 00:20:34.499 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422149,ok=422149,error=0, records=41
[WARN ] 2026-06-01 00:20:37.751 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:20:40.549 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21122/300s
[INFO ] 2026-06-01 00:20:48.997 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21109/300s
[INFO ] 2026-06-01 00:20:49.345 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:20:49.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 00:20:49.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422150,ok=422150,error=0, records=41
[WARN ] 2026-06-01 00:20:52.756 [576  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:21:04.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:21:04.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 00:21:04.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422151,ok=422151,error=0, records=41
[WARN ] 2026-06-01 00:21:07.761 [621  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:21:19.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:21:19.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 00:21:19.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422152,ok=422152,error=0, records=41
[INFO ] 2026-06-01 00:21:22.361 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17585/300s
[INFO ] 2026-06-01 00:21:22.362 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889288},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:21:22.530 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:21:22.530 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 00:21:22.531 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:21:22.531 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:21:22.531 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:21:22.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:21:22.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:21:22.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:21:22.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:21:22.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:21:22.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 00:21:22.767 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:21:28.737 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21118/300s
[INFO ] 2026-06-01 00:21:34.347 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:21:34.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 00:21:34.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422153,ok=422153,error=0, records=41
[WARN ] 2026-06-01 00:21:37.772 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:21:49.347 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:21:49.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 00:21:49.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422154,ok=422154,error=0, records=41
[WARN ] 2026-06-01 00:21:52.777 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:22:04.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:22:04.348 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21121/300s
[INFO ] 2026-06-01 00:22:04.531 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 00:22:04.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422155,ok=422155,error=0, records=41
[WARN ] 2026-06-01 00:22:07.783 [621  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:22:19.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:22:19.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10150, records=41
[INFO ] 2026-06-01 00:22:19.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422156,ok=422156,error=0, records=41
[WARN ] 2026-06-01 00:22:22.788 [576  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:22:34.349 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:22:34.541 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 00:22:34.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422157,ok=422157,error=0, records=41
[INFO ] 2026-06-01 00:22:36.736 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21119/300s
[WARN ] 2026-06-01 00:22:37.793 [626  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:22:38.638 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21119/300s
[INFO ] 2026-06-01 00:22:46.445 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21119/300s
[INFO ] 2026-06-01 00:22:49.349 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:22:49.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 00:22:49.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422158,ok=422158,error=0, records=41
[WARN ] 2026-06-01 00:22:52.797 [576  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:23:04.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:23:04.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 00:23:04.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422159,ok=422159,error=0, records=41
[WARN ] 2026-06-01 00:23:07.803 [627  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:23:19.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:23:19.567 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 00:23:19.567 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422160,ok=422160,error=0, records=41
[WARN ] 2026-06-01 00:23:22.808 [1255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:23:34.351 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 00:23:34.351 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 00:23:34.573 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 00:23:34.573 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422161,ok=422161,error=0, records=41
[WARN ] 2026-06-01 00:23:37.814 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:23:49.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:23:49.352 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 00:23:49.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 00:23:49.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422162,ok=422162,error=0, records=41
[WARN ] 2026-06-01 00:23:52.819 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:24:04.353 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:24:04.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-01 00:24:04.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422163,ok=422163,error=0, records=41
[WARN ] 2026-06-01 00:24:07.824 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:24:19.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:24:19.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 00:24:19.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422164,ok=422164,error=0, records=41
[INFO ] 2026-06-01 00:24:22.532 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889208},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:24:22.687 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:24:22.687 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 00:24:22.687 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:24:22.688 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:24:22.688 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:24:22.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:24:22.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:24:22.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:24:22.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:24:22.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:24:22.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 00:24:22.829 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:24:34.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:24:34.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 00:24:34.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422165,ok=422165,error=0, records=41
[WARN ] 2026-06-01 00:24:37.835 [592  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:24:47.838 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21114/300s
[INFO ] 2026-06-01 00:24:49.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=28.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:24:49.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 00:24:49.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422166,ok=422166,error=0, records=41
[INFO ] 2026-06-01 00:24:49.650 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21110/300s
[WARN ] 2026-06-01 00:24:52.840 [1314 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:25:00.573 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21123/300s
[INFO ] 2026-06-01 00:25:04.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:25:04.655 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-01 00:25:04.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422167,ok=422167,error=0, records=41
[WARN ] 2026-06-01 00:25:07.846 [1285 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:25:19.356 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:25:19.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10389, records=41
[INFO ] 2026-06-01 00:25:19.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422168,ok=422168,error=0, records=41
[WARN ] 2026-06-01 00:25:22.851 [1285 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:25:34.356 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:25:34.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 00:25:34.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422169,ok=422169,error=0, records=41
[WARN ] 2026-06-01 00:25:37.857 [1314 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:25:40.555 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21123/300s
[INFO ] 2026-06-01 00:25:49.170 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21110/300s
[INFO ] 2026-06-01 00:25:49.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:25:49.699 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 00:25:49.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422170,ok=422170,error=0, records=41
[WARN ] 2026-06-01 00:25:52.862 [1285 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:26:04.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:26:04.705 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 00:26:04.705 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422171,ok=422171,error=0, records=41
[WARN ] 2026-06-01 00:26:07.869 [1249 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 00:26:17.373 [1392 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23699/stat), No such file or directory
[WARN ] 2026-06-01 00:26:17.373 [1392 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/30541/stat), No such file or directory
[INFO ] 2026-06-01 00:26:19.358 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:26:19.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 00:26:19.711 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422172,ok=422172,error=0, records=41
[WARN ] 2026-06-01 00:26:22.874 [1249 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:26:28.790 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21119/300s
[WARN ] 2026-06-01 00:26:32.378 [1249 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23699/stat), No such file or directory
[WARN ] 2026-06-01 00:26:32.378 [1249 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/30541/stat), No such file or directory
[INFO ] 2026-06-01 00:26:34.359 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:26:34.718 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 00:26:34.718 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422173,ok=422173,error=0, records=41
[WARN ] 2026-06-01 00:26:37.878 [1392 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 00:26:47.383 [1482 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23699/stat), No such file or directory
[WARN ] 2026-06-01 00:26:47.383 [1482 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/30541/stat), No such file or directory
[INFO ] 2026-06-01 00:26:49.359 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:26:49.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 00:26:49.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422174,ok=422174,error=0, records=41
[WARN ] 2026-06-01 00:26:52.883 [1492 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:27:04.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:27:04.360 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21122/300s
[INFO ] 2026-06-01 00:27:04.728 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 00:27:04.728 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422175,ok=422175,error=0, records=41
[WARN ] 2026-06-01 00:27:07.888 [1503 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:27:19.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:27:19.735 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 00:27:19.735 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422176,ok=422176,error=0, records=41
[INFO ] 2026-06-01 00:27:22.688 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17586/300s
[INFO ] 2026-06-01 00:27:22.689 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889124},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:27:22.835 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:27:22.835 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 00:27:22.835 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:27:22.835 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:27:22.835 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:27:22.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:27:22.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:27:22.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:27:22.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:27:22.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:27:22.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 00:27:22.894 [1520 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:27:34.361 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:27:34.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 00:27:34.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422177,ok=422177,error=0, records=41
[INFO ] 2026-06-01 00:27:36.751 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21120/300s
[WARN ] 2026-06-01 00:27:37.900 [1520 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:27:38.652 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21120/300s
[INFO ] 2026-06-01 00:27:46.457 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21120/300s
[INFO ] 2026-06-01 00:27:49.361 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:27:49.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 00:27:49.748 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422178,ok=422178,error=0, records=41
[WARN ] 2026-06-01 00:27:52.906 [1555 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:28:04.362 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:28:04.752 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 00:28:04.752 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422179,ok=422179,error=0, records=41
[WARN ] 2026-06-01 00:28:07.914 [1520 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:28:19.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:28:19.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 00:28:19.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422180,ok=422180,error=0, records=41
[WARN ] 2026-06-01 00:28:22.919 [1575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:28:34.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:28:34.804 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 00:28:34.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422181,ok=422181,error=0, records=41
[WARN ] 2026-06-01 00:28:37.925 [1584 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:28:49.364 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:28:49.809 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 00:28:49.809 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422182,ok=422182,error=0, records=41
[WARN ] 2026-06-01 00:28:52.931 [1575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:29:04.364 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:29:04.816 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 00:29:04.817 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422183,ok=422183,error=0, records=41
[WARN ] 2026-06-01 00:29:07.937 [1591 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:29:19.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:29:19.822 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 00:29:19.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422184,ok=422184,error=0, records=41
[WARN ] 2026-06-01 00:29:22.942 [1643 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:29:34.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:29:34.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 00:29:34.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422185,ok=422185,error=0, records=41
[WARN ] 2026-06-01 00:29:37.948 [1631 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:29:47.951 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21115/300s
[INFO ] 2026-06-01 00:29:49.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:29:49.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 00:29:49.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422186,ok=422186,error=0, records=41
[INFO ] 2026-06-01 00:29:49.835 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21111/300s
[WARN ] 2026-06-01 00:29:52.953 [1655 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:30:00.576 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21124/300s
[INFO ] 2026-06-01 00:30:04.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:30:04.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 00:30:04.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422187,ok=422187,error=0, records=41
[WARN ] 2026-06-01 00:30:07.958 [1660 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:30:19.367 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:30:19.846 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 00:30:19.846 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422188,ok=422188,error=0, records=41
[INFO ] 2026-06-01 00:30:22.836 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20889040},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-01 00:30:22.963 [1655 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:30:23.012 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:30:23.013 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 00:30:23.013 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:30:23.013 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:30:23.013 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:30:23.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:30:23.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:30:23.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:30:23.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:30:23.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:30:23.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:30:34.367 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:30:34.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 00:30:34.850 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422189,ok=422189,error=0, records=41
[WARN ] 2026-06-01 00:30:37.968 [1679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:30:40.560 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21124/300s
[INFO ] 2026-06-01 00:30:49.325 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21111/300s
[INFO ] 2026-06-01 00:30:49.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:30:49.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 00:30:49.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422190,ok=422190,error=0, records=41
[WARN ] 2026-06-01 00:30:52.973 [1679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:31:04.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:31:04.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 00:31:04.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422191,ok=422191,error=0, records=41
[WARN ] 2026-06-01 00:31:07.978 [1655 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:31:19.369 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:31:19.869 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 00:31:19.869 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422192,ok=422192,error=0, records=41
[WARN ] 2026-06-01 00:31:22.983 [1631 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:31:28.835 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21120/300s
[INFO ] 2026-06-01 00:31:34.369 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:31:34.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 00:31:34.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422193,ok=422193,error=0, records=41
[WARN ] 2026-06-01 00:31:37.989 [1754 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:31:49.370 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:31:49.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10745, records=44
[INFO ] 2026-06-01 00:31:49.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422194,ok=422194,error=0, records=44
[WARN ] 2026-06-01 00:31:52.995 [1655 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:32:04.371 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:32:04.371 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21123/300s
[INFO ] 2026-06-01 00:32:04.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 00:32:04.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422195,ok=422195,error=0, records=41
[WARN ] 2026-06-01 00:32:08.002 [1814 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:32:19.371 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:32:19.903 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 00:32:19.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422196,ok=422196,error=0, records=41
[WARN ] 2026-06-01 00:32:23.006 [1770 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:32:34.372 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:32:34.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 00:32:34.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422197,ok=422197,error=0, records=41
[INFO ] 2026-06-01 00:32:36.758 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21121/300s
[WARN ] 2026-06-01 00:32:38.011 [1828 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:32:38.660 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21121/300s
[INFO ] 2026-06-01 00:32:46.465 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21121/300s
[INFO ] 2026-06-01 00:32:49.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:32:49.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 00:32:49.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422198,ok=422198,error=0, records=41
[WARN ] 2026-06-01 00:32:53.018 [1798 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:33:04.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:33:04.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 00:33:04.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422199,ok=422199,error=0, records=41
[WARN ] 2026-06-01 00:33:08.024 [1828 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:33:19.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:33:19.933 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 00:33:19.933 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422200,ok=422200,error=0, records=41
[INFO ] 2026-06-01 00:33:23.013 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17587/300s
[INFO ] 2026-06-01 00:33:23.014 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888976},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-01 00:33:23.029 [1842 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:33:23.182 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:33:23.182 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 00:33:23.182 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:33:23.183 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:33:23.183 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:33:23.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:33:23.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:33:23.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:33:23.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:33:23.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:33:23.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:33:34.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 00:33:34.374 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 00:33:34.939 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 00:33:34.939 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422201,ok=422201,error=0, records=41
[WARN ] 2026-06-01 00:33:38.034 [1842 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:33:49.375 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:33:49.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 00:33:49.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422202,ok=422202,error=0, records=41
[WARN ] 2026-06-01 00:33:53.039 [1915 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:34:04.376 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:34:04.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 00:34:04.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422203,ok=422203,error=0, records=41
[WARN ] 2026-06-01 00:34:08.044 [1931 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:34:19.376 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:34:19.957 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 00:34:19.957 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422204,ok=422204,error=0, records=41
[WARN ] 2026-06-01 00:34:23.050 [1905 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:34:34.377 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:34:34.962 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 00:34:34.962 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422205,ok=422205,error=0, records=41
[WARN ] 2026-06-01 00:34:37.555 [1970 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:34:48.058 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21116/300s
[INFO ] 2026-06-01 00:34:49.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:34:49.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 00:34:49.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422206,ok=422206,error=0, records=41
[INFO ] 2026-06-01 00:34:49.968 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21112/300s
[WARN ] 2026-06-01 00:34:52.560 [1905 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:35:00.578 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21125/300s
[INFO ] 2026-06-01 00:35:04.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:35:04.974 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-06-01 00:35:04.974 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422207,ok=422207,error=0, records=41
[WARN ] 2026-06-01 00:35:07.565 [2000 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:35:19.379 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:35:19.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10395, records=41
[INFO ] 2026-06-01 00:35:19.981 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422208,ok=422208,error=0, records=41
[WARN ] 2026-06-01 00:35:22.571 [2015 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:35:34.379 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:35:34.988 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10383, records=41
[INFO ] 2026-06-01 00:35:34.989 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422209,ok=422209,error=0, records=41
[WARN ] 2026-06-01 00:35:37.576 [2039 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:35:40.566 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21125/300s
[INFO ] 2026-06-01 00:35:49.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:35:49.504 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21112/300s
[INFO ] 2026-06-01 00:35:49.994 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10383, records=41
[INFO ] 2026-06-01 00:35:49.994 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422210,ok=422210,error=0, records=41
[WARN ] 2026-06-01 00:35:52.582 [2044 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:36:04.381 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:36:05.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 00:36:05.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422211,ok=422211,error=0, records=41
[WARN ] 2026-06-01 00:36:07.586 [2066 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:36:19.381 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:36:20.005 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 00:36:20.005 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422212,ok=422212,error=0, records=41
[WARN ] 2026-06-01 00:36:22.592 [1999 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:36:23.184 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888900},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:36:23.357 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:36:23.357 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 00:36:23.357 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:36:23.357 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:36:23.357 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:36:23.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:36:23.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:36:23.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:36:23.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:36:23.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:36:23.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:36:28.889 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21121/300s
[INFO ] 2026-06-01 00:36:34.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:36:35.098 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 00:36:35.098 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422213,ok=422213,error=0, records=41
[WARN ] 2026-06-01 00:36:37.597 [2102 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:36:49.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:36:50.104 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 00:36:50.105 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422214,ok=422214,error=0, records=41
[WARN ] 2026-06-01 00:36:52.601 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:37:04.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:37:04.383 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21124/300s
[INFO ] 2026-06-01 00:37:05.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10386, records=41
[INFO ] 2026-06-01 00:37:05.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422215,ok=422215,error=0, records=41
[WARN ] 2026-06-01 00:37:07.606 [2108 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:37:19.384 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:37:20.123 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=13142, records=54
[INFO ] 2026-06-01 00:37:20.123 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422216,ok=422216,error=0, records=54
[WARN ] 2026-06-01 00:37:22.611 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:37:34.385 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:37:35.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-01 00:37:35.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422217,ok=422217,error=0, records=41
[INFO ] 2026-06-01 00:37:36.804 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21122/300s
[WARN ] 2026-06-01 00:37:37.617 [1999 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:37:38.706 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21122/300s
[INFO ] 2026-06-01 00:37:46.510 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21122/300s
[INFO ] 2026-06-01 00:37:49.385 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:37:50.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-01 00:37:50.136 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422218,ok=422218,error=0, records=41
[WARN ] 2026-06-01 00:37:52.622 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:38:04.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:38:05.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 00:38:05.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422219,ok=422219,error=0, records=41
[WARN ] 2026-06-01 00:38:07.629 [2108 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:38:19.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:38:20.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 00:38:20.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422220,ok=422220,error=0, records=41
[WARN ] 2026-06-01 00:38:22.634 [1999 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:38:34.387 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:38:35.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 00:38:35.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422221,ok=422221,error=0, records=41
[WARN ] 2026-06-01 00:38:37.640 [1999 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:38:49.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:38:49.388 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 00:38:50.192 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 00:38:50.192 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422222,ok=422222,error=0, records=41
[WARN ] 2026-06-01 00:38:52.646 [1999 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:39:04.389 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:39:05.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 00:39:05.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422223,ok=422223,error=0, records=41
[WARN ] 2026-06-01 00:39:07.652 [2102 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:39:19.390 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:39:20.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 00:39:20.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422224,ok=422224,error=0, records=41
[WARN ] 2026-06-01 00:39:22.659 [2108 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:39:23.357 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17588/300s
[INFO ] 2026-06-01 00:39:23.359 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888824},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:39:23.531 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:39:23.531 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 00:39:23.531 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:39:23.531 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:39:23.531 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:39:23.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:39:23.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:39:23.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:39:23.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:39:23.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:39:23.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:39:34.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:39:35.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 00:39:35.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422225,ok=422225,error=0, records=41
[WARN ] 2026-06-01 00:39:37.664 [1999 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:39:48.167 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21117/300s
[INFO ] 2026-06-01 00:39:49.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:39:50.218 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 00:39:50.218 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422226,ok=422226,error=0, records=41
[INFO ] 2026-06-01 00:39:50.218 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21113/300s
[WARN ] 2026-06-01 00:39:52.669 [2102 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:40:00.582 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21126/300s
[INFO ] 2026-06-01 00:40:04.392 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:40:05.223 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 00:40:05.223 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422227,ok=422227,error=0, records=41
[WARN ] 2026-06-01 00:40:07.674 [2067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:40:19.393 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:40:20.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 00:40:20.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422228,ok=422228,error=0, records=41
[WARN ] 2026-06-01 00:40:22.679 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:40:34.393 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:40:35.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 00:40:35.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422229,ok=422229,error=0, records=41
[WARN ] 2026-06-01 00:40:37.685 [1999 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:40:40.572 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21126/300s
[INFO ] 2026-06-01 00:40:49.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:40:49.688 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21113/300s
[INFO ] 2026-06-01 00:40:50.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 00:40:50.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422230,ok=422230,error=0, records=41
[WARN ] 2026-06-01 00:40:52.689 [2108 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:41:04.395 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:41:05.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 00:41:05.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422231,ok=422231,error=0, records=41
[WARN ] 2026-06-01 00:41:07.695 [2067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:41:19.395 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:41:20.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-01 00:41:20.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422232,ok=422232,error=0, records=41
[WARN ] 2026-06-01 00:41:22.701 [2067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:41:28.945 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21122/300s
[INFO ] 2026-06-01 00:41:34.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:41:35.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 00:41:35.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422233,ok=422233,error=0, records=41
[WARN ] 2026-06-01 00:41:37.707 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:41:49.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:41:50.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 00:41:50.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422234,ok=422234,error=0, records=41
[WARN ] 2026-06-01 00:41:52.712 [2067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:42:04.397 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:42:04.397 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21125/300s
[INFO ] 2026-06-01 00:42:05.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 00:42:05.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422235,ok=422235,error=0, records=41
[WARN ] 2026-06-01 00:42:07.719 [2108 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:42:19.397 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:42:20.276 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 00:42:20.276 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422236,ok=422236,error=0, records=41
[WARN ] 2026-06-01 00:42:22.726 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:42:23.533 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888752},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:42:23.705 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:42:23.705 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 00:42:23.705 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:42:23.705 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:42:23.705 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:42:23.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:42:23.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:42:23.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:42:23.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:42:23.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:42:23.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:42:34.398 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:42:35.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 00:42:35.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422237,ok=422237,error=0, records=41
[INFO ] 2026-06-01 00:42:36.850 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21123/300s
[WARN ] 2026-06-01 00:42:37.730 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:42:38.751 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21123/300s
[INFO ] 2026-06-01 00:42:46.558 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21123/300s
[INFO ] 2026-06-01 00:42:49.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:42:50.289 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 00:42:50.289 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422238,ok=422238,error=0, records=41
[WARN ] 2026-06-01 00:42:52.735 [2108 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:43:04.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:43:05.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 00:43:05.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422239,ok=422239,error=0, records=41
[WARN ] 2026-06-01 00:43:07.742 [2102 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:43:19.400 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:43:20.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 00:43:20.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422240,ok=422240,error=0, records=41
[WARN ] 2026-06-01 00:43:22.747 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:43:34.400 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 00:43:34.400 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 00:43:35.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 00:43:35.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422241,ok=422241,error=0, records=41
[WARN ] 2026-06-01 00:43:37.752 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:43:49.401 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:43:50.308 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 00:43:50.308 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422242,ok=422242,error=0, records=41
[WARN ] 2026-06-01 00:43:52.759 [2108 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:44:04.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:44:05.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 00:44:05.314 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422243,ok=422243,error=0, records=41
[WARN ] 2026-06-01 00:44:07.764 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:44:19.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:44:20.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 00:44:20.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422244,ok=422244,error=0, records=41
[WARN ] 2026-06-01 00:44:22.769 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:44:34.403 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:44:35.331 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 00:44:35.331 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422245,ok=422245,error=0, records=41
[WARN ] 2026-06-01 00:44:37.774 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:44:48.277 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21118/300s
[INFO ] 2026-06-01 00:44:49.403 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:44:50.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 00:44:50.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422246,ok=422246,error=0, records=41
[INFO ] 2026-06-01 00:44:50.342 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21114/300s
[WARN ] 2026-06-01 00:44:52.778 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:45:00.584 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21127/300s
[INFO ] 2026-06-01 00:45:04.404 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:45:05.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 00:45:05.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422247,ok=422247,error=0, records=41
[WARN ] 2026-06-01 00:45:07.783 [2108 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:45:19.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:45:20.356 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 00:45:20.356 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422248,ok=422248,error=0, records=41
[WARN ] 2026-06-01 00:45:22.789 [2067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:45:23.705 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17589/300s
[INFO ] 2026-06-01 00:45:23.706 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888676},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:45:23.853 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:45:23.853 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 00:45:23.853 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:45:23.853 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:45:23.853 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:45:23.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:45:23.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:45:23.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:45:23.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:45:23.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:45:23.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:45:34.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:45:35.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 00:45:35.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422249,ok=422249,error=0, records=41
[WARN ] 2026-06-01 00:45:37.794 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:45:40.579 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21127/300s
[INFO ] 2026-06-01 00:45:49.406 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:45:49.867 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21114/300s
[INFO ] 2026-06-01 00:45:50.366 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 00:45:50.366 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422250,ok=422250,error=0, records=41
[WARN ] 2026-06-01 00:45:52.799 [2067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:46:04.406 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:46:05.373 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 00:46:05.373 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422251,ok=422251,error=0, records=41
[WARN ] 2026-06-01 00:46:07.804 [2067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:46:19.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:46:20.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 00:46:20.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422252,ok=422252,error=0, records=41
[WARN ] 2026-06-01 00:46:22.810 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:46:28.997 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21123/300s
[WARN ] 2026-06-01 00:46:32.814 [2644 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1427/stat), No such file or directory
[INFO ] 2026-06-01 00:46:34.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:46:35.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 00:46:35.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422253,ok=422253,error=0, records=41
[WARN ] 2026-06-01 00:46:37.815 [2612 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 00:46:47.319 [2644 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1427/stat), No such file or directory
[WARN ] 2026-06-01 00:46:47.319 [2644 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1428/stat), No such file or directory
[INFO ] 2026-06-01 00:46:49.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:46:50.392 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-01 00:46:50.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422254,ok=422254,error=0, records=41
[WARN ] 2026-06-01 00:46:52.819 [2612 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:47:04.409 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:47:04.409 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21126/300s
[INFO ] 2026-06-01 00:47:05.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 00:47:05.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422255,ok=422255,error=0, records=41
[WARN ] 2026-06-01 00:47:07.824 [2665 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:47:19.409 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:47:20.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 00:47:20.457 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422256,ok=422256,error=0, records=41
[WARN ] 2026-06-01 00:47:22.830 [2102 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:47:34.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:47:35.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 00:47:35.462 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422257,ok=422257,error=0, records=41
[INFO ] 2026-06-01 00:47:36.897 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21124/300s
[WARN ] 2026-06-01 00:47:37.835 [2644 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:47:38.799 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21124/300s
[INFO ] 2026-06-01 00:47:46.606 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21124/300s
[INFO ] 2026-06-01 00:47:49.411 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:47:50.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 00:47:50.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422258,ok=422258,error=0, records=41
[WARN ] 2026-06-01 00:47:52.841 [2714 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:48:04.411 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:48:05.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-01 00:48:05.477 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422259,ok=422259,error=0, records=41
[WARN ] 2026-06-01 00:48:07.845 [2728 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:48:19.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:48:20.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 00:48:20.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422260,ok=422260,error=0, records=41
[WARN ] 2026-06-01 00:48:22.850 [2714 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:48:23.855 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888592},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:48:24.031 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:48:24.031 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 00:48:24.031 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:48:24.031 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:48:24.031 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:48:24.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:48:24.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:48:24.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:48:24.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:48:24.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:48:24.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 00:48:32.353 [2760 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1431/stat), No such file or directory
[WARN ] 2026-06-01 00:48:32.353 [2760 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1432/stat), No such file or directory
[INFO ] 2026-06-01 00:48:34.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:48:35.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 00:48:35.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422261,ok=422261,error=0, records=41
[WARN ] 2026-06-01 00:48:37.853 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 00:48:47.357 [2774 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1431/stat), No such file or directory
[WARN ] 2026-06-01 00:48:47.357 [2774 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1432/stat), No such file or directory
[INFO ] 2026-06-01 00:48:49.413 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:48:50.529 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 00:48:50.529 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422262,ok=422262,error=0, records=41
[WARN ] 2026-06-01 00:48:52.859 [2742 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:49:04.414 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:49:05.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 00:49:05.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422263,ok=422263,error=0, records=41
[WARN ] 2026-06-01 00:49:07.864 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:49:19.414 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:49:20.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 00:49:20.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422264,ok=422264,error=0, records=41
[WARN ] 2026-06-01 00:49:22.869 [2742 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:49:34.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:49:35.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 00:49:35.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422265,ok=422265,error=0, records=41
[WARN ] 2026-06-01 00:49:37.873 [2788 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:49:48.376 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21119/300s
[INFO ] 2026-06-01 00:49:49.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:49:50.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 00:49:50.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422266,ok=422266,error=0, records=41
[INFO ] 2026-06-01 00:49:50.555 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21115/300s
[WARN ] 2026-06-01 00:49:52.879 [2084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:50:00.587 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21128/300s
[INFO ] 2026-06-01 00:50:04.416 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:50:05.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 00:50:05.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422267,ok=422267,error=0, records=41
[WARN ] 2026-06-01 00:50:07.885 [2845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:50:19.416 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:50:20.565 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-01 00:50:20.565 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422268,ok=422268,error=0, records=41
[WARN ] 2026-06-01 00:50:22.890 [2867 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:50:34.417 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:50:35.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-01 00:50:35.581 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422269,ok=422269,error=0, records=41
[WARN ] 2026-06-01 00:50:37.895 [2890 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:50:40.584 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21128/300s
[INFO ] 2026-06-01 00:50:49.417 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:50:50.039 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21115/300s
[INFO ] 2026-06-01 00:50:50.588 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 00:50:50.588 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422270,ok=422270,error=0, records=41
[WARN ] 2026-06-01 00:50:52.901 [2913 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:51:04.418 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:51:05.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 00:51:05.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422271,ok=422271,error=0, records=41
[WARN ] 2026-06-01 00:51:07.907 [2867 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:51:19.418 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:51:20.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 00:51:20.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422272,ok=422272,error=0, records=41
[WARN ] 2026-06-01 00:51:22.912 [2918 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:51:24.031 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17590/300s
[INFO ] 2026-06-01 00:51:24.033 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888524},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:51:24.190 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:51:24.190 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 00:51:24.190 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:51:24.190 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:51:24.190 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:51:24.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:51:24.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:51:24.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:51:24.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:51:24.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:51:24.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:51:29.050 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21124/300s
[INFO ] 2026-06-01 00:51:34.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:51:35.686 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 00:51:35.686 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422273,ok=422273,error=0, records=41
[WARN ] 2026-06-01 00:51:37.917 [2952 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:51:49.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:51:50.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 00:51:50.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422274,ok=422274,error=0, records=41
[WARN ] 2026-06-01 00:51:52.922 [2969 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:52:04.420 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:52:04.420 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21127/300s
[INFO ] 2026-06-01 00:52:05.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 00:52:05.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422275,ok=422275,error=0, records=41
[WARN ] 2026-06-01 00:52:07.929 [2946 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:52:19.420 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:52:20.701 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-01 00:52:20.701 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422276,ok=422276,error=0, records=41
[WARN ] 2026-06-01 00:52:22.935 [2946 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:52:34.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:52:35.706 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-01 00:52:35.706 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422277,ok=422277,error=0, records=41
[INFO ] 2026-06-01 00:52:36.925 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21125/300s
[WARN ] 2026-06-01 00:52:37.941 [2946 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:52:38.826 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21125/300s
[INFO ] 2026-06-01 00:52:46.630 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21125/300s
[INFO ] 2026-06-01 00:52:49.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:52:50.712 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10146, records=41
[INFO ] 2026-06-01 00:52:50.712 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422278,ok=422278,error=0, records=41
[WARN ] 2026-06-01 00:52:52.947 [3023 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:53:04.422 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:53:05.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-01 00:53:05.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422279,ok=422279,error=0, records=41
[WARN ] 2026-06-01 00:53:07.952 [3055 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:53:19.422 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:53:20.726 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 00:53:20.726 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422280,ok=422280,error=0, records=41
[WARN ] 2026-06-01 00:53:22.957 [3041 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:53:34.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 00:53:34.423 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 00:53:35.731 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10390, records=41
[INFO ] 2026-06-01 00:53:35.731 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422281,ok=422281,error=0, records=41
[WARN ] 2026-06-01 00:53:37.962 [2991 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:53:49.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:53:49.424 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 00:53:50.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-01 00:53:50.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422282,ok=422282,error=0, records=41
[WARN ] 2026-06-01 00:53:52.968 [3084 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:54:04.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:54:05.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 00:54:05.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422283,ok=422283,error=0, records=41
[WARN ] 2026-06-01 00:54:07.973 [3098 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:54:19.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:54:20.749 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 00:54:20.749 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422284,ok=422284,error=0, records=41
[WARN ] 2026-06-01 00:54:22.978 [3069 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:54:24.192 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888432},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:54:24.347 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:54:24.347 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 00:54:24.347 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:54:24.347 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:54:24.347 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:54:24.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:54:24.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:54:24.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:54:24.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:54:24.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:54:24.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:54:34.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:54:35.755 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 00:54:35.755 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422285,ok=422285,error=0, records=41
[WARN ] 2026-06-01 00:54:37.983 [3069 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:54:48.487 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21120/300s
[INFO ] 2026-06-01 00:54:49.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:54:50.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 00:54:50.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422286,ok=422286,error=0, records=41
[INFO ] 2026-06-01 00:54:50.760 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21116/300s
[WARN ] 2026-06-01 00:54:52.988 [3098 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:55:00.590 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21129/300s
[INFO ] 2026-06-01 00:55:04.427 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:55:05.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-01 00:55:05.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422287,ok=422287,error=0, records=41
[WARN ] 2026-06-01 00:55:07.994 [3098 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:55:19.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:55:20.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 00:55:20.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422288,ok=422288,error=0, records=41
[WARN ] 2026-06-01 00:55:22.999 [3098 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:55:34.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=28.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:55:35.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 00:55:35.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422289,ok=422289,error=0, records=41
[WARN ] 2026-06-01 00:55:38.004 [3069 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:55:40.590 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21129/300s
[INFO ] 2026-06-01 00:55:49.429 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:55:50.209 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21116/300s
[INFO ] 2026-06-01 00:55:50.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-01 00:55:50.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422290,ok=422290,error=0, records=41
[WARN ] 2026-06-01 00:55:53.009 [3126 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:56:04.429 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:56:05.787 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 00:56:05.787 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422291,ok=422291,error=0, records=41
[WARN ] 2026-06-01 00:56:08.014 [3126 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:56:19.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:56:20.793 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 00:56:20.793 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422292,ok=422292,error=0, records=41
[WARN ] 2026-06-01 00:56:23.018 [3126 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:56:29.091 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21125/300s
[INFO ] 2026-06-01 00:56:34.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:56:35.799 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 00:56:35.799 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422293,ok=422293,error=0, records=41
[WARN ] 2026-06-01 00:56:38.023 [3210 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:56:49.431 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:56:50.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 00:56:50.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422294,ok=422294,error=0, records=41
[WARN ] 2026-06-01 00:56:53.028 [3181 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:57:04.431 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:57:04.431 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21128/300s
[INFO ] 2026-06-01 00:57:05.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-01 00:57:05.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422295,ok=422295,error=0, records=41
[WARN ] 2026-06-01 00:57:08.033 [3210 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:57:19.432 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:57:20.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 00:57:20.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422296,ok=422296,error=0, records=41
[WARN ] 2026-06-01 00:57:23.037 [3301 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:57:24.347 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17591/300s
[INFO ] 2026-06-01 00:57:24.349 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888340},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 00:57:24.508 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 00:57:24.508 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 00:57:24.508 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 00:57:24.508 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 00:57:24.508 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 00:57:24.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:57:24.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 00:57:24.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:57:24.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 00:57:24.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:57:24.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 00:57:34.432 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:57:35.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 00:57:35.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422297,ok=422297,error=0, records=41
[INFO ] 2026-06-01 00:57:37.017 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21126/300s
[WARN ] 2026-06-01 00:57:38.041 [3098 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:57:38.918 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21126/300s
[INFO ] 2026-06-01 00:57:46.718 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21126/300s
[INFO ] 2026-06-01 00:57:49.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:57:50.832 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-01 00:57:50.832 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422298,ok=422298,error=0, records=41
[WARN ] 2026-06-01 00:57:53.045 [3314 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:58:04.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:58:05.837 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 00:58:05.837 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422299,ok=422299,error=0, records=41
[WARN ] 2026-06-01 00:58:08.050 [3340 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:58:19.434 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:58:20.843 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 00:58:20.844 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422300,ok=422300,error=0, records=41
[WARN ] 2026-06-01 00:58:22.554 [3363 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:58:34.435 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:58:35.849 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 00:58:35.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422301,ok=422301,error=0, records=41
[WARN ] 2026-06-01 00:58:37.560 [3358 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:58:49.435 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:58:50.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 00:58:50.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422302,ok=422302,error=0, records=41
[WARN ] 2026-06-01 00:58:52.564 [3404 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:59:04.436 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:59:05.860 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 00:59:05.860 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422303,ok=422303,error=0, records=41
[WARN ] 2026-06-01 00:59:07.569 [3404 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:59:19.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:59:20.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 00:59:20.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422304,ok=422304,error=0, records=41
[WARN ] 2026-06-01 00:59:22.574 [3415 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:59:34.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:59:35.870 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 00:59:35.870 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422305,ok=422305,error=0, records=41
[WARN ] 2026-06-01 00:59:37.578 [3451 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 00:59:48.582 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21121/300s
[INFO ] 2026-06-01 00:59:49.438 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 00:59:50.911 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 00:59:50.911 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422306,ok=422306,error=0, records=41
[INFO ] 2026-06-01 00:59:50.911 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21117/300s
[WARN ] 2026-06-01 00:59:52.583 [3460 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:00:00.592 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21130/300s
[INFO ] 2026-06-01 01:00:04.438 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:00:05.917 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 01:00:05.917 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422307,ok=422307,error=0, records=41
[WARN ] 2026-06-01 01:00:07.588 [3472 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:00:19.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:00:20.921 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 01:00:20.921 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422308,ok=422308,error=0, records=41
[WARN ] 2026-06-01 01:00:22.593 [3472 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:00:24.509 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888256},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:00:24.681 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:00:24.681 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 01:00:24.681 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:00:24.682 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:00:24.682 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:00:24.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:00:24.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:00:24.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:00:24.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:00:24.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:00:24.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:00:34.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:00:35.929 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 01:00:35.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422309,ok=422309,error=0, records=41
[WARN ] 2026-06-01 01:00:37.597 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:00:40.595 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21130/300s
[INFO ] 2026-06-01 01:00:49.440 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:00:50.385 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21117/300s
[INFO ] 2026-06-01 01:00:50.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 01:00:50.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422310,ok=422310,error=0, records=41
[WARN ] 2026-06-01 01:00:52.601 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:01:04.440 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:01:05.941 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-01 01:01:05.941 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422311,ok=422311,error=0, records=41
[WARN ] 2026-06-01 01:01:07.607 [3509 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:01:19.441 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:01:20.946 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 01:01:20.946 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422312,ok=422312,error=0, records=41
[WARN ] 2026-06-01 01:01:22.613 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:01:29.137 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21126/300s
[INFO ] 2026-06-01 01:01:34.441 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:01:35.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 01:01:35.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422313,ok=422313,error=0, records=41
[WARN ] 2026-06-01 01:01:37.618 [3515 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:01:49.442 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:01:50.956 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 01:01:50.956 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422314,ok=422314,error=0, records=41
[WARN ] 2026-06-01 01:01:52.623 [3515 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:02:04.442 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:02:04.442 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21129/300s
[INFO ] 2026-06-01 01:02:05.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-01 01:02:05.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422315,ok=422315,error=0, records=41
[WARN ] 2026-06-01 01:02:07.628 [3477 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:02:19.443 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:02:20.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-01 01:02:20.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422316,ok=422316,error=0, records=41
[WARN ] 2026-06-01 01:02:22.634 [3515 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:02:34.443 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:02:35.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-01 01:02:35.973 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422317,ok=422317,error=0, records=41
[INFO ] 2026-06-01 01:02:37.022 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21127/300s
[WARN ] 2026-06-01 01:02:37.639 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:02:38.924 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21127/300s
[INFO ] 2026-06-01 01:02:46.720 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21127/300s
[INFO ] 2026-06-01 01:02:49.444 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:02:50.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10135, records=41
[INFO ] 2026-06-01 01:02:50.979 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422318,ok=422318,error=0, records=41
[WARN ] 2026-06-01 01:02:52.643 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:03:04.444 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:03:05.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-01 01:03:05.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422319,ok=422319,error=0, records=41
[WARN ] 2026-06-01 01:03:07.648 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:03:19.445 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:03:20.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 01:03:20.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422320,ok=422320,error=0, records=41
[WARN ] 2026-06-01 01:03:22.653 [3466 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:03:24.682 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17592/300s
[INFO ] 2026-06-01 01:03:24.683 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888164},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:03:24.940 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:03:24.940 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-01 01:03:24.941 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:03:24.941 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:03:24.941 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:03:24.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:03:24.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:03:24.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:03:24.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:03:24.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:03:24.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:03:34.445 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 01:03:34.446 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 01:03:36.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-01 01:03:36.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422321,ok=422321,error=0, records=41
[WARN ] 2026-06-01 01:03:37.658 [3466 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:03:49.446 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:03:51.004 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-01 01:03:51.004 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422322,ok=422322,error=0, records=41
[WARN ] 2026-06-01 01:03:52.663 [3509 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:04:04.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:04:06.008 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 01:04:06.008 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422323,ok=422323,error=0, records=41
[WARN ] 2026-06-01 01:04:07.668 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:04:19.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:04:21.014 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 01:04:21.014 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422324,ok=422324,error=0, records=41
[WARN ] 2026-06-01 01:04:22.674 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:04:34.448 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:04:36.018 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10142, records=41
[INFO ] 2026-06-01 01:04:36.018 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422325,ok=422325,error=0, records=41
[WARN ] 2026-06-01 01:04:37.680 [3466 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:04:48.684 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21122/300s
[INFO ] 2026-06-01 01:04:49.448 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:04:51.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10139, records=41
[INFO ] 2026-06-01 01:04:51.023 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422326,ok=422326,error=0, records=41
[INFO ] 2026-06-01 01:04:51.023 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21118/300s
[WARN ] 2026-06-01 01:04:52.685 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:05:00.595 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21131/300s
[INFO ] 2026-06-01 01:05:04.449 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:05:06.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 01:05:06.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422327,ok=422327,error=0, records=41
[WARN ] 2026-06-01 01:05:07.690 [3477 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:05:19.449 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:05:21.034 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 01:05:21.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422328,ok=422328,error=0, records=41
[WARN ] 2026-06-01 01:05:22.696 [3509 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:05:34.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:05:36.042 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 01:05:36.042 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422329,ok=422329,error=0, records=41
[WARN ] 2026-06-01 01:05:37.701 [3515 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:05:40.600 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21131/300s
[INFO ] 2026-06-01 01:05:49.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:05:50.558 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21118/300s
[INFO ] 2026-06-01 01:05:51.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 01:05:51.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422330,ok=422330,error=0, records=41
[WARN ] 2026-06-01 01:05:52.706 [3466 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:06:04.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:06:06.052 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 01:06:06.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422331,ok=422331,error=0, records=41
[WARN ] 2026-06-01 01:06:07.711 [3509 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:06:19.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:06:21.056 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 01:06:21.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422332,ok=422332,error=0, records=41
[WARN ] 2026-06-01 01:06:22.716 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:06:24.942 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20888060},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:06:25.117 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:06:25.117 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 01:06:25.117 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:06:25.117 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:06:25.117 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:06:25.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:06:25.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:06:25.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:06:25.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:06:25.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:06:25.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:06:29.191 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21127/300s
[INFO ] 2026-06-01 01:06:34.452 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:06:36.061 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 01:06:36.061 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422333,ok=422333,error=0, records=41
[WARN ] 2026-06-01 01:06:37.720 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:06:49.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:06:51.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 01:06:51.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422334,ok=422334,error=0, records=41
[WARN ] 2026-06-01 01:06:52.725 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:07:04.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:07:04.453 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21130/300s
[INFO ] 2026-06-01 01:07:06.111 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 01:07:06.111 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422335,ok=422335,error=0, records=41
[WARN ] 2026-06-01 01:07:07.731 [3515 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:07:19.454 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:07:21.116 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 01:07:21.116 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422336,ok=422336,error=0, records=41
[WARN ] 2026-06-01 01:07:22.736 [3477 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:07:34.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:07:36.121 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 01:07:36.121 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422337,ok=422337,error=0, records=41
[INFO ] 2026-06-01 01:07:37.041 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21128/300s
[WARN ] 2026-06-01 01:07:37.741 [3477 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:07:38.943 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21128/300s
[INFO ] 2026-06-01 01:07:46.738 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21128/300s
[INFO ] 2026-06-01 01:07:49.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:07:51.127 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 01:07:51.127 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422338,ok=422338,error=0, records=41
[WARN ] 2026-06-01 01:07:52.747 [3477 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:08:04.456 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:08:06.132 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 01:08:06.132 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422339,ok=422339,error=0, records=41
[WARN ] 2026-06-01 01:08:07.753 [3509 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:08:19.456 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:08:21.138 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 01:08:21.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422340,ok=422340,error=0, records=41
[WARN ] 2026-06-01 01:08:22.759 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:08:34.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:08:36.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 01:08:36.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422341,ok=422341,error=0, records=41
[WARN ] 2026-06-01 01:08:37.763 [3466 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:08:49.458 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:08:49.458 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 01:08:51.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 01:08:51.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422342,ok=422342,error=0, records=41
[WARN ] 2026-06-01 01:08:52.767 [3466 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:09:04.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:09:06.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 01:09:06.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422343,ok=422343,error=0, records=41
[WARN ] 2026-06-01 01:09:07.772 [3515 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:09:19.460 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:09:21.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 01:09:21.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422344,ok=422344,error=0, records=41
[WARN ] 2026-06-01 01:09:22.776 [3477 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:09:25.117 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17593/300s
[INFO ] 2026-06-01 01:09:25.119 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887976},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:09:25.272 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:09:25.272 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 01:09:25.272 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:09:25.272 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:09:25.272 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:09:25.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:09:25.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:09:25.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:09:25.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:09:25.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:09:25.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:09:34.460 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:09:36.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 01:09:36.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422345,ok=422345,error=0, records=41
[WARN ] 2026-06-01 01:09:37.781 [3525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:09:48.784 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21123/300s
[INFO ] 2026-06-01 01:09:49.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:09:51.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 01:09:51.172 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422346,ok=422346,error=0, records=41
[INFO ] 2026-06-01 01:09:51.172 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21119/300s
[WARN ] 2026-06-01 01:09:52.786 [3515 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:10:00.598 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21132/300s
[INFO ] 2026-06-01 01:10:04.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:10:06.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 01:10:06.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422347,ok=422347,error=0, records=41
[WARN ] 2026-06-01 01:10:07.790 [3477 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:10:19.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:10:21.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 01:10:21.186 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422348,ok=422348,error=0, records=41
[WARN ] 2026-06-01 01:10:22.795 [3477 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:10:34.463 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:10:36.191 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 01:10:36.191 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422349,ok=422349,error=0, records=41
[WARN ] 2026-06-01 01:10:37.799 [3477 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:10:40.606 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21132/300s
[INFO ] 2026-06-01 01:10:49.463 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:10:50.742 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21119/300s
[INFO ] 2026-06-01 01:10:51.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 01:10:51.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422350,ok=422350,error=0, records=41
[WARN ] 2026-06-01 01:10:52.803 [3515 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:11:04.464 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:11:06.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 01:11:06.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422351,ok=422351,error=0, records=41
[WARN ] 2026-06-01 01:11:07.809 [4102 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:11:19.465 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:11:21.212 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 01:11:21.212 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422352,ok=422352,error=0, records=41
[WARN ] 2026-06-01 01:11:22.813 [4093 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:11:29.245 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21128/300s
[INFO ] 2026-06-01 01:11:34.465 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:11:36.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 01:11:36.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422353,ok=422353,error=0, records=41
[WARN ] 2026-06-01 01:11:37.819 [3509 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:11:49.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:11:51.221 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 01:11:51.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422354,ok=422354,error=0, records=41
[WARN ] 2026-06-01 01:11:52.824 [3509 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:12:04.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:12:04.466 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21131/300s
[INFO ] 2026-06-01 01:12:06.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 01:12:06.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422355,ok=422355,error=0, records=41
[WARN ] 2026-06-01 01:12:07.829 [3509 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:12:19.467 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:12:21.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 01:12:21.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422356,ok=422356,error=0, records=41
[WARN ] 2026-06-01 01:12:22.835 [4167 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:12:25.274 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887896},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:12:25.447 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:12:25.448 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 01:12:25.448 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:12:25.448 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:12:25.448 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:12:25.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:12:25.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:12:25.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:12:25.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:12:25.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:12:25.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:12:34.468 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:12:36.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 01:12:36.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422357,ok=422357,error=0, records=41
[INFO ] 2026-06-01 01:12:37.081 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21129/300s
[WARN ] 2026-06-01 01:12:37.840 [3509 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:12:38.983 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21129/300s
[INFO ] 2026-06-01 01:12:46.779 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21129/300s
[INFO ] 2026-06-01 01:12:49.468 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:12:51.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 01:12:51.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422358,ok=422358,error=0, records=41
[WARN ] 2026-06-01 01:12:52.846 [4133 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:13:04.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:13:06.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 01:13:06.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422359,ok=422359,error=0, records=41
[WARN ] 2026-06-01 01:13:07.851 [4133 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:13:19.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:13:21.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 01:13:21.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422360,ok=422360,error=0, records=41
[WARN ] 2026-06-01 01:13:22.856 [3509 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:13:34.470 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 01:13:34.470 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 01:13:36.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 01:13:36.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422361,ok=422361,error=0, records=41
[WARN ] 2026-06-01 01:13:37.861 [4133 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:13:49.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:13:51.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 01:13:51.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422362,ok=422362,error=0, records=41
[WARN ] 2026-06-01 01:13:52.867 [4153 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:14:04.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:14:06.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 01:14:06.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422363,ok=422363,error=0, records=41
[WARN ] 2026-06-01 01:14:07.873 [4260 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:14:19.472 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:14:21.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 01:14:21.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422364,ok=422364,error=0, records=41
[WARN ] 2026-06-01 01:14:22.878 [4167 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:14:34.472 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:14:36.317 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 01:14:36.317 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422365,ok=422365,error=0, records=41
[WARN ] 2026-06-01 01:14:37.882 [4306 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:14:48.887 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21124/300s
[INFO ] 2026-06-01 01:14:49.473 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:14:51.323 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 01:14:51.323 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422366,ok=422366,error=0, records=41
[INFO ] 2026-06-01 01:14:51.323 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21120/300s
[WARN ] 2026-06-01 01:14:52.888 [4323 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:15:00.601 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21133/300s
[INFO ] 2026-06-01 01:15:04.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:15:06.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 01:15:06.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422367,ok=422367,error=0, records=41
[WARN ] 2026-06-01 01:15:07.893 [4335 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:15:19.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:15:21.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 01:15:21.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422368,ok=422368,error=0, records=41
[WARN ] 2026-06-01 01:15:22.899 [4307 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:15:25.448 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17594/300s
[INFO ] 2026-06-01 01:15:25.449 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887816},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:15:25.602 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:15:25.602 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 01:15:25.602 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:15:25.602 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:15:25.602 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:15:25.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:15:25.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:15:25.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:15:25.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:15:25.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:15:25.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:15:34.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:15:36.339 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 01:15:36.339 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422369,ok=422369,error=0, records=41
[WARN ] 2026-06-01 01:15:37.905 [4376 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:15:40.612 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21133/300s
[INFO ] 2026-06-01 01:15:49.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:15:50.917 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21120/300s
[INFO ] 2026-06-01 01:15:51.345 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 01:15:51.345 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422370,ok=422370,error=0, records=41
[WARN ] 2026-06-01 01:15:52.911 [4352 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:16:04.476 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:16:06.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 01:16:06.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422371,ok=422371,error=0, records=41
[WARN ] 2026-06-01 01:16:07.917 [4341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:16:19.476 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:16:21.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 01:16:21.355 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422372,ok=422372,error=0, records=41
[WARN ] 2026-06-01 01:16:22.922 [4423 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:16:29.294 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21129/300s
[INFO ] 2026-06-01 01:16:34.477 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:16:36.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 01:16:36.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422373,ok=422373,error=0, records=41
[WARN ] 2026-06-01 01:16:37.928 [4423 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:16:49.477 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:16:51.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 01:16:51.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422374,ok=422374,error=0, records=41
[WARN ] 2026-06-01 01:16:52.933 [4446 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:17:04.478 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:17:04.478 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21132/300s
[INFO ] 2026-06-01 01:17:06.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-01 01:17:06.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422375,ok=422375,error=0, records=41
[WARN ] 2026-06-01 01:17:07.939 [4472 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:17:19.479 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:17:21.377 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-01 01:17:21.377 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422376,ok=422376,error=0, records=41
[WARN ] 2026-06-01 01:17:22.945 [4495 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:17:34.479 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:17:36.382 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 01:17:36.382 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422377,ok=422377,error=0, records=41
[INFO ] 2026-06-01 01:17:37.115 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21130/300s
[WARN ] 2026-06-01 01:17:37.950 [4479 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:17:39.016 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21130/300s
[INFO ] 2026-06-01 01:17:46.810 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21130/300s
[INFO ] 2026-06-01 01:17:49.480 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:17:51.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-01 01:17:51.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422378,ok=422378,error=0, records=41
[WARN ] 2026-06-01 01:17:52.955 [4463 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:18:04.480 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:18:06.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 01:18:06.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422379,ok=422379,error=0, records=41
[WARN ] 2026-06-01 01:18:07.959 [4532 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:18:19.481 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:18:21.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 01:18:21.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422380,ok=422380,error=0, records=41
[WARN ] 2026-06-01 01:18:22.963 [4495 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:18:25.604 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887740},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:18:25.774 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:18:25.774 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 01:18:25.774 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:18:25.774 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:18:25.774 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:18:25.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:18:25.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:18:25.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:18:25.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:18:25.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:18:25.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:18:34.481 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:18:36.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 01:18:36.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422381,ok=422381,error=0, records=41
[WARN ] 2026-06-01 01:18:37.968 [4472 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:18:49.482 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:18:51.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 01:18:51.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422382,ok=422382,error=0, records=41
[WARN ] 2026-06-01 01:18:52.972 [4472 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:19:04.482 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:19:06.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-01 01:19:06.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422383,ok=422383,error=0, records=41
[WARN ] 2026-06-01 01:19:07.978 [4575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:19:19.483 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:19:21.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 01:19:21.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422384,ok=422384,error=0, records=41
[WARN ] 2026-06-01 01:19:22.982 [4532 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:19:34.483 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:19:36.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-01 01:19:36.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422385,ok=422385,error=0, records=41
[WARN ] 2026-06-01 01:19:37.987 [4618 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:19:48.991 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21125/300s
[INFO ] 2026-06-01 01:19:49.484 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:19:51.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10378, records=41
[INFO ] 2026-06-01 01:19:51.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422386,ok=422386,error=0, records=41
[INFO ] 2026-06-01 01:19:51.438 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21121/300s
[WARN ] 2026-06-01 01:19:52.992 [4603 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:20:00.604 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21134/300s
[INFO ] 2026-06-01 01:20:04.485 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:20:06.443 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 01:20:06.443 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422387,ok=422387,error=0, records=41
[WARN ] 2026-06-01 01:20:07.999 [4532 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:20:19.485 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:20:21.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 01:20:21.448 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422388,ok=422388,error=0, records=41
[WARN ] 2026-06-01 01:20:23.004 [4651 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:20:34.486 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:20:36.453 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 01:20:36.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422389,ok=422389,error=0, records=41
[WARN ] 2026-06-01 01:20:38.009 [4679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:20:40.618 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21134/300s
[INFO ] 2026-06-01 01:20:49.487 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:20:51.098 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21121/300s
[INFO ] 2026-06-01 01:20:51.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 01:20:51.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422390,ok=422390,error=0, records=41
[WARN ] 2026-06-01 01:20:53.014 [4665 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:21:04.487 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:21:06.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-06-01 01:21:06.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422391,ok=422391,error=0, records=41
[WARN ] 2026-06-01 01:21:08.019 [4693 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:21:19.488 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:21:21.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 01:21:21.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422392,ok=422392,error=0, records=41
[WARN ] 2026-06-01 01:21:23.024 [4693 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:21:25.774 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17595/300s
[INFO ] 2026-06-01 01:21:25.776 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887664},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:21:25.934 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:21:25.934 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 01:21:25.934 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:21:25.934 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:21:25.934 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:21:25.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:21:25.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:21:25.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:21:25.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:21:25.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:21:25.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:21:29.347 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21130/300s
[INFO ] 2026-06-01 01:21:34.488 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:21:36.473 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 01:21:36.473 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422393,ok=422393,error=0, records=41
[WARN ] 2026-06-01 01:21:38.029 [4618 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:21:49.489 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:21:51.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-01 01:21:51.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422394,ok=422394,error=0, records=41
[WARN ] 2026-06-01 01:21:53.036 [4665 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:22:04.489 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:22:04.490 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21133/300s
[INFO ] 2026-06-01 01:22:06.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 01:22:06.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422395,ok=422395,error=0, records=41
[WARN ] 2026-06-01 01:22:08.040 [4768 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:22:19.490 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:22:21.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 01:22:21.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422396,ok=422396,error=0, records=41
[WARN ] 2026-06-01 01:22:23.044 [4791 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:22:34.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:22:36.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 01:22:36.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422397,ok=422397,error=0, records=41
[INFO ] 2026-06-01 01:22:37.162 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21131/300s
[WARN ] 2026-06-01 01:22:38.049 [4802 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:22:39.064 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21131/300s
[INFO ] 2026-06-01 01:22:46.850 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21131/300s
[INFO ] 2026-06-01 01:22:49.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:22:51.501 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-01 01:22:51.501 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422398,ok=422398,error=0, records=41
[WARN ] 2026-06-01 01:22:53.053 [4810 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:23:04.492 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:23:06.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 01:23:06.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422399,ok=422399,error=0, records=41
[WARN ] 2026-06-01 01:23:07.557 [4827 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:23:19.492 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:23:21.517 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 01:23:21.517 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422400,ok=422400,error=0, records=41
[WARN ] 2026-06-01 01:23:22.562 [4827 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:23:34.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 01:23:34.493 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 01:23:36.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 01:23:36.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422401,ok=422401,error=0, records=41
[WARN ] 2026-06-01 01:23:37.567 [4856 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:23:49.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:23:49.493 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 01:23:51.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 01:23:51.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422402,ok=422402,error=0, records=41
[WARN ] 2026-06-01 01:23:52.571 [4856 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:24:04.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:24:06.531 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 01:24:06.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422403,ok=422403,error=0, records=41
[WARN ] 2026-06-01 01:24:07.575 [4903 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:24:19.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:24:21.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 01:24:21.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422404,ok=422404,error=0, records=41
[WARN ] 2026-06-01 01:24:22.581 [4891 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:24:25.936 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887572},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:24:26.098 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:24:26.098 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 01:24:26.098 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:24:26.098 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:24:26.098 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:24:26.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:24:26.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:24:26.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:24:26.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:24:26.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:24:26.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:24:34.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:24:36.541 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 01:24:36.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422405,ok=422405,error=0, records=41
[WARN ] 2026-06-01 01:24:37.587 [4939 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:24:49.090 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21126/300s
[INFO ] 2026-06-01 01:24:49.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:24:51.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 01:24:51.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422406,ok=422406,error=0, records=41
[INFO ] 2026-06-01 01:24:51.547 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21122/300s
[WARN ] 2026-06-01 01:24:52.591 [4951 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:25:00.607 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21135/300s
[INFO ] 2026-06-01 01:25:04.497 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:25:06.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 01:25:06.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422407,ok=422407,error=0, records=41
[WARN ] 2026-06-01 01:25:07.597 [4920 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:25:19.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:25:21.641 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11378, records=45
[INFO ] 2026-06-01 01:25:21.641 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422408,ok=422408,error=0, records=45
[WARN ] 2026-06-01 01:25:22.602 [4951 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:25:34.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:25:36.646 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 01:25:36.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422409,ok=422409,error=0, records=41
[WARN ] 2026-06-01 01:25:37.606 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:25:40.623 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21135/300s
[INFO ] 2026-06-01 01:25:49.499 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:25:51.276 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21122/300s
[INFO ] 2026-06-01 01:25:51.651 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 01:25:51.651 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422410,ok=422410,error=0, records=41
[WARN ] 2026-06-01 01:25:52.612 [4979 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:26:04.499 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:26:06.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 01:26:06.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422411,ok=422411,error=0, records=41
[WARN ] 2026-06-01 01:26:07.617 [4920 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:26:19.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:26:21.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-01 01:26:21.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422412,ok=422412,error=0, records=41
[WARN ] 2026-06-01 01:26:22.623 [4951 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:26:29.392 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21131/300s
[INFO ] 2026-06-01 01:26:34.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:26:36.677 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 01:26:36.677 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422413,ok=422413,error=0, records=41
[WARN ] 2026-06-01 01:26:37.629 [4920 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:26:49.501 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:26:51.682 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 01:26:51.682 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422414,ok=422414,error=0, records=41
[WARN ] 2026-06-01 01:26:52.634 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:27:04.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:27:04.502 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21134/300s
[INFO ] 2026-06-01 01:27:06.686 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-01 01:27:06.686 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422415,ok=422415,error=0, records=41
[WARN ] 2026-06-01 01:27:07.640 [4920 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:27:19.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:27:21.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-01 01:27:21.691 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422416,ok=422416,error=0, records=41
[WARN ] 2026-06-01 01:27:22.645 [4920 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:27:26.098 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17596/300s
[INFO ] 2026-06-01 01:27:26.101 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887492},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:27:26.255 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:27:26.255 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 01:27:26.255 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:27:26.256 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:27:26.256 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:27:26.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:27:26.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:27:26.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:27:26.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:27:26.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:27:26.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:27:34.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:27:36.706 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10387, records=41
[INFO ] 2026-06-01 01:27:36.706 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422417,ok=422417,error=0, records=41
[INFO ] 2026-06-01 01:27:37.186 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21132/300s
[WARN ] 2026-06-01 01:27:37.650 [4942 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:27:39.087 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21132/300s
[INFO ] 2026-06-01 01:27:46.877 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21132/300s
[INFO ] 2026-06-01 01:27:49.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:27:51.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10372, records=41
[INFO ] 2026-06-01 01:27:51.711 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422418,ok=422418,error=0, records=41
[WARN ] 2026-06-01 01:27:52.656 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:28:04.504 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:28:06.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 01:28:06.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422419,ok=422419,error=0, records=41
[WARN ] 2026-06-01 01:28:07.662 [4920 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:28:19.505 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:28:21.722 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 01:28:21.722 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422420,ok=422420,error=0, records=41
[WARN ] 2026-06-01 01:28:22.667 [4979 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:28:34.505 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:28:36.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10142, records=41
[INFO ] 2026-06-01 01:28:36.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422421,ok=422421,error=0, records=41
[WARN ] 2026-06-01 01:28:37.672 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:28:49.506 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:28:51.734 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-01 01:28:51.734 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422422,ok=422422,error=0, records=41
[WARN ] 2026-06-01 01:28:52.677 [4942 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:29:04.506 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:29:06.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 01:29:06.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422423,ok=422423,error=0, records=41
[WARN ] 2026-06-01 01:29:07.683 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:29:19.507 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:29:21.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 01:29:21.748 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422424,ok=422424,error=0, records=41
[WARN ] 2026-06-01 01:29:22.688 [4951 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:29:34.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:29:36.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 01:29:36.754 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422425,ok=422425,error=0, records=41
[WARN ] 2026-06-01 01:29:37.694 [4920 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:29:49.197 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21127/300s
[INFO ] 2026-06-01 01:29:49.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:29:51.759 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 01:29:51.759 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422426,ok=422426,error=0, records=41
[INFO ] 2026-06-01 01:29:51.759 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21123/300s
[WARN ] 2026-06-01 01:29:52.699 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:30:00.610 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21136/300s
[INFO ] 2026-06-01 01:30:04.509 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:30:06.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 01:30:06.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422427,ok=422427,error=0, records=41
[WARN ] 2026-06-01 01:30:07.705 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:30:19.509 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:30:21.769 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 01:30:21.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422428,ok=422428,error=0, records=41
[WARN ] 2026-06-01 01:30:22.710 [4942 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:30:26.258 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887412},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:30:26.436 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:30:26.436 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 01:30:26.437 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:30:26.437 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:30:26.437 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:30:26.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:30:26.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:30:26.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:30:26.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:30:26.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:30:26.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:30:34.510 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:30:36.774 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 01:30:36.774 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422429,ok=422429,error=0, records=41
[WARN ] 2026-06-01 01:30:37.715 [4920 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:30:40.629 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21136/300s
[INFO ] 2026-06-01 01:30:49.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:30:51.453 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21123/300s
[INFO ] 2026-06-01 01:30:51.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 01:30:51.783 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422430,ok=422430,error=0, records=41
[WARN ] 2026-06-01 01:30:52.720 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:31:04.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:31:06.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 01:31:06.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422431,ok=422431,error=0, records=41
[WARN ] 2026-06-01 01:31:07.725 [4942 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:31:19.512 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:31:21.818 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-01 01:31:21.818 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422432,ok=422432,error=0, records=41
[WARN ] 2026-06-01 01:31:22.731 [4942 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:31:29.444 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21132/300s
[INFO ] 2026-06-01 01:31:34.512 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:31:36.822 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 01:31:36.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422433,ok=422433,error=0, records=41
[WARN ] 2026-06-01 01:31:37.737 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:31:49.513 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:31:51.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 01:31:51.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422434,ok=422434,error=0, records=41
[WARN ] 2026-06-01 01:31:52.742 [4979 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:32:04.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:32:04.514 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21135/300s
[INFO ] 2026-06-01 01:32:06.833 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 01:32:06.833 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422435,ok=422435,error=0, records=41
[WARN ] 2026-06-01 01:32:07.747 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 01:32:17.752 [4979 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23645/stat), No such file or directory
[INFO ] 2026-06-01 01:32:19.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:32:21.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 01:32:21.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422436,ok=422436,error=0, records=41
[WARN ] 2026-06-01 01:32:22.753 [4942 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 01:32:32.757 [4942 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5271/stat), No such file or directory
[WARN ] 2026-06-01 01:32:32.757 [4942 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23645/stat), No such file or directory
[WARN ] 2026-06-01 01:32:32.757 [4942 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1429/stat), No such file or directory
[WARN ] 2026-06-01 01:32:32.757 [4942 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/2633/stat), No such file or directory
[INFO ] 2026-06-01 01:32:34.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:32:36.849 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 01:32:36.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422437,ok=422437,error=0, records=41
[INFO ] 2026-06-01 01:32:37.228 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21133/300s
[WARN ] 2026-06-01 01:32:37.758 [4942 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:32:39.134 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21133/300s
[INFO ] 2026-06-01 01:32:46.918 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21133/300s
[WARN ] 2026-06-01 01:32:47.762 [4951 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5271/stat), No such file or directory
[WARN ] 2026-06-01 01:32:47.763 [4951 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23645/stat), No such file or directory
[WARN ] 2026-06-01 01:32:47.763 [4951 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1429/stat), No such file or directory
[WARN ] 2026-06-01 01:32:47.763 [4951 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/2633/stat), No such file or directory
[INFO ] 2026-06-01 01:32:49.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:32:51.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 01:32:51.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422438,ok=422438,error=0, records=41
[WARN ] 2026-06-01 01:32:52.763 [4951 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:33:04.516 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:33:06.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 01:33:06.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422439,ok=422439,error=0, records=41
[WARN ] 2026-06-01 01:33:07.769 [4942 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 01:33:17.773 [4972 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5375/stat), No such file or directory
[WARN ] 2026-06-01 01:33:17.773 [4972 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5347/stat), No such file or directory
[WARN ] 2026-06-01 01:33:17.773 [4972 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5362/stat), No such file or directory
[WARN ] 2026-06-01 01:33:17.773 [4972 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5381/stat), No such file or directory
[WARN ] 2026-06-01 01:33:17.773 [4972 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/2744/stat), No such file or directory
[INFO ] 2026-06-01 01:33:19.516 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:33:21.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 01:33:21.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422440,ok=422440,error=0, records=41
[WARN ] 2026-06-01 01:33:22.773 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:33:26.437 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17597/300s
[INFO ] 2026-06-01 01:33:26.438 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887272},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:33:26.604 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:33:26.604 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 01:33:26.604 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:33:26.604 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:33:26.604 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:33:26.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:33:26.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:33:26.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:33:26.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:33:26.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:33:26.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 01:33:32.777 [4979 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5375/stat), No such file or directory
[WARN ] 2026-06-01 01:33:32.777 [4979 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5347/stat), No such file or directory
[WARN ] 2026-06-01 01:33:32.777 [4979 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5362/stat), No such file or directory
[WARN ] 2026-06-01 01:33:32.777 [4979 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5381/stat), No such file or directory
[WARN ] 2026-06-01 01:33:32.777 [4979 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/2744/stat), No such file or directory
[INFO ] 2026-06-01 01:33:34.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 01:33:34.517 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 01:33:36.873 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 01:33:36.873 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422441,ok=422441,error=0, records=41
[WARN ] 2026-06-01 01:33:37.778 [4951 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 01:33:47.782 [4972 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5375/stat), No such file or directory
[WARN ] 2026-06-01 01:33:47.782 [4972 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5347/stat), No such file or directory
[WARN ] 2026-06-01 01:33:47.782 [4972 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5362/stat), No such file or directory
[WARN ] 2026-06-01 01:33:47.782 [4972 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5381/stat), No such file or directory
[WARN ] 2026-06-01 01:33:47.782 [4972 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/2744/stat), No such file or directory
[INFO ] 2026-06-01 01:33:49.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:33:51.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 01:33:51.877 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422442,ok=422442,error=0, records=41
[WARN ] 2026-06-01 01:33:52.783 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:34:04.518 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:34:06.882 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 01:34:06.882 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422443,ok=422443,error=0, records=41
[WARN ] 2026-06-01 01:34:07.789 [4920 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:34:19.519 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:34:21.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 01:34:21.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422444,ok=422444,error=0, records=41
[WARN ] 2026-06-01 01:34:22.794 [4979 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:34:34.519 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:34:36.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 01:34:36.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422445,ok=422445,error=0, records=41
[WARN ] 2026-06-01 01:34:37.799 [4920 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:34:49.302 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21128/300s
[INFO ] 2026-06-01 01:34:49.520 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:34:51.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 01:34:51.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422446,ok=422446,error=0, records=41
[INFO ] 2026-06-01 01:34:51.913 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21124/300s
[WARN ] 2026-06-01 01:34:52.803 [4972 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:35:00.613 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21137/300s
[INFO ] 2026-06-01 01:35:04.520 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:35:06.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 01:35:06.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422447,ok=422447,error=0, records=41
[WARN ] 2026-06-01 01:35:07.808 [5631 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:35:19.521 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:35:21.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 01:35:21.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422448,ok=422448,error=0, records=41
[WARN ] 2026-06-01 01:35:22.813 [5646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:35:34.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:35:36.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 01:35:36.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422449,ok=422449,error=0, records=41
[WARN ] 2026-06-01 01:35:37.819 [5646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:35:40.635 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21137/300s
[INFO ] 2026-06-01 01:35:49.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:35:51.631 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21124/300s
[INFO ] 2026-06-01 01:35:51.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 01:35:51.936 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422450,ok=422450,error=0, records=41
[WARN ] 2026-06-01 01:35:52.824 [5646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:36:04.523 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:36:06.941 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 01:36:06.941 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422451,ok=422451,error=0, records=41
[WARN ] 2026-06-01 01:36:07.829 [5631 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:36:19.523 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:36:21.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 01:36:21.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422452,ok=422452,error=0, records=41
[WARN ] 2026-06-01 01:36:22.834 [5640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:36:26.605 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887196},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:36:26.768 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:36:26.768 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 01:36:26.768 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:36:26.768 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:36:26.768 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:36:26.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:36:26.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:36:26.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:36:26.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:36:26.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:36:26.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:36:29.491 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21133/300s
[INFO ] 2026-06-01 01:36:34.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:36:36.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 01:36:36.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422453,ok=422453,error=0, records=41
[WARN ] 2026-06-01 01:36:37.840 [5704 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:36:49.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:36:51.958 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 01:36:51.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422454,ok=422454,error=0, records=41
[WARN ] 2026-06-01 01:36:52.845 [5726 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:37:04.525 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:37:04.525 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21136/300s
[INFO ] 2026-06-01 01:37:06.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 01:37:06.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422455,ok=422455,error=0, records=41
[WARN ] 2026-06-01 01:37:07.850 [5726 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:37:19.526 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:37:21.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 01:37:21.970 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422456,ok=422456,error=0, records=41
[WARN ] 2026-06-01 01:37:22.856 [5740 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:37:34.526 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:37:36.975 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-01 01:37:36.975 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422457,ok=422457,error=0, records=41
[INFO ] 2026-06-01 01:37:37.278 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21134/300s
[WARN ] 2026-06-01 01:37:37.862 [5754 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:37:39.180 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21134/300s
[INFO ] 2026-06-01 01:37:46.955 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21134/300s
[INFO ] 2026-06-01 01:37:49.527 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:37:51.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 01:37:51.979 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422458,ok=422458,error=0, records=41
[WARN ] 2026-06-01 01:37:52.867 [5783 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:38:04.527 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:38:06.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 01:38:06.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422459,ok=422459,error=0, records=41
[WARN ] 2026-06-01 01:38:07.873 [5726 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:38:19.528 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:38:21.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 01:38:21.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422460,ok=422460,error=0, records=41
[WARN ] 2026-06-01 01:38:22.878 [5819 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:38:34.528 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:38:36.997 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 01:38:36.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422461,ok=422461,error=0, records=41
[WARN ] 2026-06-01 01:38:37.884 [5812 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:38:49.529 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:38:49.529 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 01:38:52.007 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 01:38:52.007 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422462,ok=422462,error=0, records=41
[WARN ] 2026-06-01 01:38:52.888 [5812 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:39:04.530 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:39:07.012 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 01:39:07.012 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422463,ok=422463,error=0, records=41
[WARN ] 2026-06-01 01:39:07.893 [5870 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:39:19.531 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:39:22.018 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 01:39:22.018 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422464,ok=422464,error=0, records=41
[WARN ] 2026-06-01 01:39:22.898 [5881 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:39:26.768 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17598/300s
[INFO ] 2026-06-01 01:39:26.770 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887116},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:39:26.940 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:39:26.940 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 01:39:26.940 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:39:26.940 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:39:26.940 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:39:26.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:39:26.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:39:26.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:39:26.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:39:26.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:39:26.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:39:34.532 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:39:37.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 01:39:37.023 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422465,ok=422465,error=0, records=41
[WARN ] 2026-06-01 01:39:37.904 [5905 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:39:49.408 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21129/300s
[INFO ] 2026-06-01 01:39:49.532 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:39:52.033 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 01:39:52.033 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422466,ok=422466,error=0, records=41
[INFO ] 2026-06-01 01:39:52.033 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21125/300s
[WARN ] 2026-06-01 01:39:52.910 [5870 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:40:00.617 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21138/300s
[INFO ] 2026-06-01 01:40:04.533 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:40:07.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 01:40:07.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422467,ok=422467,error=0, records=41
[WARN ] 2026-06-01 01:40:07.916 [5938 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:40:19.533 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:40:22.051 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 01:40:22.051 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422468,ok=422468,error=0, records=41
[WARN ] 2026-06-01 01:40:22.921 [5938 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:40:34.534 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:40:37.056 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 01:40:37.057 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422469,ok=422469,error=0, records=41
[WARN ] 2026-06-01 01:40:37.927 [5976 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:40:40.641 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21138/300s
[INFO ] 2026-06-01 01:40:49.535 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:40:51.810 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21125/300s
[INFO ] 2026-06-01 01:40:52.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 01:40:52.063 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422470,ok=422470,error=0, records=41
[WARN ] 2026-06-01 01:40:52.933 [5976 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:41:04.535 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:41:07.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 01:41:07.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422471,ok=422471,error=0, records=41
[WARN ] 2026-06-01 01:41:07.939 [6006 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:41:19.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:41:22.074 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 01:41:22.074 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422472,ok=422472,error=0, records=41
[WARN ] 2026-06-01 01:41:22.944 [6000 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:41:29.544 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21134/300s
[INFO ] 2026-06-01 01:41:34.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:41:37.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 01:41:37.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422473,ok=422473,error=0, records=41
[WARN ] 2026-06-01 01:41:37.950 [6006 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:41:49.537 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:41:52.084 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 01:41:52.084 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422474,ok=422474,error=0, records=41
[WARN ] 2026-06-01 01:41:52.954 [6016 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:42:04.538 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:42:04.538 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21137/300s
[INFO ] 2026-06-01 01:42:07.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 01:42:07.089 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422475,ok=422475,error=0, records=41
[WARN ] 2026-06-01 01:42:07.959 [6006 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:42:19.538 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:42:22.096 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 01:42:22.096 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422476,ok=422476,error=0, records=41
[WARN ] 2026-06-01 01:42:22.963 [6027 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:42:26.941 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20887036},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:42:27.113 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:42:27.113 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 01:42:27.113 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:42:27.114 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:42:27.114 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:42:27.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:42:27.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:42:27.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:42:27.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:42:27.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:42:27.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:42:34.539 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:42:37.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 01:42:37.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422477,ok=422477,error=0, records=41
[INFO ] 2026-06-01 01:42:37.333 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21135/300s
[WARN ] 2026-06-01 01:42:37.969 [6091 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:42:39.235 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21135/300s
[INFO ] 2026-06-01 01:42:46.996 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21135/300s
[INFO ] 2026-06-01 01:42:49.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:42:52.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 01:42:52.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422478,ok=422478,error=0, records=41
[WARN ] 2026-06-01 01:42:52.974 [6016 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:43:04.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:43:07.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 01:43:07.114 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422479,ok=422479,error=0, records=41
[WARN ] 2026-06-01 01:43:07.978 [6027 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:43:19.541 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:43:22.123 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 01:43:22.124 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422480,ok=422480,error=0, records=41
[WARN ] 2026-06-01 01:43:22.983 [6105 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:43:34.541 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 01:43:34.541 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 01:43:37.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 01:43:37.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422481,ok=422481,error=0, records=41
[WARN ] 2026-06-01 01:43:37.988 [6076 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:43:49.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:43:52.162 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 01:43:52.162 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422482,ok=422482,error=0, records=41
[WARN ] 2026-06-01 01:43:52.992 [6119 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:44:04.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:44:07.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 01:44:07.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422483,ok=422483,error=0, records=41
[WARN ] 2026-06-01 01:44:07.997 [6166 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 01:44:17.501 [6166 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5447/stat), No such file or directory
[WARN ] 2026-06-01 01:44:17.502 [6166 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5349/stat), No such file or directory
[WARN ] 2026-06-01 01:44:17.502 [6166 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5438/stat), No such file or directory
[INFO ] 2026-06-01 01:44:19.543 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:44:22.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 01:44:22.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422484,ok=422484,error=0, records=41
[WARN ] 2026-06-01 01:44:23.002 [6105 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 01:44:32.506 [6221 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5447/stat), No such file or directory
[WARN ] 2026-06-01 01:44:32.507 [6221 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5349/stat), No such file or directory
[WARN ] 2026-06-01 01:44:32.507 [6221 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5438/stat), No such file or directory
[INFO ] 2026-06-01 01:44:34.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:44:37.223 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 01:44:37.223 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422485,ok=422485,error=0, records=41
[WARN ] 2026-06-01 01:44:38.008 [6076 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 01:44:47.511 [6221 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5447/stat), No such file or directory
[WARN ] 2026-06-01 01:44:47.512 [6221 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5349/stat), No such file or directory
[WARN ] 2026-06-01 01:44:47.513 [6221 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5438/stat), No such file or directory
[INFO ] 2026-06-01 01:44:49.511 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21130/300s
[INFO ] 2026-06-01 01:44:49.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:44:52.228 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 01:44:52.228 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422486,ok=422486,error=0, records=41
[INFO ] 2026-06-01 01:44:52.228 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21126/300s
[WARN ] 2026-06-01 01:44:53.013 [6221 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:45:00.620 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21139/300s
[INFO ] 2026-06-01 01:45:04.545 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:45:07.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 01:45:07.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422487,ok=422487,error=0, records=41
[WARN ] 2026-06-01 01:45:08.019 [6235 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:45:19.545 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:45:22.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 01:45:22.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422488,ok=422488,error=0, records=41
[WARN ] 2026-06-01 01:45:23.024 [6016 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:45:27.114 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17599/300s
[INFO ] 2026-06-01 01:45:27.116 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20886944},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:45:27.339 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:45:27.339 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 01:45:27.339 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:45:27.339 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:45:27.339 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:45:27.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:45:27.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:45:27.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:45:27.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:45:27.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:45:27.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:45:34.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:45:37.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 01:45:37.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422489,ok=422489,error=0, records=41
[WARN ] 2026-06-01 01:45:38.030 [6221 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:45:40.647 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21139/300s
[INFO ] 2026-06-01 01:45:49.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:45:51.991 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21126/300s
[INFO ] 2026-06-01 01:45:52.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 01:45:52.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422490,ok=422490,error=0, records=41
[WARN ] 2026-06-01 01:45:53.035 [6292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:46:04.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:46:07.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 01:46:07.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422491,ok=422491,error=0, records=41
[WARN ] 2026-06-01 01:46:08.039 [6299 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:46:19.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:46:22.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 01:46:22.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422492,ok=422492,error=0, records=41
[WARN ] 2026-06-01 01:46:23.045 [6328 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:46:29.596 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21135/300s
[INFO ] 2026-06-01 01:46:34.548 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:46:37.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 01:46:37.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422493,ok=422493,error=0, records=41
[WARN ] 2026-06-01 01:46:38.049 [6342 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:46:49.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:46:52.271 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 01:46:52.271 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422494,ok=422494,error=0, records=41
[WARN ] 2026-06-01 01:46:52.554 [6341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:47:04.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:47:04.549 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21138/300s
[INFO ] 2026-06-01 01:47:07.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 01:47:07.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422495,ok=422495,error=0, records=41
[WARN ] 2026-06-01 01:47:07.561 [6325 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:47:19.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:47:22.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 01:47:22.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422496,ok=422496,error=0, records=41
[WARN ] 2026-06-01 01:47:22.567 [6394 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:47:34.551 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:47:37.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 01:47:37.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422497,ok=422497,error=0, records=41
[INFO ] 2026-06-01 01:47:37.392 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21136/300s
[WARN ] 2026-06-01 01:47:37.572 [6411 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:47:39.294 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21136/300s
[INFO ] 2026-06-01 01:47:47.042 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21136/300s
[INFO ] 2026-06-01 01:47:49.551 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:47:52.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 01:47:52.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422498,ok=422498,error=0, records=41
[WARN ] 2026-06-01 01:47:52.576 [6399 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:48:04.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:48:07.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 01:48:07.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422499,ok=422499,error=0, records=41
[WARN ] 2026-06-01 01:48:07.581 [6441 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:48:19.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:48:22.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 01:48:22.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422500,ok=422500,error=0, records=41
[WARN ] 2026-06-01 01:48:22.585 [6426 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:48:27.340 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20886868},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:48:27.516 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:48:27.516 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 01:48:27.516 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:48:27.517 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:48:27.517 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:48:27.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:48:27.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:48:27.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:48:27.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:48:27.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:48:27.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:48:34.553 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:48:37.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 01:48:37.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422501,ok=422501,error=0, records=41
[WARN ] 2026-06-01 01:48:37.590 [6426 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:48:49.553 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:48:52.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 01:48:52.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422502,ok=422502,error=0, records=41
[WARN ] 2026-06-01 01:48:52.595 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:49:04.554 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:49:07.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 01:49:07.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422503,ok=422503,error=0, records=41
[WARN ] 2026-06-01 01:49:07.600 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:49:19.554 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:49:22.326 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-01 01:49:22.326 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422504,ok=422504,error=0, records=41
[WARN ] 2026-06-01 01:49:22.605 [6508 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:49:34.555 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:49:37.332 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 01:49:37.332 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422505,ok=422505,error=0, records=41
[WARN ] 2026-06-01 01:49:37.609 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:49:49.555 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:49:49.613 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21131/300s
[INFO ] 2026-06-01 01:49:52.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-01 01:49:52.337 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422506,ok=422506,error=0, records=41
[INFO ] 2026-06-01 01:49:52.337 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21127/300s
[WARN ] 2026-06-01 01:49:52.615 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:50:00.623 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21140/300s
[INFO ] 2026-06-01 01:50:04.556 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:50:07.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 01:50:07.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422507,ok=422507,error=0, records=41
[WARN ] 2026-06-01 01:50:07.621 [6508 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:50:19.557 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:50:22.351 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 01:50:22.351 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422508,ok=422508,error=0, records=41
[WARN ] 2026-06-01 01:50:22.626 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:50:34.557 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:50:37.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 01:50:37.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422509,ok=422509,error=0, records=41
[WARN ] 2026-06-01 01:50:37.631 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:50:40.653 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21140/300s
[INFO ] 2026-06-01 01:50:49.558 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:50:52.166 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21127/300s
[INFO ] 2026-06-01 01:50:52.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 01:50:52.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422510,ok=422510,error=0, records=41
[WARN ] 2026-06-01 01:50:52.637 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:51:04.558 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:51:07.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-01 01:51:07.368 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422511,ok=422511,error=0, records=41
[WARN ] 2026-06-01 01:51:07.641 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:51:19.559 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:51:22.373 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-01 01:51:22.373 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422512,ok=422512,error=0, records=41
[WARN ] 2026-06-01 01:51:22.647 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:51:27.517 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17600/300s
[INFO ] 2026-06-01 01:51:27.518 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20886764},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:51:27.697 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:51:27.697 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 01:51:27.698 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:51:27.698 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:51:27.698 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:51:27.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:51:27.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:51:27.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:51:27.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:51:27.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:51:27.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:51:29.649 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21136/300s
[INFO ] 2026-06-01 01:51:34.559 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:51:37.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 01:51:37.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422513,ok=422513,error=0, records=41
[WARN ] 2026-06-01 01:51:37.652 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:51:49.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:51:52.384 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 01:51:52.384 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422514,ok=422514,error=0, records=41
[WARN ] 2026-06-01 01:51:52.658 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:52:04.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:52:04.560 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21139/300s
[INFO ] 2026-06-01 01:52:07.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 01:52:07.390 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422515,ok=422515,error=0, records=41
[WARN ] 2026-06-01 01:52:07.663 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:52:19.561 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:52:22.397 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 01:52:22.397 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422516,ok=422516,error=0, records=41
[WARN ] 2026-06-01 01:52:22.667 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:52:34.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:52:37.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10149, records=41
[INFO ] 2026-06-01 01:52:37.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422517,ok=422517,error=0, records=41
[INFO ] 2026-06-01 01:52:37.413 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21137/300s
[WARN ] 2026-06-01 01:52:37.672 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:52:39.314 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21137/300s
[INFO ] 2026-06-01 01:52:47.048 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21137/300s
[INFO ] 2026-06-01 01:52:49.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:52:52.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 01:52:52.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422518,ok=422518,error=0, records=41
[WARN ] 2026-06-01 01:52:52.678 [6508 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:53:04.563 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:53:07.419 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 01:53:07.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422519,ok=422519,error=0, records=41
[WARN ] 2026-06-01 01:53:07.683 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:53:19.563 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:53:22.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 01:53:22.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422520,ok=422520,error=0, records=41
[WARN ] 2026-06-01 01:53:22.688 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:53:34.564 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 01:53:34.564 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 01:53:37.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 01:53:37.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422521,ok=422521,error=0, records=41
[WARN ] 2026-06-01 01:53:37.693 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:53:49.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:53:49.565 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 01:53:52.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 01:53:52.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422522,ok=422522,error=0, records=41
[WARN ] 2026-06-01 01:53:52.698 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:54:04.566 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=24.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:54:07.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 01:54:07.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422523,ok=422523,error=0, records=41
[WARN ] 2026-06-01 01:54:07.704 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:54:19.567 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:54:22.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 01:54:22.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422524,ok=422524,error=0, records=41
[WARN ] 2026-06-01 01:54:22.710 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:54:27.699 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20886672},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:54:27.906 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:54:27.906 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 01:54:27.906 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:54:27.906 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:54:27.906 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:54:27.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:54:27.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:54:27.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:54:27.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:54:27.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:54:27.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:54:34.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:54:37.533 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 01:54:37.533 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422525,ok=422525,error=0, records=41
[WARN ] 2026-06-01 01:54:37.715 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:54:49.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:54:49.719 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21132/300s
[INFO ] 2026-06-01 01:54:52.538 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-01 01:54:52.538 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422526,ok=422526,error=0, records=41
[INFO ] 2026-06-01 01:54:52.538 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21128/300s
[WARN ] 2026-06-01 01:54:52.720 [6426 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:55:00.626 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21141/300s
[INFO ] 2026-06-01 01:55:04.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:55:07.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 01:55:07.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422527,ok=422527,error=0, records=41
[WARN ] 2026-06-01 01:55:07.724 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:55:19.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:55:22.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 01:55:22.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422528,ok=422528,error=0, records=41
[WARN ] 2026-06-01 01:55:22.729 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:55:34.570 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:55:37.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 01:55:37.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422529,ok=422529,error=0, records=41
[WARN ] 2026-06-01 01:55:37.734 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:55:40.658 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21141/300s
[INFO ] 2026-06-01 01:55:49.570 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:55:52.345 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21128/300s
[INFO ] 2026-06-01 01:55:52.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 01:55:52.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422530,ok=422530,error=0, records=41
[WARN ] 2026-06-01 01:55:52.739 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:56:04.571 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:56:07.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-01 01:56:07.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422531,ok=422531,error=0, records=41
[WARN ] 2026-06-01 01:56:07.745 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:56:19.572 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:56:22.568 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 01:56:22.568 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422532,ok=422532,error=0, records=41
[WARN ] 2026-06-01 01:56:22.751 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:56:29.712 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21137/300s
[INFO ] 2026-06-01 01:56:34.572 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:56:37.574 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 01:56:37.574 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422533,ok=422533,error=0, records=41
[WARN ] 2026-06-01 01:56:37.757 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:56:49.573 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:56:52.579 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 01:56:52.579 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422534,ok=422534,error=0, records=41
[WARN ] 2026-06-01 01:56:52.762 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:57:04.573 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:57:04.573 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21140/300s
[INFO ] 2026-06-01 01:57:07.585 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 01:57:07.585 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422535,ok=422535,error=0, records=41
[WARN ] 2026-06-01 01:57:07.768 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:57:19.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:57:22.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 01:57:22.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422536,ok=422536,error=0, records=41
[WARN ] 2026-06-01 01:57:22.773 [6426 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:57:27.907 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17601/300s
[INFO ] 2026-06-01 01:57:27.908 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20886584},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 01:57:28.074 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 01:57:28.074 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 01:57:28.074 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 01:57:28.074 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 01:57:28.074 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 01:57:28.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:57:28.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 01:57:28.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:57:28.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 01:57:28.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:57:28.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 01:57:34.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:57:37.460 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21138/300s
[INFO ] 2026-06-01 01:57:37.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 01:57:37.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422537,ok=422537,error=0, records=41
[WARN ] 2026-06-01 01:57:37.780 [6484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:57:39.361 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21138/300s
[INFO ] 2026-06-01 01:57:47.077 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21138/300s
[INFO ] 2026-06-01 01:57:49.575 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:57:52.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 01:57:52.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422538,ok=422538,error=0, records=41
[WARN ] 2026-06-01 01:57:52.784 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:58:04.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:58:07.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 01:58:07.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422539,ok=422539,error=0, records=41
[WARN ] 2026-06-01 01:58:07.789 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:58:19.577 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:58:22.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 01:58:22.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422540,ok=422540,error=0, records=41
[WARN ] 2026-06-01 01:58:22.795 [6426 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:58:34.577 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:58:37.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 01:58:37.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422541,ok=422541,error=0, records=41
[WARN ] 2026-06-01 01:58:37.800 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:58:49.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:58:52.624 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 01:58:52.624 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422542,ok=422542,error=0, records=41
[WARN ] 2026-06-01 01:58:52.806 [6478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:59:04.579 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:59:07.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 01:59:07.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422543,ok=422543,error=0, records=41
[WARN ] 2026-06-01 01:59:07.812 [7047 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:59:19.579 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:59:22.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 01:59:22.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422544,ok=422544,error=0, records=41
[WARN ] 2026-06-01 01:59:22.818 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:59:34.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:59:37.655 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 01:59:37.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422545,ok=422545,error=0, records=41
[WARN ] 2026-06-01 01:59:37.823 [7061 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 01:59:49.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 01:59:49.827 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21133/300s
[INFO ] 2026-06-01 01:59:52.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 01:59:52.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422546,ok=422546,error=0, records=41
[INFO ] 2026-06-01 01:59:52.661 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21129/300s
[WARN ] 2026-06-01 01:59:52.829 [7081 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:00:00.629 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21142/300s
[INFO ] 2026-06-01 02:00:04.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:00:07.666 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 02:00:07.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422547,ok=422547,error=0, records=41
[WARN ] 2026-06-01 02:00:07.834 [7081 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:00:19.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:00:22.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-01 02:00:22.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422548,ok=422548,error=0, records=41
[WARN ] 2026-06-01 02:00:22.839 [7132 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:00:28.075 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20886492},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:00:28.227 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:00:28.227 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 02:00:34.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:00:37.676 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 02:00:37.676 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422549,ok=422549,error=0, records=41
[WARN ] 2026-06-01 02:00:37.845 [7061 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:00:40.665 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21142/300s
[INFO ] 2026-06-01 02:00:49.583 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:00:52.526 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21129/300s
[INFO ] 2026-06-01 02:00:52.682 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 02:00:52.682 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422550,ok=422550,error=0, records=41
[WARN ] 2026-06-01 02:00:52.850 [7061 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:01:04.583 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:01:07.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 02:01:07.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422551,ok=422551,error=0, records=41
[WARN ] 2026-06-01 02:01:07.854 [7067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:01:19.584 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:01:22.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 02:01:22.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422552,ok=422552,error=0, records=41
[WARN ] 2026-06-01 02:01:22.859 [7169 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:01:29.765 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21138/300s
[INFO ] 2026-06-01 02:01:34.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:01:37.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 02:01:37.783 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422553,ok=422553,error=0, records=41
[WARN ] 2026-06-01 02:01:37.865 [7067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:01:49.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:01:52.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 02:01:52.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422554,ok=422554,error=0, records=41
[WARN ] 2026-06-01 02:01:52.871 [7169 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:02:04.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:02:04.586 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21141/300s
[INFO ] 2026-06-01 02:02:07.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 02:02:07.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422555,ok=422555,error=0, records=41
[WARN ] 2026-06-01 02:02:07.876 [7132 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:02:19.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:02:22.798 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-01 02:02:22.798 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422556,ok=422556,error=0, records=41
[WARN ] 2026-06-01 02:02:22.883 [7132 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:02:34.587 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:02:37.511 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21139/300s
[INFO ] 2026-06-01 02:02:37.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 02:02:37.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422557,ok=422557,error=0, records=41
[WARN ] 2026-06-01 02:02:37.888 [7282 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:02:39.418 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21139/300s
[INFO ] 2026-06-01 02:02:47.125 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21139/300s
[INFO ] 2026-06-01 02:02:49.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:02:52.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 02:02:52.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422558,ok=422558,error=0, records=41
[WARN ] 2026-06-01 02:02:52.898 [7312 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:03:04.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:03:07.813 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 02:03:07.813 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422559,ok=422559,error=0, records=41
[WARN ] 2026-06-01 02:03:07.904 [7329 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:03:19.589 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:03:22.819 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 02:03:22.819 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422560,ok=422560,error=0, records=41
[WARN ] 2026-06-01 02:03:22.908 [7347 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:03:28.227 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17602/300s
[INFO ] 2026-06-01 02:03:28.228 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20886360},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:03:28.382 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:03:28.383 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 02:03:28.383 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:03:28.383 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:03:28.383 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:03:28.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:03:28.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:03:28.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:03:28.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:03:28.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:03:28.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:03:34.590 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 02:03:34.590 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 02:03:37.826 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 02:03:37.826 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422561,ok=422561,error=0, records=41
[WARN ] 2026-06-01 02:03:37.912 [7358 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:03:49.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:03:52.831 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 02:03:52.831 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422562,ok=422562,error=0, records=41
[WARN ] 2026-06-01 02:03:52.920 [7366 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:04:04.592 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:04:07.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 02:04:07.839 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422563,ok=422563,error=0, records=41
[WARN ] 2026-06-01 02:04:07.930 [7396 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:04:19.595 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:04:22.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 02:04:22.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422564,ok=422564,error=0, records=41
[WARN ] 2026-06-01 02:04:22.935 [7419 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:04:34.595 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:04:37.851 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 02:04:37.851 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422565,ok=422565,error=0, records=41
[WARN ] 2026-06-01 02:04:37.943 [7442 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:04:49.596 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:04:49.948 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21134/300s
[INFO ] 2026-06-01 02:04:52.857 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 02:04:52.857 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422566,ok=422566,error=0, records=41
[INFO ] 2026-06-01 02:04:52.857 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21130/300s
[WARN ] 2026-06-01 02:04:52.950 [7442 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:05:00.633 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21143/300s
[INFO ] 2026-06-01 02:05:04.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:05:07.862 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 02:05:07.863 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422567,ok=422567,error=0, records=41
[WARN ] 2026-06-01 02:05:07.954 [7442 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:05:19.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:05:22.871 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 02:05:22.871 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422568,ok=422568,error=0, records=41
[WARN ] 2026-06-01 02:05:22.961 [7442 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:05:34.598 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:05:37.876 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 02:05:37.876 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422569,ok=422569,error=0, records=41
[WARN ] 2026-06-01 02:05:37.965 [7442 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:05:40.671 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21143/300s
[INFO ] 2026-06-01 02:05:49.599 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:05:52.705 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21130/300s
[INFO ] 2026-06-01 02:05:52.881 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 02:05:52.881 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422570,ok=422570,error=0, records=41
[WARN ] 2026-06-01 02:05:52.969 [7442 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:06:04.599 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:06:07.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 02:06:07.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422571,ok=422571,error=0, records=41
[WARN ] 2026-06-01 02:06:07.990 [7529 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:06:19.600 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:06:22.894 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 02:06:22.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422572,ok=422572,error=0, records=41
[WARN ] 2026-06-01 02:06:22.994 [7442 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:06:28.385 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20886240},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:06:28.555 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:06:28.555 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 02:06:28.555 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:06:28.555 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:06:28.555 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:06:28.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:06:28.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:06:28.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:06:28.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:06:28.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:06:28.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:06:29.844 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21139/300s
[INFO ] 2026-06-01 02:06:34.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:06:37.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 02:06:37.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422573,ok=422573,error=0, records=41
[WARN ] 2026-06-01 02:06:37.998 [7558 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:06:49.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:06:52.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 02:06:52.905 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422574,ok=422574,error=0, records=41
[WARN ] 2026-06-01 02:06:53.005 [7454 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:07:04.602 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:07:04.602 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21142/300s
[INFO ] 2026-06-01 02:07:07.911 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 02:07:07.911 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422575,ok=422575,error=0, records=41
[WARN ] 2026-06-01 02:07:08.010 [7529 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:07:19.603 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:07:22.916 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 02:07:22.916 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422576,ok=422576,error=0, records=41
[WARN ] 2026-06-01 02:07:23.014 [7529 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:07:34.604 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:07:37.575 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21140/300s
[INFO ] 2026-06-01 02:07:37.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 02:07:37.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422577,ok=422577,error=0, records=41
[WARN ] 2026-06-01 02:07:38.018 [7618 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:07:39.473 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21140/300s
[INFO ] 2026-06-01 02:07:47.182 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21140/300s
[INFO ] 2026-06-01 02:07:49.604 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:07:52.929 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 02:07:52.929 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422578,ok=422578,error=0, records=41
[WARN ] 2026-06-01 02:07:53.025 [7574 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:08:04.605 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:08:07.935 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 02:08:07.935 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422579,ok=422579,error=0, records=41
[WARN ] 2026-06-01 02:08:08.034 [7632 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:08:19.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:08:22.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 02:08:22.941 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422580,ok=422580,error=0, records=41
[WARN ] 2026-06-01 02:08:23.040 [7659 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:08:34.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:08:37.952 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 02:08:37.952 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422581,ok=422581,error=0, records=41
[WARN ] 2026-06-01 02:08:38.049 [7682 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:08:49.607 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:08:49.607 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 02:08:52.554 [7652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:08:52.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 02:08:52.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422582,ok=422582,error=0, records=41
[INFO ] 2026-06-01 02:09:04.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=24.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:09:07.559 [7700 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:09:07.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 02:09:07.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422583,ok=422583,error=0, records=41
[INFO ] 2026-06-01 02:09:19.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:09:22.564 [7721 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:09:22.971 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 02:09:22.971 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422584,ok=422584,error=0, records=41
[INFO ] 2026-06-01 02:09:28.555 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17603/300s
[INFO ] 2026-06-01 02:09:28.557 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20886164},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:09:28.714 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:09:28.714 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 02:09:28.714 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:09:28.714 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:09:28.714 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:09:28.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:09:28.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:09:28.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:09:28.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:09:28.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:09:28.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:09:34.610 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:09:37.568 [7783 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:09:37.976 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 02:09:37.976 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422585,ok=422585,error=0, records=41
[INFO ] 2026-06-01 02:09:49.610 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:09:50.072 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21135/300s
[WARN ] 2026-06-01 02:09:52.573 [7802 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:09:53.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 02:09:53.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422586,ok=422586,error=0, records=41
[INFO ] 2026-06-01 02:09:53.067 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21131/300s
[INFO ] 2026-06-01 02:10:00.636 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21144/300s
[INFO ] 2026-06-01 02:10:04.611 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:10:07.578 [7815 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:10:08.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 02:10:08.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422587,ok=422587,error=0, records=41
[INFO ] 2026-06-01 02:10:19.611 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:10:22.583 [7842 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:10:23.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 02:10:23.077 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422588,ok=422588,error=0, records=41
[INFO ] 2026-06-01 02:10:34.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:10:37.588 [7860 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:10:38.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 02:10:38.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422589,ok=422589,error=0, records=41
[INFO ] 2026-06-01 02:10:40.677 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21144/300s
[INFO ] 2026-06-01 02:10:49.613 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:10:52.593 [7847 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:10:52.897 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21131/300s
[INFO ] 2026-06-01 02:10:53.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 02:10:53.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422590,ok=422590,error=0, records=41
[INFO ] 2026-06-01 02:11:04.613 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:11:07.598 [7882 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:11:08.094 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 02:11:08.094 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422591,ok=422591,error=0, records=41
[INFO ] 2026-06-01 02:11:19.614 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:11:22.603 [7882 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:11:23.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 02:11:23.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422592,ok=422592,error=0, records=41
[INFO ] 2026-06-01 02:11:29.910 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21140/300s
[INFO ] 2026-06-01 02:11:34.614 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:11:37.608 [7860 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:11:38.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 02:11:38.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422593,ok=422593,error=0, records=41
[INFO ] 2026-06-01 02:11:49.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:11:52.613 [7860 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:11:53.132 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 02:11:53.132 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422594,ok=422594,error=0, records=41
[INFO ] 2026-06-01 02:12:04.616 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:12:04.616 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21143/300s
[WARN ] 2026-06-01 02:12:07.618 [7877 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:12:08.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=12038, records=51
[INFO ] 2026-06-01 02:12:08.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422595,ok=422595,error=0, records=51
[INFO ] 2026-06-01 02:12:19.616 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:12:22.622 [7882 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:12:23.220 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 02:12:23.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422596,ok=422596,error=0, records=41
[INFO ] 2026-06-01 02:12:28.716 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20886088},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:12:28.892 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:12:28.893 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 02:12:28.893 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:12:28.893 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:12:28.893 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:12:28.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:12:28.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:12:28.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:12:28.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:12:28.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:12:28.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:12:34.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:12:37.582 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21141/300s
[WARN ] 2026-06-01 02:12:37.627 [7877 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:12:38.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 02:12:38.226 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422597,ok=422597,error=0, records=41
[INFO ] 2026-06-01 02:12:39.484 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21141/300s
[INFO ] 2026-06-01 02:12:47.191 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21141/300s
[INFO ] 2026-06-01 02:12:49.618 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:12:52.632 [7892 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:12:53.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 02:12:53.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422598,ok=422598,error=0, records=41
[INFO ] 2026-06-01 02:13:04.618 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:13:07.637 [7872 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:13:08.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 02:13:08.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422599,ok=422599,error=0, records=41
[INFO ] 2026-06-01 02:13:19.619 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:13:22.641 [7872 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:13:23.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-01 02:13:23.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422600,ok=422600,error=0, records=41
[INFO ] 2026-06-01 02:13:34.619 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 02:13:34.619 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 02:13:37.646 [7877 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:13:38.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-01 02:13:38.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422601,ok=422601,error=0, records=41
[INFO ] 2026-06-01 02:13:49.620 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:13:52.651 [7860 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:13:53.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 02:13:53.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422602,ok=422602,error=0, records=41
[INFO ] 2026-06-01 02:14:04.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:14:07.656 [7892 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:14:08.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 02:14:08.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422603,ok=422603,error=0, records=41
[INFO ] 2026-06-01 02:14:19.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:14:22.661 [7877 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:14:23.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 02:14:23.294 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422604,ok=422604,error=0, records=41
[INFO ] 2026-06-01 02:14:34.622 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:14:37.667 [7877 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:14:38.299 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 02:14:38.299 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422605,ok=422605,error=0, records=41
[INFO ] 2026-06-01 02:14:49.622 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:14:50.172 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21136/300s
[WARN ] 2026-06-01 02:14:52.673 [7892 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:14:53.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 02:14:53.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422606,ok=422606,error=0, records=41
[INFO ] 2026-06-01 02:14:53.303 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21132/300s
[INFO ] 2026-06-01 02:15:00.639 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21145/300s
[INFO ] 2026-06-01 02:15:04.623 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:15:07.678 [7882 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:15:08.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-01 02:15:08.309 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422607,ok=422607,error=0, records=41
[INFO ] 2026-06-01 02:15:19.623 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:15:22.683 [7877 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:15:23.313 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 02:15:23.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422608,ok=422608,error=0, records=41
[INFO ] 2026-06-01 02:15:28.893 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17604/300s
[INFO ] 2026-06-01 02:15:28.894 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20886012},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:15:29.058 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:15:29.059 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 02:15:29.059 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:15:29.059 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:15:29.059 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:15:29.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:15:29.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:15:29.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:15:29.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:15:29.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:15:29.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:15:34.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:15:37.689 [7892 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:15:38.317 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 02:15:38.317 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422609,ok=422609,error=0, records=41
[INFO ] 2026-06-01 02:15:40.683 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21145/300s
[INFO ] 2026-06-01 02:15:49.625 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:15:52.695 [7892 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:15:53.076 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21132/300s
[INFO ] 2026-06-01 02:15:53.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 02:15:53.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422610,ok=422610,error=0, records=41
[INFO ] 2026-06-01 02:16:04.625 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:16:07.700 [7882 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:16:08.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 02:16:08.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422611,ok=422611,error=0, records=41
[INFO ] 2026-06-01 02:16:19.626 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:16:22.705 [7892 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:16:23.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 02:16:23.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422612,ok=422612,error=0, records=41
[INFO ] 2026-06-01 02:16:29.961 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21141/300s
[INFO ] 2026-06-01 02:16:34.626 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:16:37.711 [7892 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:16:38.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 02:16:38.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422613,ok=422613,error=0, records=41
[INFO ] 2026-06-01 02:16:49.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:16:52.716 [7877 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:16:53.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 02:16:53.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422614,ok=422614,error=0, records=41
[INFO ] 2026-06-01 02:17:04.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:17:04.627 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21144/300s
[WARN ] 2026-06-01 02:17:07.722 [7860 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:17:08.443 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 02:17:08.443 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422615,ok=422615,error=0, records=41
[INFO ] 2026-06-01 02:17:19.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:17:22.729 [7882 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:17:23.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-01 02:17:23.448 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422616,ok=422616,error=0, records=41
[INFO ] 2026-06-01 02:17:34.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:17:37.597 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21142/300s
[WARN ] 2026-06-01 02:17:37.735 [7882 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:17:38.454 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 02:17:38.454 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422617,ok=422617,error=0, records=41
[INFO ] 2026-06-01 02:17:39.499 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21142/300s
[INFO ] 2026-06-01 02:17:47.200 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21142/300s
[INFO ] 2026-06-01 02:17:49.629 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:17:52.741 [7877 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:17:53.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 02:17:53.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422618,ok=422618,error=0, records=41
[INFO ] 2026-06-01 02:18:04.629 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:18:07.747 [7892 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:18:08.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-01 02:18:08.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422619,ok=422619,error=0, records=41
[INFO ] 2026-06-01 02:18:19.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:18:22.751 [7882 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:18:23.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 02:18:23.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422620,ok=422620,error=0, records=41
[INFO ] 2026-06-01 02:18:29.060 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20885928},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:18:29.209 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:18:29.209 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 02:18:29.209 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:18:29.209 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:18:29.210 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:18:29.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:18:29.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:18:29.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:18:29.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:18:29.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:18:29.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:18:34.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:18:37.757 [7882 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:18:38.524 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 02:18:38.524 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422621,ok=422621,error=0, records=41
[INFO ] 2026-06-01 02:18:49.631 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:18:52.762 [7892 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:18:53.531 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 02:18:53.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422622,ok=422622,error=0, records=41
[INFO ] 2026-06-01 02:19:04.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:19:07.766 [7877 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:19:08.624 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 02:19:08.624 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422623,ok=422623,error=0, records=41
[INFO ] 2026-06-01 02:19:19.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:19:22.771 [7882 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:19:23.629 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 02:19:23.629 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422624,ok=422624,error=0, records=41
[INFO ] 2026-06-01 02:19:34.633 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:19:37.776 [7872 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:19:38.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 02:19:38.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422625,ok=422625,error=0, records=41
[INFO ] 2026-06-01 02:19:49.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:19:50.279 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21137/300s
[WARN ] 2026-06-01 02:19:52.780 [7892 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:19:53.640 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 02:19:53.640 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422626,ok=422626,error=0, records=41
[INFO ] 2026-06-01 02:19:53.640 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21133/300s
[INFO ] 2026-06-01 02:20:00.642 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21146/300s
[INFO ] 2026-06-01 02:20:04.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:20:07.785 [7892 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:20:08.688 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 02:20:08.688 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422627,ok=422627,error=0, records=41
[INFO ] 2026-06-01 02:20:19.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:20:22.790 [7872 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:20:23.694 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 02:20:23.694 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422628,ok=422628,error=0, records=41
[INFO ] 2026-06-01 02:20:34.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:20:37.795 [7872 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:20:38.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 02:20:38.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422629,ok=422629,error=0, records=41
[INFO ] 2026-06-01 02:20:40.689 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21146/300s
[INFO ] 2026-06-01 02:20:49.636 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:20:52.801 [7860 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:20:53.253 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21133/300s
[INFO ] 2026-06-01 02:20:53.706 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 02:20:53.706 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422630,ok=422630,error=0, records=41
[INFO ] 2026-06-01 02:21:04.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:21:07.806 [7872 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:21:08.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-01 02:21:08.711 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422631,ok=422631,error=0, records=41
[INFO ] 2026-06-01 02:21:19.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:21:22.811 [8448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:21:23.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 02:21:23.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422632,ok=422632,error=0, records=41
[INFO ] 2026-06-01 02:21:29.210 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17605/300s
[INFO ] 2026-06-01 02:21:29.211 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20877656},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:21:29.388 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:21:29.388 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 02:21:29.389 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:21:29.389 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:21:29.389 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:21:29.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:21:29.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:21:29.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:21:29.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:21:29.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:21:29.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:21:30.011 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21142/300s
[INFO ] 2026-06-01 02:21:34.638 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:21:37.816 [8464 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:21:38.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-01 02:21:38.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422633,ok=422633,error=0, records=41
[INFO ] 2026-06-01 02:21:49.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:21:52.821 [8459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:21:53.728 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-01 02:21:53.728 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422634,ok=422634,error=0, records=41
[INFO ] 2026-06-01 02:22:04.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:22:04.639 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21145/300s
[WARN ] 2026-06-01 02:22:07.826 [8433 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:22:08.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 02:22:08.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422635,ok=422635,error=0, records=41
[INFO ] 2026-06-01 02:22:19.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:22:22.832 [8478 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:22:23.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-01 02:22:23.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422636,ok=422636,error=0, records=41
[INFO ] 2026-06-01 02:22:34.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:22:37.640 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21143/300s
[WARN ] 2026-06-01 02:22:37.838 [8459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:22:38.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 02:22:38.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422637,ok=422637,error=0, records=41
[INFO ] 2026-06-01 02:22:39.542 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21143/300s
[INFO ] 2026-06-01 02:22:47.246 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21143/300s
[INFO ] 2026-06-01 02:22:49.641 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:22:52.844 [8528 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:22:53.752 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10146, records=41
[INFO ] 2026-06-01 02:22:53.752 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422638,ok=422638,error=0, records=41
[INFO ] 2026-06-01 02:23:04.641 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:23:07.850 [8528 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:23:08.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 02:23:08.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422639,ok=422639,error=0, records=41
[INFO ] 2026-06-01 02:23:19.642 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:23:22.855 [8566 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:23:23.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 02:23:23.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422640,ok=422640,error=0, records=41
[INFO ] 2026-06-01 02:23:34.643 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 02:23:34.643 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 02:23:37.859 [8448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:23:38.774 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-01 02:23:38.774 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422641,ok=422641,error=0, records=41
[INFO ] 2026-06-01 02:23:49.643 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:23:49.643 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 02:23:52.864 [8566 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:23:53.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-01 02:23:53.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422642,ok=422642,error=0, records=41
[INFO ] 2026-06-01 02:24:04.645 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:24:07.868 [8552 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:24:08.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 02:24:08.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422643,ok=422643,error=0, records=41
[INFO ] 2026-06-01 02:24:19.645 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:24:22.874 [8608 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:24:23.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 02:24:23.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422644,ok=422644,error=0, records=41
[INFO ] 2026-06-01 02:24:29.390 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20877576},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:24:29.540 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:24:29.540 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 02:24:29.540 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:24:29.540 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:24:29.540 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:24:29.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:24:29.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:24:29.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:24:29.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:24:29.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:24:29.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:24:34.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:24:37.880 [8641 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:24:38.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 02:24:38.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422645,ok=422645,error=0, records=41
[INFO ] 2026-06-01 02:24:49.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:24:50.385 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21138/300s
[WARN ] 2026-06-01 02:24:52.886 [8662 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:24:53.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 02:24:53.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422646,ok=422646,error=0, records=41
[INFO ] 2026-06-01 02:24:53.803 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21134/300s
[INFO ] 2026-06-01 02:25:00.645 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21147/300s
[INFO ] 2026-06-01 02:25:04.647 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:25:07.891 [8680 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:25:08.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 02:25:08.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422647,ok=422647,error=0, records=41
[INFO ] 2026-06-01 02:25:19.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:25:22.896 [8691 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:25:23.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-01 02:25:23.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422648,ok=422648,error=0, records=41
[INFO ] 2026-06-01 02:25:34.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:25:37.901 [8712 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:25:38.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 02:25:38.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422649,ok=422649,error=0, records=41
[INFO ] 2026-06-01 02:25:40.695 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21147/300s
[INFO ] 2026-06-01 02:25:49.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:25:52.906 [8724 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:25:53.428 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21134/300s
[INFO ] 2026-06-01 02:25:53.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 02:25:53.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422650,ok=422650,error=0, records=41
[INFO ] 2026-06-01 02:26:04.650 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:26:07.911 [8729 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:26:08.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 02:26:08.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422651,ok=422651,error=0, records=41
[INFO ] 2026-06-01 02:26:19.651 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:26:22.917 [8741 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:26:23.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 02:26:23.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422652,ok=422652,error=0, records=41
[INFO ] 2026-06-01 02:26:30.063 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21143/300s
[INFO ] 2026-06-01 02:26:34.651 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:26:37.922 [8753 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:26:38.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 02:26:38.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422653,ok=422653,error=0, records=41
[INFO ] 2026-06-01 02:26:49.652 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:26:52.927 [8797 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:26:53.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 02:26:53.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422654,ok=422654,error=0, records=41
[INFO ] 2026-06-01 02:27:04.652 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:27:04.652 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21146/300s
[WARN ] 2026-06-01 02:27:07.931 [8797 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:27:08.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 02:27:08.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422655,ok=422655,error=0, records=41
[INFO ] 2026-06-01 02:27:19.653 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:27:22.936 [8830 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:27:23.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 02:27:23.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422656,ok=422656,error=0, records=41
[INFO ] 2026-06-01 02:27:29.540 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17606/300s
[INFO ] 2026-06-01 02:27:29.542 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20877500},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:27:29.696 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:27:29.696 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 02:27:29.696 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:27:29.696 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:27:29.696 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:27:29.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:27:29.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:27:29.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:27:29.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:27:29.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:27:29.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:27:34.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:27:37.683 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21144/300s
[WARN ] 2026-06-01 02:27:37.942 [8836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:27:38.935 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 02:27:38.935 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422657,ok=422657,error=0, records=41
[INFO ] 2026-06-01 02:27:39.584 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21144/300s
[INFO ] 2026-06-01 02:27:47.290 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21144/300s
[INFO ] 2026-06-01 02:27:49.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:27:52.947 [8865 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:27:53.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 02:27:53.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422658,ok=422658,error=0, records=41
[INFO ] 2026-06-01 02:28:04.655 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:28:07.952 [8843 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:28:08.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 02:28:08.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422659,ok=422659,error=0, records=41
[INFO ] 2026-06-01 02:28:19.655 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:28:22.957 [8836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:28:23.970 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 02:28:23.970 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422660,ok=422660,error=0, records=41
[INFO ] 2026-06-01 02:28:34.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:28:37.961 [8836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:28:38.976 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 02:28:38.976 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422661,ok=422661,error=0, records=41
[INFO ] 2026-06-01 02:28:49.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:28:52.966 [8918 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:28:53.982 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 02:28:53.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422662,ok=422662,error=0, records=41
[INFO ] 2026-06-01 02:29:04.657 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:29:07.970 [8932 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:29:08.988 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 02:29:08.988 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422663,ok=422663,error=0, records=41
[INFO ] 2026-06-01 02:29:19.658 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:29:22.975 [8918 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:29:23.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 02:29:23.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422664,ok=422664,error=0, records=41
[INFO ] 2026-06-01 02:29:34.658 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:29:37.981 [8918 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:29:39.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 02:29:39.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422665,ok=422665,error=0, records=41
[INFO ] 2026-06-01 02:29:49.659 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:29:50.485 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21139/300s
[WARN ] 2026-06-01 02:29:52.987 [8836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:29:54.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 02:29:54.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422666,ok=422666,error=0, records=41
[INFO ] 2026-06-01 02:29:54.010 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21135/300s
[INFO ] 2026-06-01 02:30:00.649 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21148/300s
[INFO ] 2026-06-01 02:30:04.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:30:07.992 [8918 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:30:09.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 02:30:09.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422667,ok=422667,error=0, records=41
[INFO ] 2026-06-01 02:30:19.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:30:22.998 [8904 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:30:24.020 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 02:30:24.020 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422668,ok=422668,error=0, records=41
[INFO ] 2026-06-01 02:30:29.698 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20877420},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:30:29.863 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:30:29.863 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 02:30:29.863 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:30:29.863 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:30:29.863 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:30:29.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:30:29.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:30:29.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:30:29.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:30:29.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:30:29.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:30:34.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:30:38.002 [8993 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:30:39.025 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 02:30:39.025 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422669,ok=422669,error=0, records=41
[INFO ] 2026-06-01 02:30:40.701 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21148/300s
[INFO ] 2026-06-01 02:30:49.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:30:53.008 [9036 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:30:53.609 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21135/300s
[INFO ] 2026-06-01 02:30:54.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 02:30:54.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422670,ok=422670,error=0, records=41
[INFO ] 2026-06-01 02:31:04.662 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:31:08.013 [8993 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:31:09.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 02:31:09.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422671,ok=422671,error=0, records=41
[INFO ] 2026-06-01 02:31:19.662 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:31:23.018 [8836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:31:24.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 02:31:24.055 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422672,ok=422672,error=0, records=41
[INFO ] 2026-06-01 02:31:30.113 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21144/300s
[INFO ] 2026-06-01 02:31:34.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:31:38.022 [8918 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:31:39.059 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 02:31:39.059 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422673,ok=422673,error=0, records=41
[INFO ] 2026-06-01 02:31:49.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:31:53.028 [8836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:31:54.064 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 02:31:54.064 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422674,ok=422674,error=0, records=41
[INFO ] 2026-06-01 02:32:04.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:32:04.664 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21147/300s
[WARN ] 2026-06-01 02:32:08.033 [8836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:32:09.069 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 02:32:09.069 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422675,ok=422675,error=0, records=41
[INFO ] 2026-06-01 02:32:19.665 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:32:23.037 [9036 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:32:24.075 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 02:32:24.075 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422676,ok=422676,error=0, records=41
[INFO ] 2026-06-01 02:32:34.665 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:32:37.722 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21145/300s
[WARN ] 2026-06-01 02:32:38.044 [9106 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:32:39.082 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-01 02:32:39.082 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422677,ok=422677,error=0, records=41
[INFO ] 2026-06-01 02:32:39.624 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21145/300s
[INFO ] 2026-06-01 02:32:47.331 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21145/300s
[INFO ] 2026-06-01 02:32:49.666 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:32:53.049 [9148 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:32:54.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 02:32:54.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422678,ok=422678,error=0, records=41
[INFO ] 2026-06-01 02:33:04.666 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:33:07.554 [9159 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:33:09.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-01 02:33:09.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422679,ok=422679,error=0, records=41
[INFO ] 2026-06-01 02:33:19.667 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:33:22.559 [9187 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:33:24.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 02:33:24.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422680,ok=422680,error=0, records=41
[INFO ] 2026-06-01 02:33:29.863 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17607/300s
[INFO ] 2026-06-01 02:33:29.864 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20877344},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:33:30.016 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:33:30.016 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 02:33:30.016 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:33:30.017 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:33:30.017 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:33:30.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:33:30.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:33:30.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:33:30.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:33:30.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:33:30.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:33:34.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 02:33:34.668 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 02:33:37.564 [9193 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:33:39.122 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 02:33:39.122 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422681,ok=422681,error=0, records=41
[INFO ] 2026-06-01 02:33:49.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:33:52.568 [9224 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:33:54.127 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 02:33:54.127 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422682,ok=422682,error=0, records=41
[INFO ] 2026-06-01 02:34:04.669 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:34:07.572 [9247 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:34:09.133 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 02:34:09.133 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422683,ok=422683,error=0, records=41
[INFO ] 2026-06-01 02:34:19.669 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:34:22.577 [9208 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:34:24.138 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 02:34:24.138 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422684,ok=422684,error=0, records=41
[INFO ] 2026-06-01 02:34:34.670 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:34:37.582 [9248 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:34:39.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 02:34:39.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422685,ok=422685,error=0, records=41
[INFO ] 2026-06-01 02:34:49.671 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:34:50.586 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21140/300s
[WARN ] 2026-06-01 02:34:52.587 [9248 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:34:54.149 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 02:34:54.149 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422686,ok=422686,error=0, records=41
[INFO ] 2026-06-01 02:34:54.149 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21136/300s
[INFO ] 2026-06-01 02:35:00.652 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21149/300s
[INFO ] 2026-06-01 02:35:04.671 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:35:07.592 [9273 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:35:09.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 02:35:09.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422687,ok=422687,error=0, records=41
[INFO ] 2026-06-01 02:35:19.672 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:35:22.597 [9294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:35:24.161 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 02:35:24.161 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422688,ok=422688,error=0, records=41
[INFO ] 2026-06-01 02:35:34.672 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:35:37.602 [9313 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:35:39.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 02:35:39.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422689,ok=422689,error=0, records=41
[INFO ] 2026-06-01 02:35:40.707 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21149/300s
[INFO ] 2026-06-01 02:35:49.673 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:35:52.606 [9294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:35:53.788 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21136/300s
[INFO ] 2026-06-01 02:35:54.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 02:35:54.170 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422690,ok=422690,error=0, records=41
[INFO ] 2026-06-01 02:36:04.673 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:36:07.611 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:36:09.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 02:36:09.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422691,ok=422691,error=0, records=41
[INFO ] 2026-06-01 02:36:19.674 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:36:22.616 [9327 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:36:24.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10383, records=41
[INFO ] 2026-06-01 02:36:24.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422692,ok=422692,error=0, records=41
[INFO ] 2026-06-01 02:36:30.018 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20877268},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:36:30.169 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21145/300s
[INFO ] 2026-06-01 02:36:30.195 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:36:30.195 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 02:36:30.195 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:36:30.195 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:36:30.195 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:36:30.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:36:30.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:36:30.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:36:30.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:36:30.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:36:30.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:36:34.675 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:36:37.621 [9294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:36:39.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-01 02:36:39.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422693,ok=422693,error=0, records=41
[INFO ] 2026-06-01 02:36:49.675 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:36:52.626 [9327 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:36:54.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-01 02:36:54.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422694,ok=422694,error=0, records=41
[INFO ] 2026-06-01 02:37:04.676 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:37:04.676 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21148/300s
[WARN ] 2026-06-01 02:37:07.631 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:37:09.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 02:37:09.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422695,ok=422695,error=0, records=41
[INFO ] 2026-06-01 02:37:19.676 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:37:22.637 [9313 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:37:24.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 02:37:24.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422696,ok=422696,error=0, records=41
[WARN ] 2026-06-01 02:37:32.642 [9294 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6573/stat), No such file or directory
[WARN ] 2026-06-01 02:37:32.642 [9294 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5465/stat), No such file or directory
[WARN ] 2026-06-01 02:37:32.643 [9294 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6172/stat), No such file or directory
[INFO ] 2026-06-01 02:37:34.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:37:37.643 [9313 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:37:37.784 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21146/300s
[INFO ] 2026-06-01 02:37:39.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 02:37:39.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422697,ok=422697,error=0, records=41
[INFO ] 2026-06-01 02:37:39.684 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21146/300s
[INFO ] 2026-06-01 02:37:47.389 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21146/300s
[WARN ] 2026-06-01 02:37:47.648 [9312 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6573/stat), No such file or directory
[WARN ] 2026-06-01 02:37:47.648 [9312 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5465/stat), No such file or directory
[WARN ] 2026-06-01 02:37:47.649 [9312 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6172/stat), No such file or directory
[INFO ] 2026-06-01 02:37:49.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:37:52.649 [9294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:37:54.225 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 02:37:54.225 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422698,ok=422698,error=0, records=41
[INFO ] 2026-06-01 02:38:04.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:38:07.654 [9312 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:38:09.230 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 02:38:09.230 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422699,ok=422699,error=0, records=41
[INFO ] 2026-06-01 02:38:19.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:38:22.659 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:38:24.236 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 02:38:24.236 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422700,ok=422700,error=0, records=41
[INFO ] 2026-06-01 02:38:34.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:38:37.665 [9294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:38:39.241 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 02:38:39.241 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422701,ok=422701,error=0, records=41
[INFO ] 2026-06-01 02:38:49.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:38:49.679 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 02:38:52.670 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:38:54.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 02:38:54.247 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422702,ok=422702,error=0, records=41
[INFO ] 2026-06-01 02:39:04.681 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:39:07.675 [9327 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:39:09.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 02:39:09.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422703,ok=422703,error=0, records=41
[INFO ] 2026-06-01 02:39:19.681 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:39:22.680 [9294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:39:24.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 02:39:24.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422704,ok=422704,error=0, records=41
[INFO ] 2026-06-01 02:39:30.195 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17608/300s
[INFO ] 2026-06-01 02:39:30.197 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20877176},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:39:30.501 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:39:30.501 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 02:39:30.501 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:39:30.501 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:39:30.501 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:39:30.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:39:30.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:39:30.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:39:30.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:39:30.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:39:30.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:39:34.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:39:37.686 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:39:39.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 02:39:39.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422705,ok=422705,error=0, records=41
[INFO ] 2026-06-01 02:39:49.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:39:50.690 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21141/300s
[WARN ] 2026-06-01 02:39:52.691 [9312 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:39:54.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 02:39:54.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422706,ok=422706,error=0, records=41
[INFO ] 2026-06-01 02:39:54.270 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21137/300s
[INFO ] 2026-06-01 02:40:00.655 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21150/300s
[INFO ] 2026-06-01 02:40:04.683 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:40:07.696 [9312 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:40:09.276 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 02:40:09.276 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422707,ok=422707,error=0, records=41
[INFO ] 2026-06-01 02:40:19.683 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:40:22.701 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:40:24.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 02:40:24.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422708,ok=422708,error=0, records=41
[INFO ] 2026-06-01 02:40:34.684 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:40:37.706 [9313 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:40:39.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-01 02:40:39.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422709,ok=422709,error=0, records=41
[INFO ] 2026-06-01 02:40:40.713 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21150/300s
[INFO ] 2026-06-01 02:40:49.684 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:40:52.711 [9312 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:40:53.963 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21137/300s
[INFO ] 2026-06-01 02:40:54.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-01 02:40:54.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422710,ok=422710,error=0, records=41
[INFO ] 2026-06-01 02:41:04.685 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:41:07.717 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:41:09.305 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 02:41:09.305 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422711,ok=422711,error=0, records=41
[INFO ] 2026-06-01 02:41:19.686 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:41:22.722 [9313 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:41:24.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 02:41:24.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422712,ok=422712,error=0, records=41
[INFO ] 2026-06-01 02:41:30.222 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21146/300s
[INFO ] 2026-06-01 02:41:34.686 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:41:37.726 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:41:39.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 02:41:39.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422713,ok=422713,error=0, records=41
[INFO ] 2026-06-01 02:41:49.687 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:41:52.732 [9313 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:41:54.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 02:41:54.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422714,ok=422714,error=0, records=41
[INFO ] 2026-06-01 02:42:04.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:42:04.688 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21149/300s
[WARN ] 2026-06-01 02:42:07.737 [9313 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:42:09.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 02:42:09.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422715,ok=422715,error=0, records=41
[INFO ] 2026-06-01 02:42:19.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:42:22.742 [9313 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:42:24.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 02:42:24.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422716,ok=422716,error=0, records=41
[INFO ] 2026-06-01 02:42:30.503 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20877100},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:42:30.663 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:42:30.663 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 02:42:30.664 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:42:30.664 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:42:30.664 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:42:30.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:42:30.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:42:30.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:42:30.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:42:30.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:42:30.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:42:34.689 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:42:37.747 [9312 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:42:37.806 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21147/300s
[INFO ] 2026-06-01 02:42:39.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 02:42:39.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422717,ok=422717,error=0, records=41
[INFO ] 2026-06-01 02:42:39.707 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21147/300s
[INFO ] 2026-06-01 02:42:47.411 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21147/300s
[INFO ] 2026-06-01 02:42:49.689 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:42:52.752 [9313 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:42:54.370 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 02:42:54.370 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422718,ok=422718,error=0, records=41
[INFO ] 2026-06-01 02:43:04.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:43:07.758 [9312 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:43:09.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 02:43:09.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422719,ok=422719,error=0, records=41
[INFO ] 2026-06-01 02:43:19.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=25.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:43:22.762 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:43:24.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 02:43:24.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422720,ok=422720,error=0, records=41
[INFO ] 2026-06-01 02:43:34.691 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 02:43:34.691 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 02:43:37.768 [9327 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:43:39.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 02:43:39.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422721,ok=422721,error=0, records=41
[INFO ] 2026-06-01 02:43:49.691 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:43:52.774 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:43:54.391 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 02:43:54.391 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422722,ok=422722,error=0, records=41
[INFO ] 2026-06-01 02:44:04.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:44:07.780 [9294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:44:09.397 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10382, records=41
[INFO ] 2026-06-01 02:44:09.397 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422723,ok=422723,error=0, records=41
[INFO ] 2026-06-01 02:44:19.693 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:44:22.785 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:44:24.409 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10405, records=41
[INFO ] 2026-06-01 02:44:24.409 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422724,ok=422724,error=0, records=41
[INFO ] 2026-06-01 02:44:34.693 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:44:37.790 [9294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:44:39.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 02:44:39.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422725,ok=422725,error=0, records=41
[INFO ] 2026-06-01 02:44:49.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:44:50.794 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21142/300s
[WARN ] 2026-06-01 02:44:52.795 [9313 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:44:54.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 02:44:54.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422726,ok=422726,error=0, records=41
[INFO ] 2026-06-01 02:44:54.420 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21138/300s
[INFO ] 2026-06-01 02:45:00.658 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21151/300s
[INFO ] 2026-06-01 02:45:04.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:45:07.801 [9327 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:45:09.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 02:45:09.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422727,ok=422727,error=0, records=41
[INFO ] 2026-06-01 02:45:19.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:45:22.806 [9881 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:45:24.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 02:45:24.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422728,ok=422728,error=0, records=41
[INFO ] 2026-06-01 02:45:30.664 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17609/300s
[INFO ] 2026-06-01 02:45:30.665 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20877024},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:45:30.808 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:45:30.808 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 02:45:30.808 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:45:30.808 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:45:30.808 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:45:30.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:45:30.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:45:30.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:45:30.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:45:30.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:45:30.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:45:34.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:45:37.811 [9332 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:45:39.435 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 02:45:39.435 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422729,ok=422729,error=0, records=41
[INFO ] 2026-06-01 02:45:40.719 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21151/300s
[INFO ] 2026-06-01 02:45:49.696 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:45:52.816 [9917 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:45:54.140 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21138/300s
[INFO ] 2026-06-01 02:45:54.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 02:45:54.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422730,ok=422730,error=0, records=41
[INFO ] 2026-06-01 02:46:04.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:46:07.820 [9902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:46:09.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 02:46:09.449 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422731,ok=422731,error=0, records=41
[INFO ] 2026-06-01 02:46:19.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:46:22.825 [9312 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:46:24.453 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 02:46:24.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422732,ok=422732,error=0, records=41
[INFO ] 2026-06-01 02:46:30.269 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21147/300s
[INFO ] 2026-06-01 02:46:34.698 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:46:37.831 [9933 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:46:39.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 02:46:39.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422733,ok=422733,error=0, records=41
[INFO ] 2026-06-01 02:46:49.698 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:46:52.836 [9933 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:46:54.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 02:46:54.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422734,ok=422734,error=0, records=41
[INFO ] 2026-06-01 02:47:04.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:47:04.699 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21150/300s
[WARN ] 2026-06-01 02:47:07.841 [9933 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:47:09.478 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 02:47:09.478 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422735,ok=422735,error=0, records=41
[INFO ] 2026-06-01 02:47:19.700 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:47:22.846 [9933 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:47:24.485 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-01 02:47:24.485 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422736,ok=422736,error=0, records=41
[INFO ] 2026-06-01 02:47:34.700 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:47:37.819 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21148/300s
[WARN ] 2026-06-01 02:47:37.851 [9974 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:47:39.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-01 02:47:39.491 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422737,ok=422737,error=0, records=41
[INFO ] 2026-06-01 02:47:39.720 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21148/300s
[INFO ] 2026-06-01 02:47:47.427 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21148/300s
[INFO ] 2026-06-01 02:47:49.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:47:52.855 [9974 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:47:54.497 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 02:47:54.497 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422738,ok=422738,error=0, records=41
[INFO ] 2026-06-01 02:48:04.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:48:07.860 [10039] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:48:09.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 02:48:09.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422739,ok=422739,error=0, records=41
[INFO ] 2026-06-01 02:48:19.702 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:48:22.866 [10025] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:48:24.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 02:48:24.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422740,ok=422740,error=0, records=41
[INFO ] 2026-06-01 02:48:30.810 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876936},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:48:30.963 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:48:30.963 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 02:48:30.963 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:48:30.963 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:48:30.963 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:48:31.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:48:31.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:48:31.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:48:31.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:48:31.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:48:31.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:48:34.702 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:48:37.872 [10039] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:48:39.513 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 02:48:39.513 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422741,ok=422741,error=0, records=41
[INFO ] 2026-06-01 02:48:49.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:48:52.879 [10081] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:48:54.518 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 02:48:54.518 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422742,ok=422742,error=0, records=41
[INFO ] 2026-06-01 02:49:04.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:49:07.885 [9974 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:49:09.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 02:49:09.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422743,ok=422743,error=0, records=41
[INFO ] 2026-06-01 02:49:19.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:49:22.891 [10081] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:49:24.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 02:49:24.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422744,ok=422744,error=0, records=41
[INFO ] 2026-06-01 02:49:34.705 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:49:37.896 [10138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:49:39.555 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 02:49:39.555 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422745,ok=422745,error=0, records=41
[INFO ] 2026-06-01 02:49:49.705 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:49:50.900 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21143/300s
[WARN ] 2026-06-01 02:49:52.902 [10105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:49:54.560 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 02:49:54.560 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422746,ok=422746,error=0, records=41
[INFO ] 2026-06-01 02:49:54.560 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21139/300s
[INFO ] 2026-06-01 02:50:00.661 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21152/300s
[INFO ] 2026-06-01 02:50:04.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:50:07.909 [10144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:50:09.565 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10487, records=41
[INFO ] 2026-06-01 02:50:09.565 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422747,ok=422747,error=0, records=41
[INFO ] 2026-06-01 02:50:19.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:50:22.920 [10105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:50:24.570 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10562, records=41
[INFO ] 2026-06-01 02:50:24.570 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422748,ok=422748,error=0, records=41
[INFO ] 2026-06-01 02:50:34.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:50:37.928 [10209] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:50:39.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10522, records=41
[INFO ] 2026-06-01 02:50:39.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422749,ok=422749,error=0, records=41
[INFO ] 2026-06-01 02:50:40.725 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21152/300s
[INFO ] 2026-06-01 02:50:49.708 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:50:52.938 [10226] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:50:54.319 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21139/300s
[INFO ] 2026-06-01 02:50:54.580 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10544, records=41
[INFO ] 2026-06-01 02:50:54.580 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422750,ok=422750,error=0, records=41
[INFO ] 2026-06-01 02:51:04.708 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:51:07.945 [10204] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:51:09.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-01 02:51:14.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422751,ok=422751,error=0, records=41
[INFO ] 2026-06-01 02:51:19.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:51:22.954 [10246] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:51:29.061 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-01 02:51:29.061 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422752,ok=422752,error=0, records=41
[INFO ] 2026-06-01 02:51:30.319 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21148/300s
[INFO ] 2026-06-01 02:51:30.963 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17610/300s
[INFO ] 2026-06-01 02:51:30.965 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20828976},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:51:31.148 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:51:31.148 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 02:51:31.148 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:51:31.148 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:51:31.148 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:51:31.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:51:31.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:51:31.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:51:31.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:51:31.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:51:31.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:51:34.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:51:37.961 [10252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:51:44.066 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 02:51:44.066 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422753,ok=422753,error=0, records=41
[INFO ] 2026-06-01 02:51:49.710 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:51:52.969 [10252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:51:59.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-01 02:51:59.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422754,ok=422754,error=0, records=41
[INFO ] 2026-06-01 02:52:04.711 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:52:04.711 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21151/300s
[WARN ] 2026-06-01 02:52:07.977 [10296] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:52:14.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 02:52:14.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422755,ok=422755,error=0, records=41
[INFO ] 2026-06-01 02:52:19.711 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:52:22.985 [10332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:52:29.084 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 02:52:29.084 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422756,ok=422756,error=0, records=41
[INFO ] 2026-06-01 02:52:34.712 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:52:37.914 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21149/300s
[WARN ] 2026-06-01 02:52:37.990 [10349] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:52:39.790 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21149/300s
[INFO ] 2026-06-01 02:52:44.091 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 02:52:44.091 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422757,ok=422757,error=0, records=41
[INFO ] 2026-06-01 02:52:47.496 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21149/300s
[INFO ] 2026-06-01 02:52:49.713 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:52:52.995 [10349] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:52:59.096 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 02:52:59.097 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422758,ok=422758,error=0, records=41
[INFO ] 2026-06-01 02:53:04.713 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:53:08.000 [10246] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:53:14.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 02:53:14.103 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422759,ok=422759,error=0, records=41
[INFO ] 2026-06-01 02:53:19.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:53:23.014 [10349] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:53:29.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 02:53:29.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422760,ok=422760,error=0, records=41
[INFO ] 2026-06-01 02:53:34.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 02:53:34.715 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 02:53:38.020 [10365] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:53:44.112 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-01 02:53:44.112 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422761,ok=422761,error=0, records=41
[INFO ] 2026-06-01 02:53:49.716 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:53:49.716 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 02:53:53.027 [10400] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:53:59.121 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 02:53:59.121 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422762,ok=422762,error=0, records=41
[INFO ] 2026-06-01 02:54:04.717 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:54:08.032 [10365] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:54:14.126 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-01 02:54:14.126 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422763,ok=422763,error=0, records=41
[INFO ] 2026-06-01 02:54:19.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:54:23.037 [10471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:54:29.131 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 02:54:29.131 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422764,ok=422764,error=0, records=41
[INFO ] 2026-06-01 02:54:31.149 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876776},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:54:31.308 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:54:31.308 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 02:54:31.308 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:54:31.308 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:54:31.308 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:54:31.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:54:31.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:54:31.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:54:31.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:54:31.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:54:31.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:54:34.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:54:38.041 [10491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:54:44.142 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10146, records=41
[INFO ] 2026-06-01 02:54:44.142 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422765,ok=422765,error=0, records=41
[INFO ] 2026-06-01 02:54:49.719 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:54:51.045 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21144/300s
[WARN ] 2026-06-01 02:54:53.046 [10489] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:54:59.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10130, records=41
[INFO ] 2026-06-01 02:54:59.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422766,ok=422766,error=0, records=41
[INFO ] 2026-06-01 02:54:59.148 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21140/300s
[INFO ] 2026-06-01 02:55:00.664 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21153/300s
[INFO ] 2026-06-01 02:55:04.719 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:55:08.050 [10516] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:55:14.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 02:55:14.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422767,ok=422767,error=0, records=41
[INFO ] 2026-06-01 02:55:19.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:55:22.554 [10539] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:55:29.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 02:55:29.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422768,ok=422768,error=0, records=41
[INFO ] 2026-06-01 02:55:34.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:55:37.558 [10556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:55:40.731 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21153/300s
[INFO ] 2026-06-01 02:55:44.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 02:55:44.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422769,ok=422769,error=0, records=41
[INFO ] 2026-06-01 02:55:49.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:55:52.562 [10562] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:55:54.496 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21140/300s
[INFO ] 2026-06-01 02:55:59.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 02:55:59.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422770,ok=422770,error=0, records=41
[INFO ] 2026-06-01 02:56:04.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:56:07.567 [10598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:56:14.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10378, records=41
[INFO ] 2026-06-01 02:56:14.175 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422771,ok=422771,error=0, records=41
[INFO ] 2026-06-01 02:56:19.722 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:56:22.571 [10598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:56:29.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10407, records=41
[INFO ] 2026-06-01 02:56:29.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422772,ok=422772,error=0, records=41
[INFO ] 2026-06-01 02:56:30.375 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21149/300s
[INFO ] 2026-06-01 02:56:34.723 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:56:37.575 [10593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:56:44.185 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-01 02:56:44.185 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422773,ok=422773,error=0, records=41
[INFO ] 2026-06-01 02:56:49.723 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:56:52.581 [10630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:56:59.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-01 02:56:59.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422774,ok=422774,error=0, records=41
[INFO ] 2026-06-01 02:57:04.724 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:57:04.724 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21152/300s
[WARN ] 2026-06-01 02:57:07.585 [10660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:57:14.200 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 02:57:14.200 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422775,ok=422775,error=0, records=41
[INFO ] 2026-06-01 02:57:19.724 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:57:22.589 [10660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:57:29.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 02:57:29.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422776,ok=422776,error=0, records=41
[INFO ] 2026-06-01 02:57:31.309 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17611/300s
[INFO ] 2026-06-01 02:57:31.310 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876704},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 02:57:31.490 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 02:57:31.490 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 02:57:31.490 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 02:57:31.490 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 02:57:31.490 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 02:57:31.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:57:31.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 02:57:31.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:57:31.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 02:57:31.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:57:31.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 02:57:34.725 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:57:37.594 [10660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:57:37.940 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21150/300s
[INFO ] 2026-06-01 02:57:39.818 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21150/300s
[INFO ] 2026-06-01 02:57:44.209 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 02:57:44.209 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422777,ok=422777,error=0, records=41
[INFO ] 2026-06-01 02:57:47.507 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21150/300s
[INFO ] 2026-06-01 02:57:49.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:57:52.599 [10709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:57:59.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 02:57:59.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422778,ok=422778,error=0, records=41
[INFO ] 2026-06-01 02:58:04.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:58:07.604 [10679] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:58:14.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 02:58:14.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422779,ok=422779,error=0, records=41
[INFO ] 2026-06-01 02:58:19.727 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:58:22.609 [10650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:58:29.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 02:58:29.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422780,ok=422780,error=0, records=41
[INFO ] 2026-06-01 02:58:34.727 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:58:37.615 [10709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:58:44.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 02:58:44.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422781,ok=422781,error=0, records=41
[INFO ] 2026-06-01 02:58:49.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:58:52.620 [10679] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:58:59.319 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 02:58:59.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422782,ok=422782,error=0, records=41
[INFO ] 2026-06-01 02:59:04.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:59:07.625 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:59:14.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 02:59:14.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422783,ok=422783,error=0, records=41
[INFO ] 2026-06-01 02:59:19.729 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:59:22.631 [10660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:59:29.331 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 02:59:29.331 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422784,ok=422784,error=0, records=41
[INFO ] 2026-06-01 02:59:34.729 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 02:59:37.636 [10650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:59:44.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 02:59:44.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422785,ok=422785,error=0, records=41
[INFO ] 2026-06-01 02:59:49.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 02:59:51.140 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21145/300s
[WARN ] 2026-06-01 02:59:52.640 [10650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 02:59:59.344 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 02:59:59.345 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422786,ok=422786,error=0, records=41
[INFO ] 2026-06-01 02:59:59.345 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21141/300s
[INFO ] 2026-06-01 03:00:00.666 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21154/300s
[INFO ] 2026-06-01 03:00:04.731 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:00:07.645 [10709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:00:14.351 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 03:00:14.351 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422787,ok=422787,error=0, records=41
[INFO ] 2026-06-01 03:00:19.731 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:00:22.650 [10679] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:00:29.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 03:00:29.357 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422788,ok=422788,error=0, records=41
[INFO ] 2026-06-01 03:00:31.492 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876624},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:00:31.665 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:00:31.666 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 03:00:31.666 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:00:31.666 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:00:31.666 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:00:31.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:00:31.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:00:31.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:00:31.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:00:31.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:00:31.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:00:34.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:00:37.655 [10709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:00:40.736 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21154/300s
[INFO ] 2026-06-01 03:00:44.453 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 03:00:44.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422789,ok=422789,error=0, records=41
[INFO ] 2026-06-01 03:00:49.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:00:52.660 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:00:54.669 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21141/300s
[INFO ] 2026-06-01 03:00:59.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 03:00:59.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422790,ok=422790,error=0, records=41
[INFO ] 2026-06-01 03:01:04.733 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:01:07.665 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:01:14.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 03:01:14.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422791,ok=422791,error=0, records=41
[INFO ] 2026-06-01 03:01:19.733 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:01:22.670 [10709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:01:29.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 03:01:29.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422792,ok=422792,error=0, records=41
[INFO ] 2026-06-01 03:01:30.421 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21150/300s
[INFO ] 2026-06-01 03:01:34.734 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:01:37.674 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:01:44.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 03:01:44.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422793,ok=422793,error=0, records=41
[INFO ] 2026-06-01 03:01:49.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:01:52.679 [10709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:01:59.560 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 03:01:59.560 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422794,ok=422794,error=0, records=41
[INFO ] 2026-06-01 03:02:04.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:02:04.735 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21153/300s
[WARN ] 2026-06-01 03:02:07.683 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:02:14.565 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 03:02:14.565 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422795,ok=422795,error=0, records=41
[INFO ] 2026-06-01 03:02:19.736 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:02:22.687 [10709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:02:29.570 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 03:02:29.570 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422796,ok=422796,error=0, records=41
[INFO ] 2026-06-01 03:02:34.737 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:02:37.692 [10709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:02:37.974 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21151/300s
[INFO ] 2026-06-01 03:02:39.858 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21151/300s
[INFO ] 2026-06-01 03:02:44.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 03:02:44.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422797,ok=422797,error=0, records=41
[INFO ] 2026-06-01 03:02:47.537 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21151/300s
[INFO ] 2026-06-01 03:02:49.737 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:02:52.697 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:02:59.583 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 03:02:59.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422798,ok=422798,error=0, records=41
[INFO ] 2026-06-01 03:03:04.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:03:07.702 [10660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:03:14.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 03:03:14.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422799,ok=422799,error=0, records=41
[INFO ] 2026-06-01 03:03:19.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:03:22.707 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:03:29.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 03:03:29.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422800,ok=422800,error=0, records=41
[INFO ] 2026-06-01 03:03:31.666 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17612/300s
[INFO ] 2026-06-01 03:03:31.667 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876552},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:03:31.821 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:03:31.821 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 03:03:31.821 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:03:31.821 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:03:31.821 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:03:31.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:03:31.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:03:31.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:03:31.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:03:31.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:03:31.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:03:34.739 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 03:03:34.739 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 03:03:37.714 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:03:44.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 03:03:44.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422801,ok=422801,error=0, records=41
[INFO ] 2026-06-01 03:03:49.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:03:52.719 [10679] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:03:59.606 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 03:03:59.606 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422802,ok=422802,error=0, records=41
[INFO ] 2026-06-01 03:04:04.741 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:04:07.725 [10679] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:04:14.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 03:04:14.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422803,ok=422803,error=0, records=41
[INFO ] 2026-06-01 03:04:19.741 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:04:22.729 [10660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:04:29.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 03:04:29.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422804,ok=422804,error=0, records=41
[INFO ] 2026-06-01 03:04:34.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:04:37.734 [10709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:04:44.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 03:04:44.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422805,ok=422805,error=0, records=41
[INFO ] 2026-06-01 03:04:49.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:04:51.239 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21146/300s
[WARN ] 2026-06-01 03:04:52.740 [10660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:04:59.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 03:04:59.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422806,ok=422806,error=0, records=41
[INFO ] 2026-06-01 03:04:59.633 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21142/300s
[INFO ] 2026-06-01 03:05:00.669 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21155/300s
[INFO ] 2026-06-01 03:05:04.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:05:07.745 [10679] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:05:14.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 03:05:14.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422807,ok=422807,error=0, records=41
[INFO ] 2026-06-01 03:05:19.744 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:05:22.750 [10660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:05:29.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 03:05:29.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422808,ok=422808,error=0, records=41
[INFO ] 2026-06-01 03:05:34.744 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:05:37.755 [10650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:05:40.742 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21155/300s
[INFO ] 2026-06-01 03:05:44.647 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 03:05:44.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422809,ok=422809,error=0, records=41
[INFO ] 2026-06-01 03:05:49.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:05:52.761 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:05:54.848 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21142/300s
[INFO ] 2026-06-01 03:05:59.652 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 03:05:59.652 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422810,ok=422810,error=0, records=41
[INFO ] 2026-06-01 03:06:04.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:06:07.766 [10709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:06:14.658 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10388, records=41
[INFO ] 2026-06-01 03:06:14.658 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422811,ok=422811,error=0, records=41
[INFO ] 2026-06-01 03:06:19.746 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:06:22.772 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:06:29.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-01 03:06:29.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422812,ok=422812,error=0, records=41
[INFO ] 2026-06-01 03:06:30.472 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21151/300s
[INFO ] 2026-06-01 03:06:31.823 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876480},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:06:31.981 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:06:31.981 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 03:06:31.981 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:06:31.981 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:06:31.981 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:06:32.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:06:32.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:06:32.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:06:32.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:06:32.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:06:32.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:06:34.746 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:06:37.777 [10650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:06:44.674 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10388, records=41
[INFO ] 2026-06-01 03:06:44.674 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422813,ok=422813,error=0, records=41
[INFO ] 2026-06-01 03:06:49.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:06:52.782 [10650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:06:59.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-01 03:06:59.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422814,ok=422814,error=0, records=41
[INFO ] 2026-06-01 03:07:04.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:07:04.748 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21154/300s
[WARN ] 2026-06-01 03:07:07.788 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:07:14.684 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 03:07:14.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422815,ok=422815,error=0, records=41
[INFO ] 2026-06-01 03:07:19.748 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:07:22.793 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:07:29.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 03:07:29.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422816,ok=422816,error=0, records=41
[INFO ] 2026-06-01 03:07:34.749 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:07:37.798 [10709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:07:38.009 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21152/300s
[INFO ] 2026-06-01 03:07:39.900 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21152/300s
[INFO ] 2026-06-01 03:07:44.694 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 03:07:44.694 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422817,ok=422817,error=0, records=41
[INFO ] 2026-06-01 03:07:47.555 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21152/300s
[INFO ] 2026-06-01 03:07:49.749 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:07:52.803 [10679] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:07:59.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 03:07:59.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422818,ok=422818,error=0, records=41
[INFO ] 2026-06-01 03:08:04.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:08:07.809 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:08:14.706 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-01 03:08:14.706 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422819,ok=422819,error=0, records=41
[INFO ] 2026-06-01 03:08:19.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:08:22.814 [10695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:08:29.713 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 03:08:29.713 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422820,ok=422820,error=0, records=41
[INFO ] 2026-06-01 03:08:34.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:08:37.820 [11291] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:08:44.718 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-01 03:08:44.718 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422821,ok=422821,error=0, records=41
[INFO ] 2026-06-01 03:08:49.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:08:49.751 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 03:08:52.825 [11291] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:08:59.722 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 03:08:59.722 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422822,ok=422822,error=0, records=41
[INFO ] 2026-06-01 03:09:04.752 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:09:07.831 [11272] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:09:14.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 03:09:14.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422823,ok=422823,error=0, records=41
[INFO ] 2026-06-01 03:09:19.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:09:22.836 [11272] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:09:29.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 03:09:29.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422824,ok=422824,error=0, records=41
[INFO ] 2026-06-01 03:09:31.981 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17613/300s
[INFO ] 2026-06-01 03:09:31.983 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876408},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:09:32.149 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:09:32.149 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 03:09:32.149 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:09:32.149 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:09:32.149 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:09:32.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:09:32.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:09:32.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:09:32.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:09:32.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:09:32.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:09:34.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:09:37.841 [11349] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:09:44.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 03:09:44.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422825,ok=422825,error=0, records=41
[INFO ] 2026-06-01 03:09:49.754 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:09:51.346 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21147/300s
[WARN ] 2026-06-01 03:09:52.847 [11374] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:09:59.846 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 03:09:59.846 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422826,ok=422826,error=0, records=41
[INFO ] 2026-06-01 03:09:59.846 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21143/300s
[INFO ] 2026-06-01 03:10:00.672 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21156/300s
[INFO ] 2026-06-01 03:10:04.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:10:07.852 [11320] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:10:14.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-01 03:10:14.850 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422827,ok=422827,error=0, records=41
[INFO ] 2026-06-01 03:10:19.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:10:22.857 [11393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:10:29.884 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 03:10:29.884 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422828,ok=422828,error=0, records=41
[INFO ] 2026-06-01 03:10:34.756 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:10:37.863 [11320] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:10:40.748 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21156/300s
[INFO ] 2026-06-01 03:10:44.891 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10381, records=41
[INFO ] 2026-06-01 03:10:44.891 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422829,ok=422829,error=0, records=41
[INFO ] 2026-06-01 03:10:49.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:10:52.867 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:10:55.022 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21143/300s
[INFO ] 2026-06-01 03:10:59.897 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 03:10:59.897 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422830,ok=422830,error=0, records=41
[INFO ] 2026-06-01 03:11:04.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:11:07.872 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:11:14.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 03:11:14.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422831,ok=422831,error=0, records=41
[INFO ] 2026-06-01 03:11:19.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:11:22.879 [11470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:11:29.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 03:11:29.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422832,ok=422832,error=0, records=41
[INFO ] 2026-06-01 03:11:30.518 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21152/300s
[INFO ] 2026-06-01 03:11:34.759 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:11:37.885 [11475] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:11:44.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-01 03:11:44.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422833,ok=422833,error=0, records=41
[INFO ] 2026-06-01 03:11:49.759 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:11:52.889 [11481] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:11:59.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 03:11:59.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422834,ok=422834,error=0, records=41
[INFO ] 2026-06-01 03:12:04.760 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:12:04.760 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21155/300s
[WARN ] 2026-06-01 03:12:07.894 [11496] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:12:14.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 03:12:14.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422835,ok=422835,error=0, records=41
[INFO ] 2026-06-01 03:12:19.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:12:22.900 [11514] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:12:29.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 03:12:29.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422836,ok=422836,error=0, records=41
[INFO ] 2026-06-01 03:12:32.150 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876332},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:12:32.317 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:12:32.317 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 03:12:32.318 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:12:32.318 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:12:32.318 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:12:32.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:12:32.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:12:32.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:12:32.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:12:32.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:12:32.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:12:34.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:12:37.906 [11496] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:12:38.034 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21153/300s
[INFO ] 2026-06-01 03:12:39.936 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21153/300s
[INFO ] 2026-06-01 03:12:44.939 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 03:12:44.939 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422837,ok=422837,error=0, records=41
[INFO ] 2026-06-01 03:12:47.575 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21153/300s
[INFO ] 2026-06-01 03:12:49.762 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:12:52.912 [11548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:12:59.944 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 03:12:59.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422838,ok=422838,error=0, records=41
[INFO ] 2026-06-01 03:13:04.762 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:13:07.917 [11554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:13:14.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 03:13:14.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422839,ok=422839,error=0, records=41
[INFO ] 2026-06-01 03:13:19.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:13:22.922 [11603] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:13:29.956 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 03:13:29.956 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422840,ok=422840,error=0, records=41
[INFO ] 2026-06-01 03:13:34.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 03:13:34.763 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 03:13:37.928 [11613] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:13:44.961 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 03:13:44.961 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422841,ok=422841,error=0, records=41
[INFO ] 2026-06-01 03:13:49.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:13:52.933 [11635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:13:59.978 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-01 03:13:59.978 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422842,ok=422842,error=0, records=41
[INFO ] 2026-06-01 03:14:04.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:14:07.938 [11581] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:14:14.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 03:14:14.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422843,ok=422843,error=0, records=41
[INFO ] 2026-06-01 03:14:19.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:14:22.944 [11669] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:14:29.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 03:14:29.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422844,ok=422844,error=0, records=41
[INFO ] 2026-06-01 03:14:34.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:14:37.950 [11664] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:14:44.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 03:14:44.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422845,ok=422845,error=0, records=41
[INFO ] 2026-06-01 03:14:49.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:14:51.454 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21148/300s
[WARN ] 2026-06-01 03:14:52.954 [11693] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:15:00.003 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 03:15:00.003 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422846,ok=422846,error=0, records=41
[INFO ] 2026-06-01 03:15:00.003 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21144/300s
[INFO ] 2026-06-01 03:15:00.675 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21157/300s
[INFO ] 2026-06-01 03:15:04.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:15:07.960 [11669] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:15:15.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 03:15:15.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422847,ok=422847,error=0, records=41
[INFO ] 2026-06-01 03:15:19.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:15:22.965 [11652] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:15:30.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 03:15:30.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422848,ok=422848,error=0, records=41
[INFO ] 2026-06-01 03:15:32.318 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17614/300s
[INFO ] 2026-06-01 03:15:32.319 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876260},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:15:32.465 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:15:32.465 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 03:15:32.465 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:15:32.465 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:15:32.465 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:15:32.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:15:32.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:15:32.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:15:32.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:15:32.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:15:32.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:15:34.768 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:15:37.969 [11720] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:15:40.754 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21157/300s
[INFO ] 2026-06-01 03:15:45.021 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 03:15:45.021 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422849,ok=422849,error=0, records=41
[INFO ] 2026-06-01 03:15:49.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:15:52.973 [11749] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:15:55.200 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21144/300s
[INFO ] 2026-06-01 03:16:00.027 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 03:16:00.027 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422850,ok=422850,error=0, records=41
[INFO ] 2026-06-01 03:16:04.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:16:07.979 [11749] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:16:15.033 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 03:16:15.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422851,ok=422851,error=0, records=41
[INFO ] 2026-06-01 03:16:19.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:16:22.983 [11749] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:16:30.039 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-01 03:16:30.039 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422852,ok=422852,error=0, records=41
[INFO ] 2026-06-01 03:16:30.571 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21153/300s
[INFO ] 2026-06-01 03:16:34.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:16:37.988 [11720] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:16:45.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-01 03:16:45.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422853,ok=422853,error=0, records=41
[INFO ] 2026-06-01 03:16:49.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:16:52.992 [11652] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:17:00.052 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 03:17:00.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422854,ok=422854,error=0, records=41
[INFO ] 2026-06-01 03:17:04.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:17:04.772 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21156/300s
[WARN ] 2026-06-01 03:17:07.997 [11720] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:17:15.057 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 03:17:15.057 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422855,ok=422855,error=0, records=41
[INFO ] 2026-06-01 03:17:19.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:17:23.002 [11777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:17:30.124 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 03:17:30.124 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422856,ok=422856,error=0, records=41
[INFO ] 2026-06-01 03:17:34.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:17:38.007 [11720] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:17:38.070 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21154/300s
[INFO ] 2026-06-01 03:17:39.971 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21154/300s
[INFO ] 2026-06-01 03:17:45.129 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 03:17:45.129 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422857,ok=422857,error=0, records=41
[INFO ] 2026-06-01 03:17:47.597 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21154/300s
[INFO ] 2026-06-01 03:17:49.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:17:53.012 [11821] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:18:00.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-01 03:18:00.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422858,ok=422858,error=0, records=41
[INFO ] 2026-06-01 03:18:04.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:18:08.016 [11821] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:18:15.140 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10379, records=41
[INFO ] 2026-06-01 03:18:15.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422859,ok=422859,error=0, records=41
[INFO ] 2026-06-01 03:18:19.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:18:23.022 [11863] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:18:30.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-01 03:18:30.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422860,ok=422860,error=0, records=41
[INFO ] 2026-06-01 03:18:32.467 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876184},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:18:32.627 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:18:32.627 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 03:18:32.627 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:18:32.627 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:18:32.627 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:18:32.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:18:32.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:18:32.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:18:32.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:18:32.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:18:32.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:18:34.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:18:38.027 [11912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:18:45.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 03:18:45.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422861,ok=422861,error=0, records=41
[INFO ] 2026-06-01 03:18:49.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:18:53.031 [11928] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:19:00.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 03:19:00.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422862,ok=422862,error=0, records=41
[INFO ] 2026-06-01 03:19:04.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:19:08.036 [11821] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:19:15.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 03:19:15.170 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422863,ok=422863,error=0, records=41
[INFO ] 2026-06-01 03:19:19.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:19:23.040 [11956] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:19:30.174 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 03:19:30.174 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422864,ok=422864,error=0, records=41
[INFO ] 2026-06-01 03:19:34.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:19:38.045 [11863] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:19:45.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 03:19:45.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422865,ok=422865,error=0, records=41
[INFO ] 2026-06-01 03:19:49.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:19:51.549 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21149/300s
[WARN ] 2026-06-01 03:19:53.050 [12013] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:20:00.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 03:20:00.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422866,ok=422866,error=0, records=41
[INFO ] 2026-06-01 03:20:00.187 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21145/300s
[INFO ] 2026-06-01 03:20:00.677 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21158/300s
[INFO ] 2026-06-01 03:20:04.778 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:20:07.556 [12035] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:20:15.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 03:20:15.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422867,ok=422867,error=0, records=41
[INFO ] 2026-06-01 03:20:19.778 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:20:22.562 [12053] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:20:30.261 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 03:20:30.261 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422868,ok=422868,error=0, records=41
[INFO ] 2026-06-01 03:20:34.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:20:37.566 [12047] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:20:40.760 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21158/300s
[INFO ] 2026-06-01 03:20:45.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 03:20:45.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422869,ok=422869,error=0, records=41
[INFO ] 2026-06-01 03:20:49.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:20:52.570 [12092] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:20:55.376 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21145/300s
[INFO ] 2026-06-01 03:21:00.271 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 03:21:00.271 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422870,ok=422870,error=0, records=41
[INFO ] 2026-06-01 03:21:04.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:21:07.574 [12074] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:21:15.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10142, records=41
[INFO ] 2026-06-01 03:21:15.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422871,ok=422871,error=0, records=41
[INFO ] 2026-06-01 03:21:19.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:21:22.579 [12120] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:21:30.282 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 03:21:30.282 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422872,ok=422872,error=0, records=41
[INFO ] 2026-06-01 03:21:30.614 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21154/300s
[INFO ] 2026-06-01 03:21:32.627 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17615/300s
[INFO ] 2026-06-01 03:21:32.628 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876088},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:21:32.805 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:21:32.805 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 03:21:32.805 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:21:32.805 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:21:32.805 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:21:32.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:21:32.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:21:32.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:21:32.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:21:32.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:21:32.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:21:34.781 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:21:37.584 [12136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:21:45.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 03:21:45.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422873,ok=422873,error=0, records=41
[INFO ] 2026-06-01 03:21:49.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:21:52.588 [12148] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:22:00.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10150, records=41
[INFO ] 2026-06-01 03:22:00.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422874,ok=422874,error=0, records=41
[INFO ] 2026-06-01 03:22:04.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:22:04.782 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21157/300s
[WARN ] 2026-06-01 03:22:07.593 [12154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:22:15.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 03:22:15.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422875,ok=422875,error=0, records=41
[INFO ] 2026-06-01 03:22:19.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:22:22.597 [12186] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:22:30.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 03:22:30.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422876,ok=422876,error=0, records=41
[INFO ] 2026-06-01 03:22:34.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:22:37.602 [12171] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:22:38.081 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21155/300s
[INFO ] 2026-06-01 03:22:39.982 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21155/300s
[INFO ] 2026-06-01 03:22:45.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 03:22:45.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422877,ok=422877,error=0, records=41
[INFO ] 2026-06-01 03:22:47.695 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21155/300s
[INFO ] 2026-06-01 03:22:49.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:22:52.607 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:23:00.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-01 03:23:00.314 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422878,ok=422878,error=0, records=41
[INFO ] 2026-06-01 03:23:04.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:23:07.613 [12176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:23:15.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 03:23:15.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422879,ok=422879,error=0, records=41
[INFO ] 2026-06-01 03:23:19.785 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:23:22.617 [12176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:23:30.407 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 03:23:30.407 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422880,ok=422880,error=0, records=41
[INFO ] 2026-06-01 03:23:34.785 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 03:23:34.786 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 03:23:37.622 [12154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:23:45.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 03:23:45.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422881,ok=422881,error=0, records=41
[INFO ] 2026-06-01 03:23:49.786 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:23:49.786 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 03:23:52.627 [12154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:24:00.422 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 03:24:00.422 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422882,ok=422882,error=0, records=41
[INFO ] 2026-06-01 03:24:04.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:24:07.633 [12176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:24:15.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 03:24:15.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422883,ok=422883,error=0, records=41
[INFO ] 2026-06-01 03:24:19.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:24:22.638 [12186] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:24:30.449 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 03:24:30.449 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422884,ok=422884,error=0, records=41
[INFO ] 2026-06-01 03:24:32.807 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20876008},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:24:32.985 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:24:32.986 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 03:24:32.986 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:24:32.986 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:24:32.986 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:24:33.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:24:33.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:24:33.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:24:33.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:24:33.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:24:33.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:24:34.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:24:37.643 [12176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:24:45.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 03:24:45.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422885,ok=422885,error=0, records=41
[WARN ] 2026-06-01 03:24:47.648 [12176] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/9474/stat), No such file or directory
[INFO ] 2026-06-01 03:24:49.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:24:51.649 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21150/300s
[WARN ] 2026-06-01 03:24:52.650 [12186] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:25:00.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 03:25:00.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422886,ok=422886,error=0, records=41
[INFO ] 2026-06-01 03:25:00.459 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21146/300s
[INFO ] 2026-06-01 03:25:00.680 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21159/300s
[INFO ] 2026-06-01 03:25:04.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:25:07.655 [12154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:25:15.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 03:25:15.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422887,ok=422887,error=0, records=41
[INFO ] 2026-06-01 03:25:19.791 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:25:22.660 [12176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:25:30.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 03:25:30.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422888,ok=422888,error=0, records=41
[INFO ] 2026-06-01 03:25:34.791 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:25:37.665 [12176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:25:40.765 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21159/300s
[INFO ] 2026-06-01 03:25:45.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 03:25:45.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422889,ok=422889,error=0, records=41
[INFO ] 2026-06-01 03:25:49.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:25:52.670 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:25:55.545 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21146/300s
[INFO ] 2026-06-01 03:26:00.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 03:26:00.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422890,ok=422890,error=0, records=41
[INFO ] 2026-06-01 03:26:04.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:26:07.675 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:26:15.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 03:26:15.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422891,ok=422891,error=0, records=41
[INFO ] 2026-06-01 03:26:19.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:26:22.680 [12171] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:26:30.491 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 03:26:30.491 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422892,ok=422892,error=0, records=41
[INFO ] 2026-06-01 03:26:30.660 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21155/300s
[INFO ] 2026-06-01 03:26:34.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:26:37.686 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:26:45.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 03:26:45.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422893,ok=422893,error=0, records=41
[INFO ] 2026-06-01 03:26:49.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:26:52.692 [12171] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:27:00.502 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 03:27:00.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422894,ok=422894,error=0, records=41
[INFO ] 2026-06-01 03:27:04.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:27:04.794 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21158/300s
[WARN ] 2026-06-01 03:27:07.698 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:27:15.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 03:27:15.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422895,ok=422895,error=0, records=41
[INFO ] 2026-06-01 03:27:19.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:27:22.704 [12154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:27:30.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 03:27:30.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422896,ok=422896,error=0, records=41
[INFO ] 2026-06-01 03:27:32.986 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17616/300s
[INFO ] 2026-06-01 03:27:32.987 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20875912},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:27:33.156 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:27:33.156 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 03:27:33.156 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:27:33.156 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:27:33.156 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:27:33.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:27:33.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:27:33.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:27:33.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:27:33.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:27:33.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:27:34.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:27:37.709 [12186] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:27:38.100 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21156/300s
[INFO ] 2026-06-01 03:27:40.002 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21156/300s
[INFO ] 2026-06-01 03:27:45.528 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 03:27:45.528 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422897,ok=422897,error=0, records=41
[INFO ] 2026-06-01 03:27:47.705 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21156/300s
[INFO ] 2026-06-01 03:27:49.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:27:52.715 [12171] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:28:00.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 03:28:00.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422898,ok=422898,error=0, records=41
[INFO ] 2026-06-01 03:28:04.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:28:07.721 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:28:15.541 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 03:28:15.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422899,ok=422899,error=0, records=41
[INFO ] 2026-06-01 03:28:19.797 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:28:22.726 [12186] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:28:30.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-01 03:28:30.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422900,ok=422900,error=0, records=41
[INFO ] 2026-06-01 03:28:34.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:28:37.732 [12176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:28:45.550 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 03:28:45.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422901,ok=422901,error=0, records=41
[INFO ] 2026-06-01 03:28:49.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:28:52.738 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:29:00.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 03:29:00.555 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422902,ok=422902,error=0, records=41
[INFO ] 2026-06-01 03:29:04.799 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:29:07.743 [12186] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:29:15.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 03:29:15.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422903,ok=422903,error=0, records=41
[INFO ] 2026-06-01 03:29:19.799 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:29:22.748 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:29:30.565 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 03:29:30.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422904,ok=422904,error=0, records=41
[INFO ] 2026-06-01 03:29:34.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:29:37.755 [12186] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:29:45.572 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 03:29:45.572 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422905,ok=422905,error=0, records=41
[INFO ] 2026-06-01 03:29:49.801 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:29:51.760 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21151/300s
[WARN ] 2026-06-01 03:29:52.760 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:30:00.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 03:30:00.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422906,ok=422906,error=0, records=41
[INFO ] 2026-06-01 03:30:00.577 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21147/300s
[INFO ] 2026-06-01 03:30:00.683 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21160/300s
[INFO ] 2026-06-01 03:30:04.801 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:30:07.766 [12176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:30:15.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-01 03:30:15.584 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422907,ok=422907,error=0, records=41
[INFO ] 2026-06-01 03:30:19.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:30:22.771 [12171] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:30:30.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-01 03:30:30.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422908,ok=422908,error=0, records=41
[INFO ] 2026-06-01 03:30:33.158 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20875820},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:30:33.330 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:30:33.330 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 03:30:33.330 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:30:33.330 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:30:33.330 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:30:33.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:30:33.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:30:33.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:30:33.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:30:33.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:30:33.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:30:34.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:30:37.776 [12154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:30:40.771 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21160/300s
[INFO ] 2026-06-01 03:30:45.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-01 03:30:45.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422909,ok=422909,error=0, records=41
[INFO ] 2026-06-01 03:30:49.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:30:52.781 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:30:55.719 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21147/300s
[INFO ] 2026-06-01 03:31:00.599 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 03:31:00.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422910,ok=422910,error=0, records=41
[INFO ] 2026-06-01 03:31:04.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:31:07.787 [12154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:31:15.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 03:31:15.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422911,ok=422911,error=0, records=41
[INFO ] 2026-06-01 03:31:19.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:31:22.792 [12186] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:31:30.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 03:31:30.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422912,ok=422912,error=0, records=41
[INFO ] 2026-06-01 03:31:30.705 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21156/300s
[INFO ] 2026-06-01 03:31:34.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:31:37.798 [12154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:31:45.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 03:31:45.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422913,ok=422913,error=0, records=41
[INFO ] 2026-06-01 03:31:49.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:31:52.804 [12154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:32:00.624 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 03:32:00.624 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422914,ok=422914,error=0, records=41
[INFO ] 2026-06-01 03:32:04.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:32:04.805 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21159/300s
[WARN ] 2026-06-01 03:32:07.809 [12171] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:32:15.629 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-01 03:32:15.629 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422915,ok=422915,error=0, records=41
[INFO ] 2026-06-01 03:32:19.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:32:22.814 [12751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:32:30.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 03:32:30.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422916,ok=422916,error=0, records=41
[INFO ] 2026-06-01 03:32:34.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:32:37.820 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:32:38.111 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21157/300s
[INFO ] 2026-06-01 03:32:40.013 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21157/300s
[INFO ] 2026-06-01 03:32:45.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 03:32:45.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422917,ok=422917,error=0, records=41
[INFO ] 2026-06-01 03:32:47.717 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21157/300s
[INFO ] 2026-06-01 03:32:49.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:32:52.825 [12191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:33:00.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 03:33:00.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422918,ok=422918,error=0, records=41
[INFO ] 2026-06-01 03:33:04.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:33:07.831 [12809] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:33:15.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 03:33:15.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422919,ok=422919,error=0, records=41
[INFO ] 2026-06-01 03:33:19.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:33:22.836 [12795] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:33:30.655 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 03:33:30.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422920,ok=422920,error=0, records=41
[INFO ] 2026-06-01 03:33:33.330 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17617/300s
[INFO ] 2026-06-01 03:33:33.332 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20875740},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:33:33.515 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:33:33.515 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 03:33:33.515 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:33:33.515 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:33:33.515 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:33:33.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:33:33.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:33:33.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:33:33.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:33:33.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:33:33.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:33:34.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 03:33:34.809 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 03:33:37.846 [12746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:33:45.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 03:33:45.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422921,ok=422921,error=0, records=41
[INFO ] 2026-06-01 03:33:49.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:33:52.851 [12846] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:34:00.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 03:34:00.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422922,ok=422922,error=0, records=41
[INFO ] 2026-06-01 03:34:04.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:34:07.856 [12746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:34:15.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 03:34:15.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422923,ok=422923,error=0, records=41
[INFO ] 2026-06-01 03:34:19.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:34:22.860 [12874] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:34:30.682 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 03:34:30.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422924,ok=422924,error=0, records=41
[INFO ] 2026-06-01 03:34:34.811 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:34:37.864 [12746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:34:45.687 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 03:34:45.687 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422925,ok=422925,error=0, records=41
[INFO ] 2026-06-01 03:34:49.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:34:51.869 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21152/300s
[WARN ] 2026-06-01 03:34:52.870 [12888] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:35:00.685 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21161/300s
[INFO ] 2026-06-01 03:35:00.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-01 03:35:00.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422926,ok=422926,error=0, records=41
[INFO ] 2026-06-01 03:35:00.692 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21148/300s
[INFO ] 2026-06-01 03:35:04.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:35:07.876 [12874] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:35:15.699 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 03:35:15.699 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422927,ok=422927,error=0, records=41
[INFO ] 2026-06-01 03:35:19.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:35:22.882 [12933] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:35:30.704 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 03:35:30.704 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422928,ok=422928,error=0, records=41
[INFO ] 2026-06-01 03:35:34.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:35:37.887 [12903] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:35:40.776 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21161/300s
[INFO ] 2026-06-01 03:35:45.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 03:35:45.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422929,ok=422929,error=0, records=41
[INFO ] 2026-06-01 03:35:49.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:35:52.893 [12956] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:35:55.892 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21148/300s
[INFO ] 2026-06-01 03:36:00.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 03:36:00.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422930,ok=422930,error=0, records=41
[INFO ] 2026-06-01 03:36:04.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:36:07.900 [12967] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:36:15.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 03:36:15.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422931,ok=422931,error=0, records=41
[INFO ] 2026-06-01 03:36:19.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:36:22.906 [13017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:36:30.749 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21157/300s
[INFO ] 2026-06-01 03:36:30.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 03:36:30.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422932,ok=422932,error=0, records=41
[INFO ] 2026-06-01 03:36:33.517 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20875664},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:36:33.692 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:36:33.692 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 03:36:33.692 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:36:33.692 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:36:33.692 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:36:33.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:36:33.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:36:33.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:36:33.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:36:33.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:36:33.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:36:34.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:36:37.911 [13048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:36:45.782 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 03:36:45.782 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422933,ok=422933,error=0, records=41
[INFO ] 2026-06-01 03:36:49.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:36:52.917 [12956] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:37:00.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 03:37:00.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422934,ok=422934,error=0, records=41
[INFO ] 2026-06-01 03:37:04.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:37:04.816 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21160/300s
[WARN ] 2026-06-01 03:37:07.922 [13070] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:37:15.793 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 03:37:15.793 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422935,ok=422935,error=0, records=41
[INFO ] 2026-06-01 03:37:19.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:37:22.929 [13085] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:37:30.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 03:37:30.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422936,ok=422936,error=0, records=41
[INFO ] 2026-06-01 03:37:34.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:37:37.934 [13054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:37:38.119 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21158/300s
[INFO ] 2026-06-01 03:37:40.021 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21158/300s
[INFO ] 2026-06-01 03:37:45.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 03:37:45.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422937,ok=422937,error=0, records=41
[INFO ] 2026-06-01 03:37:47.722 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21158/300s
[INFO ] 2026-06-01 03:37:49.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:37:52.940 [13054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:38:00.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 03:38:00.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422938,ok=422938,error=0, records=41
[INFO ] 2026-06-01 03:38:04.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:38:07.946 [13128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:38:15.818 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10378, records=41
[INFO ] 2026-06-01 03:38:15.818 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422939,ok=422939,error=0, records=41
[INFO ] 2026-06-01 03:38:19.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:38:22.951 [13155] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:38:30.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-01 03:38:30.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422940,ok=422940,error=0, records=41
[INFO ] 2026-06-01 03:38:34.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:38:37.956 [13149] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:38:45.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-01 03:38:45.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422941,ok=422941,error=0, records=41
[INFO ] 2026-06-01 03:38:49.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:38:49.820 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 03:38:52.960 [13054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:39:00.837 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 03:39:00.837 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422942,ok=422942,error=0, records=41
[INFO ] 2026-06-01 03:39:04.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:39:07.966 [13149] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:39:15.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 03:39:15.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422943,ok=422943,error=0, records=41
[INFO ] 2026-06-01 03:39:19.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:39:22.972 [13128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:39:30.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 03:39:30.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422944,ok=422944,error=0, records=41
[INFO ] 2026-06-01 03:39:33.692 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17618/300s
[INFO ] 2026-06-01 03:39:33.694 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20875584},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:39:33.847 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:39:33.847 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 03:39:33.847 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:39:33.847 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:39:33.847 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:39:33.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:39:33.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:39:33.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:39:33.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:39:33.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:39:33.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:39:34.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:39:37.977 [13128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:39:45.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10134, records=41
[INFO ] 2026-06-01 03:39:45.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422945,ok=422945,error=0, records=41
[INFO ] 2026-06-01 03:39:49.824 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:39:51.981 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21153/300s
[WARN ] 2026-06-01 03:39:52.982 [13139] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:40:00.688 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21162/300s
[INFO ] 2026-06-01 03:40:00.961 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10147, records=41
[INFO ] 2026-06-01 03:40:00.961 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422946,ok=422946,error=0, records=41
[INFO ] 2026-06-01 03:40:00.961 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21149/300s
[INFO ] 2026-06-01 03:40:04.824 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:40:07.987 [13259] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:40:15.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 03:40:15.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422947,ok=422947,error=0, records=41
[INFO ] 2026-06-01 03:40:19.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:40:22.993 [13139] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:40:30.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 03:40:30.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422948,ok=422948,error=0, records=41
[INFO ] 2026-06-01 03:40:34.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:40:37.998 [13139] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:40:40.782 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21162/300s
[INFO ] 2026-06-01 03:40:45.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 03:40:45.981 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422949,ok=422949,error=0, records=41
[INFO ] 2026-06-01 03:40:49.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:40:53.002 [13273] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:40:56.069 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21149/300s
[INFO ] 2026-06-01 03:41:00.987 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 03:41:00.987 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422950,ok=422950,error=0, records=41
[INFO ] 2026-06-01 03:41:04.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:41:08.007 [13273] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:41:15.992 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 03:41:15.992 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422951,ok=422951,error=0, records=41
[INFO ] 2026-06-01 03:41:19.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:41:23.012 [13225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:41:30.797 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21158/300s
[INFO ] 2026-06-01 03:41:31.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 03:41:31.063 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422952,ok=422952,error=0, records=41
[INFO ] 2026-06-01 03:41:34.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:41:38.017 [13315] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:41:46.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 03:41:46.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422953,ok=422953,error=0, records=41
[INFO ] 2026-06-01 03:41:49.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:41:53.022 [13342] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:42:01.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 03:42:01.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422954,ok=422954,error=0, records=41
[INFO ] 2026-06-01 03:42:04.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:42:04.829 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21161/300s
[WARN ] 2026-06-01 03:42:08.026 [13287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:42:16.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-01 03:42:16.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422955,ok=422955,error=0, records=41
[INFO ] 2026-06-01 03:42:19.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:42:23.031 [13370] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:42:31.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 03:42:31.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422956,ok=422956,error=0, records=41
[INFO ] 2026-06-01 03:42:33.849 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20875504},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:42:34.009 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:42:34.009 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 03:42:34.009 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:42:34.009 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:42:34.009 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:42:34.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:42:34.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:42:34.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:42:34.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:42:34.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:42:34.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:42:34.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:42:38.036 [13399] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:42:38.140 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21159/300s
[INFO ] 2026-06-01 03:42:40.042 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21159/300s
[INFO ] 2026-06-01 03:42:46.182 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 03:42:46.182 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422957,ok=422957,error=0, records=41
[INFO ] 2026-06-01 03:42:47.746 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21159/300s
[INFO ] 2026-06-01 03:42:49.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:42:53.042 [13412] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:43:01.191 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 03:43:01.191 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422958,ok=422958,error=0, records=41
[INFO ] 2026-06-01 03:43:04.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:43:08.047 [13424] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:43:16.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 03:43:16.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422959,ok=422959,error=0, records=41
[INFO ] 2026-06-01 03:43:19.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:43:23.052 [13445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:43:31.212 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 03:43:31.212 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422960,ok=422960,error=0, records=41
[INFO ] 2026-06-01 03:43:34.832 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 03:43:34.832 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 03:43:37.557 [13461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:43:46.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 03:43:46.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422961,ok=422961,error=0, records=41
[INFO ] 2026-06-01 03:43:49.832 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:43:52.561 [13461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:44:01.221 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 03:44:01.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422962,ok=422962,error=0, records=41
[INFO ] 2026-06-01 03:44:04.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:44:07.566 [13424] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:44:16.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10388, records=41
[INFO ] 2026-06-01 03:44:16.226 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422963,ok=422963,error=0, records=41
[INFO ] 2026-06-01 03:44:19.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:44:22.570 [13523] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:44:31.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-01 03:44:31.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422964,ok=422964,error=0, records=41
[INFO ] 2026-06-01 03:44:34.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:44:37.575 [13507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:44:46.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-01 03:44:46.236 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422965,ok=422965,error=0, records=41
[INFO ] 2026-06-01 03:44:49.835 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:44:52.079 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21154/300s
[WARN ] 2026-06-01 03:44:52.580 [13554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:45:00.691 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21163/300s
[INFO ] 2026-06-01 03:45:01.242 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-01 03:45:01.242 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422966,ok=422966,error=0, records=41
[INFO ] 2026-06-01 03:45:01.242 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21150/300s
[INFO ] 2026-06-01 03:45:04.835 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:45:07.584 [13564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:45:16.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 03:45:16.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422967,ok=422967,error=0, records=41
[INFO ] 2026-06-01 03:45:19.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:45:22.589 [13576] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:45:31.250 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 03:45:31.250 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422968,ok=422968,error=0, records=41
[INFO ] 2026-06-01 03:45:34.009 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17619/300s
[INFO ] 2026-06-01 03:45:34.011 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20875428},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:45:34.167 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:45:34.167 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 03:45:34.168 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:45:34.168 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:45:34.168 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:45:34.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:45:34.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:45:34.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:45:34.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:45:34.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:45:34.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:45:34.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:45:37.594 [13564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:45:40.787 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21163/300s
[INFO ] 2026-06-01 03:45:46.255 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 03:45:46.255 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422969,ok=422969,error=0, records=41
[INFO ] 2026-06-01 03:45:49.837 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:45:52.598 [13603] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:45:56.243 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21150/300s
[INFO ] 2026-06-01 03:46:01.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 03:46:01.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422970,ok=422970,error=0, records=41
[INFO ] 2026-06-01 03:46:04.837 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:46:07.604 [13624] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:46:16.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10354, records=41
[INFO ] 2026-06-01 03:46:16.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422971,ok=422971,error=0, records=41
[INFO ] 2026-06-01 03:46:19.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:46:22.609 [13624] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:46:30.841 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21159/300s
[INFO ] 2026-06-01 03:46:31.272 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 03:46:31.272 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422972,ok=422972,error=0, records=41
[INFO ] 2026-06-01 03:46:34.839 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:46:37.614 [13624] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:46:46.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-01 03:46:46.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422973,ok=422973,error=0, records=41
[INFO ] 2026-06-01 03:46:49.839 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:46:52.619 [13619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:47:01.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 03:47:01.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422974,ok=422974,error=0, records=41
[INFO ] 2026-06-01 03:47:04.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:47:04.840 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21162/300s
[WARN ] 2026-06-01 03:47:07.625 [13609] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:47:16.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 03:47:16.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422975,ok=422975,error=0, records=41
[INFO ] 2026-06-01 03:47:19.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:47:22.630 [13619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:47:31.294 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 03:47:31.294 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422976,ok=422976,error=0, records=41
[INFO ] 2026-06-01 03:47:34.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:47:37.635 [13564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:47:38.148 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21160/300s
[INFO ] 2026-06-01 03:47:40.050 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21160/300s
[INFO ] 2026-06-01 03:47:46.299 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 03:47:46.299 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422977,ok=422977,error=0, records=41
[INFO ] 2026-06-01 03:47:47.757 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21160/300s
[INFO ] 2026-06-01 03:47:49.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:47:52.641 [13564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:48:01.308 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 03:48:01.308 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422978,ok=422978,error=0, records=41
[INFO ] 2026-06-01 03:48:04.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:48:07.646 [13609] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:48:16.313 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 03:48:16.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422979,ok=422979,error=0, records=41
[INFO ] 2026-06-01 03:48:19.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:48:22.652 [13603] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:48:31.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 03:48:31.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422980,ok=422980,error=0, records=41
[INFO ] 2026-06-01 03:48:34.169 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20875352},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:48:34.319 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:48:34.319 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 03:48:34.319 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:48:34.319 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:48:34.319 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:48:34.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:48:34.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:48:34.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:48:34.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:48:34.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:48:34.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:48:34.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:48:37.657 [13609] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:48:46.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 03:48:46.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422981,ok=422981,error=0, records=41
[INFO ] 2026-06-01 03:48:49.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:48:52.663 [13624] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:49:01.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 03:49:01.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422982,ok=422982,error=0, records=41
[INFO ] 2026-06-01 03:49:04.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:49:07.670 [13624] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:49:16.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 03:49:16.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422983,ok=422983,error=0, records=41
[INFO ] 2026-06-01 03:49:19.845 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:49:22.675 [13564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:49:31.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 03:49:31.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422984,ok=422984,error=0, records=41
[INFO ] 2026-06-01 03:49:34.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:49:37.680 [13624] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:49:46.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 03:49:46.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422985,ok=422985,error=0, records=41
[INFO ] 2026-06-01 03:49:49.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:49:52.185 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21155/300s
[WARN ] 2026-06-01 03:49:52.685 [13564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:50:00.694 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21164/300s
[INFO ] 2026-06-01 03:50:01.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 03:50:01.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422986,ok=422986,error=0, records=41
[INFO ] 2026-06-01 03:50:01.515 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21151/300s
[INFO ] 2026-06-01 03:50:04.847 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:50:07.692 [13603] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:50:16.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-01 03:50:16.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422987,ok=422987,error=0, records=41
[INFO ] 2026-06-01 03:50:19.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:50:22.701 [13603] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:50:31.524 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10429, records=41
[INFO ] 2026-06-01 03:50:31.524 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422988,ok=422988,error=0, records=41
[INFO ] 2026-06-01 03:50:34.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:50:37.705 [13603] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:50:40.793 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21164/300s
[INFO ] 2026-06-01 03:50:46.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10439, records=41
[INFO ] 2026-06-01 03:50:46.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422989,ok=422989,error=0, records=41
[INFO ] 2026-06-01 03:50:49.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:50:52.713 [13564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:50:56.422 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21151/300s
[INFO ] 2026-06-01 03:51:01.541 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10434, records=41
[INFO ] 2026-06-01 03:51:01.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422990,ok=422990,error=0, records=41
[INFO ] 2026-06-01 03:51:04.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:51:07.719 [13624] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:51:16.551 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10478, records=41
[INFO ] 2026-06-01 03:51:16.551 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422991,ok=422991,error=0, records=41
[INFO ] 2026-06-01 03:51:19.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:51:22.727 [13564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:51:30.892 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21160/300s
[INFO ] 2026-06-01 03:51:31.556 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10459, records=41
[INFO ] 2026-06-01 03:51:31.556 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422992,ok=422992,error=0, records=41
[INFO ] 2026-06-01 03:51:34.319 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17620/300s
[INFO ] 2026-06-01 03:51:34.321 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":19151264},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:51:34.493 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:51:34.493 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 03:51:34.493 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:51:34.493 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:51:34.493 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:51:34.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:51:34.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:51:34.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:51:34.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:51:34.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:51:34.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:51:34.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:51:37.734 [13564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:51:46.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10473, records=41
[INFO ] 2026-06-01 03:51:46.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422993,ok=422993,error=0, records=41
[INFO ] 2026-06-01 03:51:49.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:51:52.739 [13624] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:52:01.566 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10439, records=41
[INFO ] 2026-06-01 03:52:01.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422994,ok=422994,error=0, records=41
[INFO ] 2026-06-01 03:52:04.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:52:04.852 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21163/300s
[WARN ] 2026-06-01 03:52:07.746 [13619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:52:16.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10398, records=41
[INFO ] 2026-06-01 03:52:16.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422995,ok=422995,error=0, records=41
[INFO ] 2026-06-01 03:52:19.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:52:22.753 [13624] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:52:31.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 03:52:31.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422996,ok=422996,error=0, records=41
[INFO ] 2026-06-01 03:52:34.854 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:52:37.762 [13603] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:52:38.241 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21161/300s
[INFO ] 2026-06-01 03:52:40.137 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21161/300s
[INFO ] 2026-06-01 03:52:46.614 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-01 03:52:46.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422997,ok=422997,error=0, records=41
[INFO ] 2026-06-01 03:52:47.846 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21161/300s
[INFO ] 2026-06-01 03:52:49.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:52:52.769 [13619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:53:01.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10423, records=41
[INFO ] 2026-06-01 03:53:01.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422998,ok=422998,error=0, records=41
[INFO ] 2026-06-01 03:53:04.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:53:07.779 [13624] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:53:16.626 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10394, records=41
[INFO ] 2026-06-01 03:53:16.626 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=422999,ok=422999,error=0, records=41
[INFO ] 2026-06-01 03:53:19.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:53:22.786 [13564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:53:31.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 03:53:31.631 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423000,ok=423000,error=0, records=41
[INFO ] 2026-06-01 03:53:34.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 03:53:34.856 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 03:53:37.792 [13564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:53:46.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-01 03:53:46.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423001,ok=423001,error=0, records=41
[INFO ] 2026-06-01 03:53:49.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:53:49.857 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 03:53:52.798 [13609] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:54:01.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-01 03:54:01.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423002,ok=423002,error=0, records=41
[INFO ] 2026-06-01 03:54:04.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:54:07.806 [14072] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:54:16.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10425, records=41
[INFO ] 2026-06-01 03:54:16.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423003,ok=423003,error=0, records=41
[INFO ] 2026-06-01 03:54:19.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:54:22.814 [13619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:54:31.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-01 03:54:31.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423004,ok=423004,error=0, records=41
[INFO ] 2026-06-01 03:54:34.495 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":15687896},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:54:34.643 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:54:34.643 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 03:54:34.643 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:54:34.643 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:54:34.643 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:54:34.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:54:34.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:54:34.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:54:34.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:54:34.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:54:34.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:54:34.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:54:37.825 [14082] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:54:46.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10392, records=41
[INFO ] 2026-06-01 03:54:46.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423005,ok=423005,error=0, records=41
[INFO ] 2026-06-01 03:54:49.860 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:54:52.331 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21156/300s
[WARN ] 2026-06-01 03:54:52.831 [14087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:55:00.698 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21165/300s
[INFO ] 2026-06-01 03:55:01.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10394, records=41
[INFO ] 2026-06-01 03:55:01.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423006,ok=423006,error=0, records=41
[INFO ] 2026-06-01 03:55:01.669 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21152/300s
[INFO ] 2026-06-01 03:55:04.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:55:07.841 [14087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:55:16.674 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10419, records=41
[INFO ] 2026-06-01 03:55:16.674 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423007,ok=423007,error=0, records=41
[INFO ] 2026-06-01 03:55:19.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:55:22.846 [14082] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:55:31.678 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-01 03:55:31.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423008,ok=423008,error=0, records=41
[INFO ] 2026-06-01 03:55:34.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:55:37.854 [14087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:55:40.799 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21165/300s
[INFO ] 2026-06-01 03:55:46.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10412, records=41
[INFO ] 2026-06-01 03:55:46.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423009,ok=423009,error=0, records=41
[INFO ] 2026-06-01 03:55:49.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:55:52.861 [14087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:55:56.602 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21152/300s
[INFO ] 2026-06-01 03:56:01.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10389, records=41
[INFO ] 2026-06-01 03:56:01.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423010,ok=423010,error=0, records=41
[INFO ] 2026-06-01 03:56:04.863 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:56:07.867 [14138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:56:16.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10387, records=41
[INFO ] 2026-06-01 03:56:16.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423011,ok=423011,error=0, records=41
[INFO ] 2026-06-01 03:56:19.863 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:56:22.876 [14082] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:56:30.952 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21161/300s
[INFO ] 2026-06-01 03:56:31.701 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10388, records=41
[INFO ] 2026-06-01 03:56:31.701 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423012,ok=423012,error=0, records=41
[INFO ] 2026-06-01 03:56:34.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:56:37.886 [14217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:56:46.707 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-01 03:56:46.707 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423013,ok=423013,error=0, records=41
[INFO ] 2026-06-01 03:56:49.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:56:52.902 [14237] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:57:01.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10424, records=41
[INFO ] 2026-06-01 03:57:01.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423014,ok=423014,error=0, records=41
[INFO ] 2026-06-01 03:57:04.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:57:04.865 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21164/300s
[WARN ] 2026-06-01 03:57:07.909 [14253] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:57:16.729 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10478, records=41
[INFO ] 2026-06-01 03:57:16.729 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423015,ok=423015,error=0, records=41
[INFO ] 2026-06-01 03:57:19.866 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:57:22.913 [14247] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:57:31.734 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10455, records=41
[INFO ] 2026-06-01 03:57:31.735 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423016,ok=423016,error=0, records=41
[INFO ] 2026-06-01 03:57:34.643 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17621/300s
[INFO ] 2026-06-01 03:57:34.645 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":12620124},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 03:57:34.809 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 03:57:34.809 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 03:57:34.809 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 03:57:34.809 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 03:57:34.809 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 03:57:34.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:57:34.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 03:57:34.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:57:34.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 03:57:34.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:57:34.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 03:57:34.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:57:37.919 [14287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:57:38.262 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21162/300s
[INFO ] 2026-06-01 03:57:40.170 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21162/300s
[INFO ] 2026-06-01 03:57:46.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10490, records=41
[INFO ] 2026-06-01 03:57:46.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423017,ok=423017,error=0, records=41
[INFO ] 2026-06-01 03:57:47.853 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21162/300s
[INFO ] 2026-06-01 03:57:49.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:57:52.925 [14304] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:58:01.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10472, records=41
[INFO ] 2026-06-01 03:58:01.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423018,ok=423018,error=0, records=41
[INFO ] 2026-06-01 03:58:04.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:58:07.940 [14319] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:58:16.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10435, records=41
[INFO ] 2026-06-01 03:58:16.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423019,ok=423019,error=0, records=41
[INFO ] 2026-06-01 03:58:19.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:58:22.945 [14309] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:58:31.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10378, records=41
[INFO ] 2026-06-01 03:58:31.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423020,ok=423020,error=0, records=41
[INFO ] 2026-06-01 03:58:34.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:58:37.952 [14292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:58:46.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10387, records=41
[INFO ] 2026-06-01 03:58:46.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423021,ok=423021,error=0, records=41
[INFO ] 2026-06-01 03:58:49.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:58:52.967 [14292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:59:01.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10412, records=41
[INFO ] 2026-06-01 03:59:01.839 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423022,ok=423022,error=0, records=41
[INFO ] 2026-06-01 03:59:04.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:59:07.971 [14330] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:59:16.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10413, records=41
[INFO ] 2026-06-01 03:59:16.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423023,ok=423023,error=0, records=41
[INFO ] 2026-06-01 03:59:19.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:59:22.976 [14292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:59:31.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10391, records=41
[INFO ] 2026-06-01 03:59:31.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423024,ok=423024,error=0, records=41
[INFO ] 2026-06-01 03:59:34.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 03:59:37.980 [14292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 03:59:46.992 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10365, records=41
[INFO ] 2026-06-01 03:59:46.992 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423025,ok=423025,error=0, records=41
[INFO ] 2026-06-01 03:59:49.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 03:59:52.485 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21157/300s
[WARN ] 2026-06-01 03:59:52.985 [14292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:00:00.701 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21166/300s
[INFO ] 2026-06-01 04:00:02.012 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10417, records=41
[INFO ] 2026-06-01 04:00:02.012 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423026,ok=423026,error=0, records=41
[INFO ] 2026-06-01 04:00:02.012 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21153/300s
[INFO ] 2026-06-01 04:00:04.876 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:00:07.991 [14375] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:00:17.017 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10414, records=41
[INFO ] 2026-06-01 04:00:17.017 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423027,ok=423027,error=0, records=41
[INFO ] 2026-06-01 04:00:19.876 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:00:22.996 [14347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:00:32.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10399, records=41
[INFO ] 2026-06-01 04:00:32.023 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423028,ok=423028,error=0, records=41
[INFO ] 2026-06-01 04:00:34.811 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":9012148},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:00:34.877 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-01 04:00:34.964 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=252
[INFO ] 2026-06-01 04:00:34.964 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 04:00:34.964 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:00:34.964 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:00:34.964 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:00:35.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:00:35.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:00:35.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:00:35.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:00:35.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:00:35.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:00:38.002 [14292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:00:40.805 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21166/300s
[INFO ] 2026-06-01 04:00:47.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10408, records=41
[INFO ] 2026-06-01 04:00:47.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423029,ok=423029,error=0, records=41
[INFO ] 2026-06-01 04:00:49.877 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:00:53.009 [14347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:00:56.778 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21153/300s
[INFO ] 2026-06-01 04:01:02.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10382, records=41
[INFO ] 2026-06-01 04:01:02.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423030,ok=423030,error=0, records=41
[INFO ] 2026-06-01 04:01:04.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:01:08.015 [14456] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:01:17.041 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-01 04:01:17.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423031,ok=423031,error=0, records=41
[INFO ] 2026-06-01 04:01:19.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:01:23.021 [14485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:01:31.040 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21162/300s
[INFO ] 2026-06-01 04:01:32.046 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 04:01:32.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423032,ok=423032,error=0, records=41
[INFO ] 2026-06-01 04:01:34.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:01:38.027 [14526] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:01:47.051 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 04:01:47.051 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423033,ok=423033,error=0, records=41
[INFO ] 2026-06-01 04:01:49.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:01:53.033 [14292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:02:02.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 04:02:02.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423034,ok=423034,error=0, records=41
[INFO ] 2026-06-01 04:02:04.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:02:04.880 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21165/300s
[WARN ] 2026-06-01 04:02:08.040 [14569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:02:17.069 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10420, records=41
[INFO ] 2026-06-01 04:02:17.069 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423035,ok=423035,error=0, records=41
[INFO ] 2026-06-01 04:02:19.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:02:23.045 [14569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:02:32.075 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10405, records=41
[INFO ] 2026-06-01 04:02:32.075 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423036,ok=423036,error=0, records=41
[INFO ] 2026-06-01 04:02:34.882 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:02:38.051 [14591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:02:38.308 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21163/300s
[INFO ] 2026-06-01 04:02:40.192 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21163/300s
[INFO ] 2026-06-01 04:02:47.081 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-01 04:02:47.081 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423037,ok=423037,error=0, records=41
[INFO ] 2026-06-01 04:02:47.936 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21163/300s
[INFO ] 2026-06-01 04:02:49.882 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:02:52.556 [14606] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:03:02.086 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10413, records=41
[INFO ] 2026-06-01 04:03:02.086 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423038,ok=423038,error=0, records=41
[INFO ] 2026-06-01 04:03:04.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:03:07.565 [14629] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:03:17.090 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-01 04:03:17.090 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423039,ok=423039,error=0, records=41
[INFO ] 2026-06-01 04:03:19.884 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:03:22.572 [14665] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:03:32.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-01 04:03:32.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423040,ok=423040,error=0, records=41
[INFO ] 2026-06-01 04:03:34.884 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 04:03:34.884 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 04:03:34.965 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17622/300s
[INFO ] 2026-06-01 04:03:34.967 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":5523360},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:03:35.116 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=252
[INFO ] 2026-06-01 04:03:35.116 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 04:03:35.117 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:03:35.117 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:03:35.117 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:03:35.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:03:35.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:03:35.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:03:35.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:03:35.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:03:35.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:03:37.578 [14680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:03:47.177 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-01 04:03:47.177 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423041,ok=423041,error=0, records=41
[INFO ] 2026-06-01 04:03:49.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:03:52.603 [14690] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:04:02.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 04:04:02.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423042,ok=423042,error=0, records=41
[INFO ] 2026-06-01 04:04:04.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:04:07.613 [14696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:04:17.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10424, records=41
[INFO ] 2026-06-01 04:04:17.186 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423043,ok=423043,error=0, records=41
[INFO ] 2026-06-01 04:04:19.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:04:22.620 [14680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:04:32.191 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-01 04:04:32.191 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423044,ok=423044,error=0, records=41
[INFO ] 2026-06-01 04:04:34.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:04:37.628 [14677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:04:47.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10431, records=41
[INFO ] 2026-06-01 04:04:47.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423045,ok=423045,error=0, records=41
[INFO ] 2026-06-01 04:04:49.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:04:52.634 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21158/300s
[WARN ] 2026-06-01 04:04:52.634 [14677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:05:00.704 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21167/300s
[INFO ] 2026-06-01 04:05:02.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10392, records=41
[INFO ] 2026-06-01 04:05:02.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423046,ok=423046,error=0, records=41
[INFO ] 2026-06-01 04:05:02.274 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21154/300s
[INFO ] 2026-06-01 04:05:04.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:05:07.643 [14680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:05:17.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10458, records=41
[INFO ] 2026-06-01 04:05:17.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423047,ok=423047,error=0, records=41
[INFO ] 2026-06-01 04:05:19.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:05:22.649 [14680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:05:32.284 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10400, records=41
[INFO ] 2026-06-01 04:05:32.284 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423048,ok=423048,error=0, records=41
[INFO ] 2026-06-01 04:05:34.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:05:37.655 [14696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:05:40.812 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21167/300s
[INFO ] 2026-06-01 04:05:47.289 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-06-01 04:05:47.289 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423049,ok=423049,error=0, records=41
[INFO ] 2026-06-01 04:05:49.890 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:05:52.661 [14684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:05:56.958 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21154/300s
[INFO ] 2026-06-01 04:06:02.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-01 04:06:02.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423050,ok=423050,error=0, records=41
[INFO ] 2026-06-01 04:06:04.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:06:07.667 [14690] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:06:17.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-01 04:06:17.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423051,ok=423051,error=0, records=41
[INFO ] 2026-06-01 04:06:19.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:06:22.676 [14690] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:06:31.102 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21163/300s
[INFO ] 2026-06-01 04:06:32.305 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 04:06:32.305 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423052,ok=423052,error=0, records=41
[INFO ] 2026-06-01 04:06:34.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:06:35.125 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":4471096},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:06:35.300 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=252
[INFO ] 2026-06-01 04:06:35.300 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 04:06:35.300 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:06:35.300 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:06:35.300 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:06:35.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:06:35.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:06:35.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:06:35.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:06:35.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:06:35.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:06:37.683 [14684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:06:47.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-01 04:06:47.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423053,ok=423053,error=0, records=41
[INFO ] 2026-06-01 04:06:49.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:06:52.688 [14677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:07:02.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-01 04:07:02.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423054,ok=423054,error=0, records=41
[INFO ] 2026-06-01 04:07:04.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:07:04.893 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21166/300s
[WARN ] 2026-06-01 04:07:07.696 [14684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:07:17.326 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-01 04:07:17.326 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423055,ok=423055,error=0, records=41
[INFO ] 2026-06-01 04:07:19.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:07:22.702 [14696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:07:32.332 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-01 04:07:32.332 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423056,ok=423056,error=0, records=41
[INFO ] 2026-06-01 04:07:34.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:07:37.711 [14696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:07:38.339 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21164/300s
[INFO ] 2026-06-01 04:07:40.236 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21164/300s
[INFO ] 2026-06-01 04:07:47.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-01 04:07:47.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423057,ok=423057,error=0, records=41
[INFO ] 2026-06-01 04:07:47.946 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21164/300s
[INFO ] 2026-06-01 04:07:49.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:07:52.719 [14684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:08:02.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 04:08:02.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423058,ok=423058,error=0, records=41
[INFO ] 2026-06-01 04:08:04.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:08:07.725 [14677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:08:17.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10354, records=41
[INFO ] 2026-06-01 04:08:17.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423059,ok=423059,error=0, records=41
[INFO ] 2026-06-01 04:08:19.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:08:22.732 [14677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:08:32.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-01 04:08:32.355 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423060,ok=423060,error=0, records=41
[INFO ] 2026-06-01 04:08:34.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:08:37.738 [14696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:08:47.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10387, records=41
[INFO ] 2026-06-01 04:08:47.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423061,ok=423061,error=0, records=41
[INFO ] 2026-06-01 04:08:49.897 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:08:49.897 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 04:08:52.744 [14677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:09:02.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-01 04:09:02.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423062,ok=423062,error=0, records=41
[INFO ] 2026-06-01 04:09:04.899 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:09:07.749 [14684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:09:17.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-01 04:09:17.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423063,ok=423063,error=0, records=41
[INFO ] 2026-06-01 04:09:19.899 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:09:22.754 [14696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:09:32.377 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-01 04:09:32.377 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423064,ok=423064,error=0, records=41
[INFO ] 2026-06-01 04:09:34.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:09:35.300 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17623/300s
[INFO ] 2026-06-01 04:09:35.302 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867924},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:09:35.458 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:09:35.458 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 04:09:35.458 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:09:35.458 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:09:35.458 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:09:35.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:09:35.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:09:35.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:09:35.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:09:35.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:09:35.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:09:37.763 [14684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:09:47.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 04:09:47.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423065,ok=423065,error=0, records=41
[INFO ] 2026-06-01 04:09:49.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:09:52.767 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21159/300s
[WARN ] 2026-06-01 04:09:52.767 [14677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:10:00.708 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21168/300s
[INFO ] 2026-06-01 04:10:02.392 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-01 04:10:02.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423066,ok=423066,error=0, records=41
[INFO ] 2026-06-01 04:10:02.392 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21155/300s
[INFO ] 2026-06-01 04:10:04.901 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:10:07.773 [14677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:10:17.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10460, records=41
[INFO ] 2026-06-01 04:10:17.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423067,ok=423067,error=0, records=41
[INFO ] 2026-06-01 04:10:19.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:10:22.778 [14677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:10:32.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10475, records=41
[INFO ] 2026-06-01 04:10:32.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423068,ok=423068,error=0, records=41
[INFO ] 2026-06-01 04:10:34.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:10:37.784 [14690] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:10:40.818 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21168/300s
[INFO ] 2026-06-01 04:10:47.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10461, records=41
[INFO ] 2026-06-01 04:10:47.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423069,ok=423069,error=0, records=41
[INFO ] 2026-06-01 04:10:49.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:10:52.790 [14680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:10:57.136 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21155/300s
[INFO ] 2026-06-01 04:11:02.413 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10462, records=41
[INFO ] 2026-06-01 04:11:02.413 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423070,ok=423070,error=0, records=41
[INFO ] 2026-06-01 04:11:04.904 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:11:07.795 [14680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:11:17.418 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10511, records=41
[INFO ] 2026-06-01 04:11:17.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423071,ok=423071,error=0, records=41
[INFO ] 2026-06-01 04:11:19.904 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:11:22.802 [14690] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:11:31.156 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21164/300s
[INFO ] 2026-06-01 04:11:32.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10463, records=41
[INFO ] 2026-06-01 04:11:32.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423072,ok=423072,error=0, records=41
[WARN ] 2026-06-01 04:11:32.810 [14680] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/13849/stat), No such file or directory
[WARN ] 2026-06-01 04:11:32.810 [14680] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/13848/stat), No such file or directory
[INFO ] 2026-06-01 04:11:34.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:11:37.812 [14680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:11:47.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10414, records=41
[INFO ] 2026-06-01 04:11:47.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423073,ok=423073,error=0, records=41
[WARN ] 2026-06-01 04:11:47.818 [14684] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/13849/stat), No such file or directory
[WARN ] 2026-06-01 04:11:47.818 [14684] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/13848/stat), No such file or directory
[INFO ] 2026-06-01 04:11:49.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:11:52.821 [15113] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:12:02.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=13509, records=49
[INFO ] 2026-06-01 04:12:02.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423074,ok=423074,error=0, records=49
[INFO ] 2026-06-01 04:12:04.906 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:12:04.906 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21167/300s
[WARN ] 2026-06-01 04:12:07.830 [14684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:12:17.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10407, records=41
[INFO ] 2026-06-01 04:12:17.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423075,ok=423075,error=0, records=41
[INFO ] 2026-06-01 04:12:19.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:12:22.838 [15113] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:12:32.443 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10409, records=41
[INFO ] 2026-06-01 04:12:32.443 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423076,ok=423076,error=0, records=41
[INFO ] 2026-06-01 04:12:34.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:12:35.459 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20045280},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:12:35.636 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:12:35.636 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 04:12:35.636 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:12:35.636 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:12:35.636 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:12:35.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:12:35.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:12:35.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:12:35.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:12:35.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:12:35.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:12:37.843 [15167] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:12:38.433 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21165/300s
[INFO ] 2026-06-01 04:12:40.334 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21165/300s
[INFO ] 2026-06-01 04:12:47.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10410, records=41
[INFO ] 2026-06-01 04:12:47.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423077,ok=423077,error=0, records=41
[INFO ] 2026-06-01 04:12:48.016 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21165/300s
[INFO ] 2026-06-01 04:12:49.908 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:12:52.867 [15167] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:13:02.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10500, records=41
[INFO ] 2026-06-01 04:13:02.456 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423078,ok=423078,error=0, records=41
[INFO ] 2026-06-01 04:13:04.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:13:07.881 [14680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 04:13:17.390 [15113] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/15111/stat), No such file or directory
[WARN ] 2026-06-01 04:13:17.390 [15113] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/15110/stat), No such file or directory
[INFO ] 2026-06-01 04:13:17.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10447, records=41
[INFO ] 2026-06-01 04:13:17.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423079,ok=423079,error=0, records=41
[INFO ] 2026-06-01 04:13:19.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:13:22.891 [15299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 04:13:32.396 [15237] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/15111/stat), No such file or directory
[WARN ] 2026-06-01 04:13:32.397 [15237] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/15110/stat), No such file or directory
[INFO ] 2026-06-01 04:13:32.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10473, records=41
[INFO ] 2026-06-01 04:13:32.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423080,ok=423080,error=0, records=41
[INFO ] 2026-06-01 04:13:34.910 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 04:13:34.913 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 04:13:37.898 [15314] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 04:13:47.403 [15315] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/15111/stat), No such file or directory
[WARN ] 2026-06-01 04:13:47.403 [15315] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/15110/stat), No such file or directory
[WARN ] 2026-06-01 04:13:47.405 [15315] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/9444/stat), No such file or directory
[INFO ] 2026-06-01 04:13:47.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 04:13:47.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423081,ok=423081,error=0, records=41
[INFO ] 2026-06-01 04:13:49.914 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:13:52.904 [15325] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:14:02.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 04:14:02.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423082,ok=423082,error=0, records=41
[INFO ] 2026-06-01 04:14:04.914 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:14:07.910 [15315] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:14:17.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 04:14:17.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423083,ok=423083,error=0, records=41
[INFO ] 2026-06-01 04:14:19.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:14:22.915 [15315] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:14:32.574 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-01 04:14:32.574 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423084,ok=423084,error=0, records=41
[INFO ] 2026-06-01 04:14:34.916 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:14:37.921 [15381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:14:47.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10131, records=41
[INFO ] 2026-06-01 04:14:47.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423085,ok=423085,error=0, records=41
[INFO ] 2026-06-01 04:14:49.916 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:14:52.927 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21160/300s
[WARN ] 2026-06-01 04:14:52.928 [15191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:15:00.711 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21169/300s
[INFO ] 2026-06-01 04:15:02.583 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-01 04:15:02.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423086,ok=423086,error=0, records=41
[INFO ] 2026-06-01 04:15:02.583 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21156/300s
[INFO ] 2026-06-01 04:15:04.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:15:07.933 [15191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:15:17.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 04:15:17.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423087,ok=423087,error=0, records=41
[INFO ] 2026-06-01 04:15:19.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:15:22.939 [15421] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:15:32.599 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 04:15:32.599 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423088,ok=423088,error=0, records=41
[INFO ] 2026-06-01 04:15:34.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:15:35.636 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17624/300s
[INFO ] 2026-06-01 04:15:35.638 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20874528},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:15:35.807 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:15:35.807 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 04:15:35.807 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:15:35.807 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:15:35.807 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:15:35.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:15:35.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:15:35.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:15:35.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:15:35.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:15:35.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:15:37.944 [15437] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:15:40.833 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21169/300s
[INFO ] 2026-06-01 04:15:47.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 04:15:47.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423089,ok=423089,error=0, records=41
[INFO ] 2026-06-01 04:15:49.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:15:52.949 [15471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:15:57.318 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21156/300s
[INFO ] 2026-06-01 04:16:02.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 04:16:02.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423090,ok=423090,error=0, records=41
[INFO ] 2026-06-01 04:16:04.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:16:07.954 [15427] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:16:17.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 04:16:17.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423091,ok=423091,error=0, records=41
[INFO ] 2026-06-01 04:16:19.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:16:22.958 [15437] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:16:31.229 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21165/300s
[INFO ] 2026-06-01 04:16:32.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 04:16:32.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423092,ok=423092,error=0, records=41
[INFO ] 2026-06-01 04:16:34.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:16:37.963 [15472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:16:47.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 04:16:47.626 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423093,ok=423093,error=0, records=41
[INFO ] 2026-06-01 04:16:49.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:16:52.969 [15437] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:17:02.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 04:17:02.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423094,ok=423094,error=0, records=41
[INFO ] 2026-06-01 04:17:04.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:17:04.922 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21168/300s
[WARN ] 2026-06-01 04:17:07.974 [15437] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:17:17.636 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 04:17:17.636 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423095,ok=423095,error=0, records=41
[INFO ] 2026-06-01 04:17:19.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:17:22.978 [15544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:17:32.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 04:17:32.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423096,ok=423096,error=0, records=41
[INFO ] 2026-06-01 04:17:34.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:17:37.983 [15572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:17:38.478 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21166/300s
[INFO ] 2026-06-01 04:17:40.409 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21166/300s
[INFO ] 2026-06-01 04:17:47.647 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 04:17:47.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423097,ok=423097,error=0, records=41
[INFO ] 2026-06-01 04:17:48.044 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21166/300s
[INFO ] 2026-06-01 04:17:49.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:17:52.988 [15500] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:18:02.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 04:18:02.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423098,ok=423098,error=0, records=41
[INFO ] 2026-06-01 04:18:04.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:18:07.992 [15572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:18:17.658 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 04:18:17.658 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423099,ok=423099,error=0, records=41
[INFO ] 2026-06-01 04:18:19.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:18:23.006 [15500] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:18:32.664 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 04:18:32.664 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423100,ok=423100,error=0, records=41
[INFO ] 2026-06-01 04:18:34.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:18:35.809 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20874444},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:18:35.968 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:18:35.968 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 04:18:35.969 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:18:35.969 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:18:35.969 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:18:36.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:18:36.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:18:36.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:18:36.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:18:36.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:18:36.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:18:38.010 [15633] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:18:47.730 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 04:18:47.730 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423101,ok=423101,error=0, records=41
[INFO ] 2026-06-01 04:18:49.926 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:18:53.014 [15606] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:19:02.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 04:19:02.737 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423102,ok=423102,error=0, records=41
[INFO ] 2026-06-01 04:19:04.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:19:08.019 [15648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:19:17.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 04:19:17.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423103,ok=423103,error=0, records=41
[INFO ] 2026-06-01 04:19:19.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:19:23.025 [15572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:19:32.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 04:19:32.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423104,ok=423104,error=0, records=41
[INFO ] 2026-06-01 04:19:34.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:19:38.029 [15572] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:19:47.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 04:19:47.782 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423105,ok=423105,error=0, records=41
[INFO ] 2026-06-01 04:19:49.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:19:53.033 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21161/300s
[WARN ] 2026-06-01 04:19:53.034 [15606] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:20:00.714 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21170/300s
[INFO ] 2026-06-01 04:20:02.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11362, records=44
[INFO ] 2026-06-01 04:20:02.789 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423106,ok=423106,error=0, records=44
[INFO ] 2026-06-01 04:20:02.789 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21157/300s
[INFO ] 2026-06-01 04:20:04.929 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:20:08.037 [15689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:20:17.793 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-01 04:20:17.793 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423107,ok=423107,error=0, records=41
[INFO ] 2026-06-01 04:20:19.929 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:20:23.042 [15689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:20:32.798 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 04:20:32.798 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423108,ok=423108,error=0, records=41
[INFO ] 2026-06-01 04:20:34.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:20:38.045 [15749] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:20:40.839 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21170/300s
[INFO ] 2026-06-01 04:20:47.805 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 04:20:47.805 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423109,ok=423109,error=0, records=41
[INFO ] 2026-06-01 04:20:49.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:20:53.050 [15772] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:20:57.497 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21157/300s
[INFO ] 2026-06-01 04:21:02.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 04:21:02.814 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423110,ok=423110,error=0, records=41
[INFO ] 2026-06-01 04:21:04.931 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:21:07.556 [15792] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:21:17.818 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 04:21:17.818 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423111,ok=423111,error=0, records=41
[INFO ] 2026-06-01 04:21:19.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:21:22.560 [15796] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:21:31.277 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21166/300s
[INFO ] 2026-06-01 04:21:32.823 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 04:21:32.823 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423112,ok=423112,error=0, records=41
[INFO ] 2026-06-01 04:21:34.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:21:35.969 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17625/300s
[INFO ] 2026-06-01 04:21:35.970 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20874324},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:21:36.131 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:21:36.131 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 04:21:36.131 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:21:36.131 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:21:36.131 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:21:36.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:21:36.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:21:36.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:21:36.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:21:36.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:21:36.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:21:37.565 [15819] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:21:47.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 04:21:47.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423113,ok=423113,error=0, records=41
[INFO ] 2026-06-01 04:21:49.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:21:52.569 [15832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:22:02.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-01 04:22:02.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423114,ok=423114,error=0, records=41
[INFO ] 2026-06-01 04:22:04.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:22:04.933 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21169/300s
[WARN ] 2026-06-01 04:22:07.573 [15850] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:22:17.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-01 04:22:17.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423115,ok=423115,error=0, records=41
[INFO ] 2026-06-01 04:22:19.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:22:22.578 [15832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:22:32.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 04:22:32.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423116,ok=423116,error=0, records=41
[INFO ] 2026-06-01 04:22:34.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:22:37.583 [15832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:22:38.510 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21167/300s
[INFO ] 2026-06-01 04:22:40.442 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21167/300s
[INFO ] 2026-06-01 04:22:47.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 04:22:47.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423117,ok=423117,error=0, records=41
[INFO ] 2026-06-01 04:22:48.056 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21167/300s
[INFO ] 2026-06-01 04:22:49.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:22:52.588 [15912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:23:02.885 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-01 04:23:02.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423118,ok=423118,error=0, records=41
[INFO ] 2026-06-01 04:23:04.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:23:07.606 [15912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:23:17.890 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 04:23:17.890 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423119,ok=423119,error=0, records=41
[INFO ] 2026-06-01 04:23:19.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:23:22.612 [15907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:23:32.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 04:23:32.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423120,ok=423120,error=0, records=41
[INFO ] 2026-06-01 04:23:34.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 04:23:34.937 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 04:23:37.617 [15923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:23:47.901 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 04:23:47.901 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423121,ok=423121,error=0, records=41
[INFO ] 2026-06-01 04:23:49.937 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:23:49.937 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 04:23:52.622 [15912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:24:02.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 04:24:02.905 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423122,ok=423122,error=0, records=41
[INFO ] 2026-06-01 04:24:04.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:24:07.627 [15912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:24:17.911 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 04:24:17.911 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423123,ok=423123,error=0, records=41
[INFO ] 2026-06-01 04:24:19.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:24:22.632 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:24:32.916 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 04:24:32.916 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423124,ok=423124,error=0, records=41
[INFO ] 2026-06-01 04:24:34.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:24:36.133 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20874252},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:24:36.295 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:24:36.295 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[WARN ] 2026-06-01 04:24:37.637 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:24:47.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 04:24:47.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423125,ok=423125,error=0, records=41
[INFO ] 2026-06-01 04:24:49.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:24:52.642 [15913] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:24:53.142 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21162/300s
[INFO ] 2026-06-01 04:25:00.716 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21171/300s
[INFO ] 2026-06-01 04:25:02.928 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 04:25:02.928 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423126,ok=423126,error=0, records=41
[INFO ] 2026-06-01 04:25:02.928 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21158/300s
[INFO ] 2026-06-01 04:25:04.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:25:07.647 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:25:17.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 04:25:17.936 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423127,ok=423127,error=0, records=41
[INFO ] 2026-06-01 04:25:19.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:25:22.652 [15912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:25:32.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 04:25:32.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423128,ok=423128,error=0, records=41
[INFO ] 2026-06-01 04:25:34.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:25:37.658 [15912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:25:40.845 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21171/300s
[INFO ] 2026-06-01 04:25:47.945 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 04:25:47.945 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423129,ok=423129,error=0, records=41
[INFO ] 2026-06-01 04:25:49.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:25:52.663 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:25:57.670 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21158/300s
[INFO ] 2026-06-01 04:26:02.953 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-01 04:26:02.953 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423130,ok=423130,error=0, records=41
[INFO ] 2026-06-01 04:26:04.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:26:07.669 [15907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:26:17.958 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10136, records=41
[INFO ] 2026-06-01 04:26:17.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423131,ok=423131,error=0, records=41
[INFO ] 2026-06-01 04:26:19.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:26:22.674 [15907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:26:31.323 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21167/300s
[INFO ] 2026-06-01 04:26:32.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 04:26:32.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423132,ok=423132,error=0, records=41
[INFO ] 2026-06-01 04:26:34.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:26:37.680 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:26:47.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10145, records=41
[INFO ] 2026-06-01 04:26:47.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423133,ok=423133,error=0, records=41
[INFO ] 2026-06-01 04:26:49.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:26:52.685 [15923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:27:02.983 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 04:27:02.983 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423134,ok=423134,error=0, records=41
[INFO ] 2026-06-01 04:27:04.945 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:27:04.945 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21170/300s
[WARN ] 2026-06-01 04:27:07.690 [15913] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:27:18.001 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-01 04:27:18.001 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423135,ok=423135,error=0, records=41
[INFO ] 2026-06-01 04:27:19.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:27:22.696 [15913] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:27:33.007 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 04:27:33.008 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423136,ok=423136,error=0, records=41
[INFO ] 2026-06-01 04:27:34.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:27:36.295 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17626/300s
[INFO ] 2026-06-01 04:27:36.297 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20874176},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:27:36.474 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:27:36.474 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 04:27:36.474 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:27:36.474 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:27:36.474 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:27:36.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:27:36.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:27:36.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:27:36.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:27:36.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:27:36.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:27:37.701 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:27:38.529 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21168/300s
[INFO ] 2026-06-01 04:27:40.457 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21168/300s
[INFO ] 2026-06-01 04:27:48.012 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-01 04:27:48.012 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423137,ok=423137,error=0, records=41
[INFO ] 2026-06-01 04:27:48.154 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21168/300s
[INFO ] 2026-06-01 04:27:49.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:27:52.707 [15912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:28:03.017 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 04:28:03.017 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423138,ok=423138,error=0, records=41
[INFO ] 2026-06-01 04:28:04.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:28:07.712 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:28:18.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-01 04:28:18.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423139,ok=423139,error=0, records=41
[INFO ] 2026-06-01 04:28:19.948 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:28:22.717 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:28:33.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10143, records=41
[INFO ] 2026-06-01 04:28:33.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423140,ok=423140,error=0, records=41
[INFO ] 2026-06-01 04:28:34.948 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:28:37.721 [15913] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:28:48.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10150, records=41
[INFO ] 2026-06-01 04:28:48.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423141,ok=423141,error=0, records=41
[INFO ] 2026-06-01 04:28:49.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:28:52.727 [15923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:29:03.040 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 04:29:03.040 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423142,ok=423142,error=0, records=41
[INFO ] 2026-06-01 04:29:04.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:29:07.732 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:29:18.046 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 04:29:18.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423143,ok=423143,error=0, records=41
[INFO ] 2026-06-01 04:29:19.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:29:22.738 [15913] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:29:33.054 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 04:29:33.054 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423144,ok=423144,error=0, records=41
[INFO ] 2026-06-01 04:29:34.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:29:37.743 [15923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:29:48.059 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 04:29:48.059 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423145,ok=423145,error=0, records=41
[INFO ] 2026-06-01 04:29:49.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:29:52.748 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:29:53.248 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21163/300s
[INFO ] 2026-06-01 04:30:00.719 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21172/300s
[INFO ] 2026-06-01 04:30:03.064 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 04:30:03.064 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423146,ok=423146,error=0, records=41
[INFO ] 2026-06-01 04:30:03.064 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21159/300s
[INFO ] 2026-06-01 04:30:04.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:30:07.754 [15907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:30:18.071 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 04:30:18.071 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423147,ok=423147,error=0, records=41
[INFO ] 2026-06-01 04:30:19.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:30:22.759 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:30:33.076 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 04:30:33.076 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423148,ok=423148,error=0, records=41
[INFO ] 2026-06-01 04:30:34.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:30:36.475 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20874088},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:30:36.639 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:30:36.639 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 04:30:36.639 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:30:36.639 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:30:36.639 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:30:36.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:30:36.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:30:36.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:30:36.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:30:36.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:30:36.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:30:37.763 [15907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:30:40.850 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21172/300s
[INFO ] 2026-06-01 04:30:48.081 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 04:30:48.081 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423149,ok=423149,error=0, records=41
[INFO ] 2026-06-01 04:30:49.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:30:52.768 [15907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:30:57.843 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21159/300s
[INFO ] 2026-06-01 04:31:03.095 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-01 04:31:03.095 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423150,ok=423150,error=0, records=41
[INFO ] 2026-06-01 04:31:04.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:31:07.772 [15907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:31:18.100 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 04:31:18.100 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423151,ok=423151,error=0, records=41
[INFO ] 2026-06-01 04:31:19.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:31:22.778 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:31:31.367 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21168/300s
[INFO ] 2026-06-01 04:31:33.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 04:31:33.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423152,ok=423152,error=0, records=41
[INFO ] 2026-06-01 04:31:34.955 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:31:37.783 [15923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:31:48.111 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 04:31:48.111 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423153,ok=423153,error=0, records=41
[INFO ] 2026-06-01 04:31:49.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:31:52.788 [15907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:32:03.116 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 04:32:03.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423154,ok=423154,error=0, records=41
[INFO ] 2026-06-01 04:32:04.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:32:04.956 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21171/300s
[WARN ] 2026-06-01 04:32:07.793 [15907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:32:18.121 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-01 04:32:18.122 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423155,ok=423155,error=0, records=41
[INFO ] 2026-06-01 04:32:19.957 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:32:22.799 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:32:33.129 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 04:32:33.129 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423156,ok=423156,error=0, records=41
[INFO ] 2026-06-01 04:32:34.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:32:37.805 [16443] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:32:38.562 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21169/300s
[INFO ] 2026-06-01 04:32:40.483 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21169/300s
[INFO ] 2026-06-01 04:32:48.134 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 04:32:48.134 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423157,ok=423157,error=0, records=41
[INFO ] 2026-06-01 04:32:48.169 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21169/300s
[INFO ] 2026-06-01 04:32:49.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:32:52.809 [15923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:33:03.140 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10400, records=41
[INFO ] 2026-06-01 04:33:03.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423158,ok=423158,error=0, records=41
[INFO ] 2026-06-01 04:33:04.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:33:07.814 [16474] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:33:18.145 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 04:33:18.145 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423159,ok=423159,error=0, records=41
[INFO ] 2026-06-01 04:33:19.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:33:22.819 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:33:33.150 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10396, records=41
[INFO ] 2026-06-01 04:33:33.150 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423160,ok=423160,error=0, records=41
[INFO ] 2026-06-01 04:33:34.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 04:33:34.960 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 04:33:36.639 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17627/300s
[INFO ] 2026-06-01 04:33:36.640 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20874008},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:33:36.790 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:33:36.790 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 04:33:36.791 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:33:36.791 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:33:36.791 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:33:36.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:33:36.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:33:36.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:33:36.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:33:36.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:33:36.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:33:37.825 [16467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:33:48.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 04:33:48.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423161,ok=423161,error=0, records=41
[INFO ] 2026-06-01 04:33:49.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:33:52.831 [16489] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:34:03.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 04:34:03.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423162,ok=423162,error=0, records=41
[INFO ] 2026-06-01 04:34:04.961 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:34:07.836 [16489] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:34:18.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 04:34:18.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423163,ok=423163,error=0, records=41
[INFO ] 2026-06-01 04:34:19.961 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:34:22.841 [16518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:34:33.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 04:34:33.172 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423164,ok=423164,error=0, records=41
[INFO ] 2026-06-01 04:34:34.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:34:37.846 [16518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:34:48.178 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-01 04:34:48.178 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423165,ok=423165,error=0, records=41
[INFO ] 2026-06-01 04:34:49.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:34:52.852 [16474] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:34:53.352 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21164/300s
[INFO ] 2026-06-01 04:35:00.722 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21173/300s
[INFO ] 2026-06-01 04:35:03.185 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 04:35:03.185 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423166,ok=423166,error=0, records=41
[INFO ] 2026-06-01 04:35:03.185 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21160/300s
[INFO ] 2026-06-01 04:35:04.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:35:07.857 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:35:18.201 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 04:35:18.201 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423167,ok=423167,error=0, records=41
[INFO ] 2026-06-01 04:35:19.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:35:22.864 [16597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:35:33.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-01 04:35:33.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423168,ok=423168,error=0, records=41
[INFO ] 2026-06-01 04:35:34.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:35:37.870 [15901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:35:40.856 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21173/300s
[INFO ] 2026-06-01 04:35:48.212 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-01 04:35:48.212 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423169,ok=423169,error=0, records=41
[INFO ] 2026-06-01 04:35:49.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:35:52.874 [16597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:35:58.023 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21160/300s
[INFO ] 2026-06-01 04:36:03.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 04:36:03.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423170,ok=423170,error=0, records=41
[INFO ] 2026-06-01 04:36:04.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:36:07.880 [16641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:36:18.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 04:36:18.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423171,ok=423171,error=0, records=41
[INFO ] 2026-06-01 04:36:19.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:36:22.886 [16631] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:36:31.414 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21169/300s
[INFO ] 2026-06-01 04:36:33.230 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 04:36:33.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423172,ok=423172,error=0, records=41
[INFO ] 2026-06-01 04:36:34.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:36:36.792 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20873916},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:36:36.935 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:36:36.935 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 04:36:36.936 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:36:36.936 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:36:36.936 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:36:36.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:36:36.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:36:36.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:36:36.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:36:36.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:36:36.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:36:37.891 [16672] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:36:48.236 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 04:36:48.236 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423173,ok=423173,error=0, records=41
[INFO ] 2026-06-01 04:36:49.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:36:52.897 [16689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:37:03.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 04:37:03.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423174,ok=423174,error=0, records=41
[INFO ] 2026-06-01 04:37:04.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:37:04.967 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21172/300s
[WARN ] 2026-06-01 04:37:07.904 [16690] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:37:18.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 04:37:18.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423175,ok=423175,error=0, records=41
[INFO ] 2026-06-01 04:37:19.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:37:22.909 [16717] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:37:33.250 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-01 04:37:33.250 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423176,ok=423176,error=0, records=41
[INFO ] 2026-06-01 04:37:34.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:37:37.914 [16729] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:37:38.573 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21170/300s
[INFO ] 2026-06-01 04:37:40.506 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21170/300s
[INFO ] 2026-06-01 04:37:48.176 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21170/300s
[INFO ] 2026-06-01 04:37:48.255 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 04:37:48.255 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423177,ok=423177,error=0, records=41
[INFO ] 2026-06-01 04:37:49.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:37:52.920 [16757] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:38:03.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 04:38:03.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423178,ok=423178,error=0, records=41
[INFO ] 2026-06-01 04:38:04.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:38:07.925 [16763] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:38:18.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 04:38:18.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423179,ok=423179,error=0, records=41
[INFO ] 2026-06-01 04:38:19.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:38:22.931 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:38:33.273 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 04:38:33.273 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423180,ok=423180,error=0, records=41
[INFO ] 2026-06-01 04:38:34.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:38:37.938 [16805] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:38:48.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 04:38:48.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423181,ok=423181,error=0, records=41
[INFO ] 2026-06-01 04:38:49.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:38:49.971 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 04:38:52.945 [16831] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:39:03.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 04:39:03.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423182,ok=423182,error=0, records=41
[INFO ] 2026-06-01 04:39:04.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:39:07.950 [16810] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:39:18.291 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 04:39:18.291 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423183,ok=423183,error=0, records=41
[INFO ] 2026-06-01 04:39:19.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:39:22.954 [16785] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:39:33.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 04:39:33.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423184,ok=423184,error=0, records=41
[INFO ] 2026-06-01 04:39:34.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:39:36.936 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17628/300s
[INFO ] 2026-06-01 04:39:36.937 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20873780},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:39:37.099 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:39:37.099 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 04:39:37.100 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:39:37.100 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:39:37.100 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:39:37.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:39:37.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:39:37.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:39:37.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:39:37.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:39:37.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:39:37.960 [16870] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:39:48.301 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 04:39:48.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423185,ok=423185,error=0, records=41
[INFO ] 2026-06-01 04:39:49.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:39:52.965 [16810] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:39:53.465 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21165/300s
[INFO ] 2026-06-01 04:40:00.725 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21174/300s
[INFO ] 2026-06-01 04:40:03.396 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 04:40:03.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423186,ok=423186,error=0, records=41
[INFO ] 2026-06-01 04:40:03.396 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21161/300s
[INFO ] 2026-06-01 04:40:04.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:40:07.970 [16856] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:40:18.401 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 04:40:18.401 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423187,ok=423187,error=0, records=41
[INFO ] 2026-06-01 04:40:19.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:40:22.974 [16856] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:40:33.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 04:40:33.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423188,ok=423188,error=0, records=41
[INFO ] 2026-06-01 04:40:34.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:40:37.979 [16856] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:40:40.861 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21174/300s
[INFO ] 2026-06-01 04:40:48.411 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 04:40:48.411 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423189,ok=423189,error=0, records=41
[INFO ] 2026-06-01 04:40:49.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:40:52.984 [16856] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:40:58.188 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21161/300s
[INFO ] 2026-06-01 04:41:03.419 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 04:41:03.419 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423190,ok=423190,error=0, records=41
[INFO ] 2026-06-01 04:41:04.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:41:07.989 [16856] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:41:18.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 04:41:18.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423191,ok=423191,error=0, records=41
[INFO ] 2026-06-01 04:41:19.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:41:22.993 [16841] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:41:31.459 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21170/300s
[INFO ] 2026-06-01 04:41:33.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 04:41:33.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423192,ok=423192,error=0, records=41
[INFO ] 2026-06-01 04:41:34.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:41:37.998 [16810] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:41:48.435 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 04:41:48.435 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423193,ok=423193,error=0, records=41
[INFO ] 2026-06-01 04:41:49.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:41:53.003 [16904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:42:03.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 04:42:03.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423194,ok=423194,error=0, records=41
[INFO ] 2026-06-01 04:42:04.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:42:04.978 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21173/300s
[WARN ] 2026-06-01 04:42:08.008 [16904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:42:18.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 04:42:18.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423195,ok=423195,error=0, records=41
[INFO ] 2026-06-01 04:42:19.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:42:23.014 [16958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 04:42:32.519 [16904] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6164/stat), No such file or directory
[INFO ] 2026-06-01 04:42:33.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 04:42:33.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423196,ok=423196,error=0, records=41
[INFO ] 2026-06-01 04:42:34.979 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:42:37.101 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20873692},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:42:37.253 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:42:37.253 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 04:42:37.253 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:42:37.253 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:42:37.253 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:42:37.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:42:37.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:42:37.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:42:37.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:42:37.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:42:37.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:42:38.018 [16904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:42:38.671 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21171/300s
[INFO ] 2026-06-01 04:42:40.513 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21171/300s
[WARN ] 2026-06-01 04:42:47.523 [16904] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6164/stat), No such file or directory
[INFO ] 2026-06-01 04:42:48.176 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21171/300s
[INFO ] 2026-06-01 04:42:48.457 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 04:42:48.457 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423197,ok=423197,error=0, records=41
[INFO ] 2026-06-01 04:42:49.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:42:53.022 [17074] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:43:03.461 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 04:43:03.461 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423198,ok=423198,error=0, records=41
[INFO ] 2026-06-01 04:43:04.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:43:08.027 [17058] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:43:18.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 04:43:18.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423199,ok=423199,error=0, records=41
[INFO ] 2026-06-01 04:43:19.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:43:23.032 [16958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:43:33.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 04:43:33.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423200,ok=423200,error=0, records=41
[INFO ] 2026-06-01 04:43:34.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 04:43:34.981 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 04:43:38.037 [17074] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:43:48.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 04:43:48.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423201,ok=423201,error=0, records=41
[INFO ] 2026-06-01 04:43:49.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:43:53.041 [17133] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:44:03.485 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 04:44:03.485 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423202,ok=423202,error=0, records=41
[INFO ] 2026-06-01 04:44:04.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:44:08.046 [17153] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:44:18.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-01 04:44:18.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423203,ok=423203,error=0, records=41
[INFO ] 2026-06-01 04:44:19.983 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:44:23.051 [17142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:44:33.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-01 04:44:33.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423204,ok=423204,error=0, records=41
[INFO ] 2026-06-01 04:44:34.983 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:44:37.557 [17142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:44:48.505 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 04:44:48.505 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423205,ok=423205,error=0, records=41
[INFO ] 2026-06-01 04:44:49.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:44:52.567 [17192] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:44:53.567 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21166/300s
[INFO ] 2026-06-01 04:45:00.727 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21175/300s
[INFO ] 2026-06-01 04:45:03.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 04:45:03.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423206,ok=423206,error=0, records=41
[INFO ] 2026-06-01 04:45:03.509 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21162/300s
[INFO ] 2026-06-01 04:45:04.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:45:07.571 [17221] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:45:18.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 04:45:18.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423207,ok=423207,error=0, records=41
[INFO ] 2026-06-01 04:45:19.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:45:22.576 [17244] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:45:33.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 04:45:33.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423208,ok=423208,error=0, records=41
[INFO ] 2026-06-01 04:45:34.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:45:37.254 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17629/300s
[INFO ] 2026-06-01 04:45:37.255 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20873608},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:45:37.428 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:45:37.428 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 04:45:37.428 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:45:37.428 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:45:37.428 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:45:37.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:45:37.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:45:37.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:45:37.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:45:37.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:45:37.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 04:45:37.582 [17237] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:45:40.867 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21175/300s
[INFO ] 2026-06-01 04:45:48.524 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 04:45:48.524 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423209,ok=423209,error=0, records=41
[INFO ] 2026-06-01 04:45:49.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:45:52.586 [17204] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:45:58.361 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21162/300s
[INFO ] 2026-06-01 04:46:03.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 04:46:03.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423210,ok=423210,error=0, records=41
[INFO ] 2026-06-01 04:46:04.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:46:07.591 [17292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:46:18.538 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 04:46:18.538 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423211,ok=423211,error=0, records=41
[INFO ] 2026-06-01 04:46:19.988 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:46:22.595 [17313] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:46:31.505 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21171/300s
[INFO ] 2026-06-01 04:46:33.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 04:46:33.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423212,ok=423212,error=0, records=41
[INFO ] 2026-06-01 04:46:34.988 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:46:37.602 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:46:48.550 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 04:46:48.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423213,ok=423213,error=0, records=41
[INFO ] 2026-06-01 04:46:49.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:46:52.607 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:47:03.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 04:47:03.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423214,ok=423214,error=0, records=41
[INFO ] 2026-06-01 04:47:04.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:47:04.990 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21174/300s
[WARN ] 2026-06-01 04:47:07.612 [17274] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:47:18.610 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 04:47:18.610 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423215,ok=423215,error=0, records=41
[INFO ] 2026-06-01 04:47:19.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:47:22.617 [17313] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:47:33.616 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 04:47:33.616 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423216,ok=423216,error=0, records=41
[INFO ] 2026-06-01 04:47:34.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:47:37.622 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:47:38.709 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21172/300s
[INFO ] 2026-06-01 04:47:40.564 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21172/300s
[INFO ] 2026-06-01 04:47:48.215 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21172/300s
[INFO ] 2026-06-01 04:47:48.623 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 04:47:48.623 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423217,ok=423217,error=0, records=41
[INFO ] 2026-06-01 04:47:49.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:47:52.628 [17268] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:48:03.706 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-01 04:48:03.706 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423218,ok=423218,error=0, records=41
[INFO ] 2026-06-01 04:48:04.992 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:48:07.633 [17274] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:48:18.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 04:48:18.711 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423219,ok=423219,error=0, records=41
[INFO ] 2026-06-01 04:48:19.992 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:48:22.639 [17313] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:48:33.717 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-01 04:48:33.717 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423220,ok=423220,error=0, records=41
[INFO ] 2026-06-01 04:48:34.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:48:37.430 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20873532},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:48:37.580 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:48:37.580 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 04:48:37.580 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:48:37.580 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:48:37.580 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[WARN ] 2026-06-01 04:48:37.644 [17268] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:48:37.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:48:37.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:48:37.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:48:37.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:48:37.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:48:37.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:48:48.722 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 04:48:48.722 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423221,ok=423221,error=0, records=41
[INFO ] 2026-06-01 04:48:49.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:48:52.650 [17268] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:49:03.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 04:49:03.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423222,ok=423222,error=0, records=41
[INFO ] 2026-06-01 04:49:04.994 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:49:07.656 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:49:18.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 04:49:18.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423223,ok=423223,error=0, records=41
[INFO ] 2026-06-01 04:49:19.994 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:49:22.661 [17274] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:49:33.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 04:49:33.737 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423224,ok=423224,error=0, records=41
[INFO ] 2026-06-01 04:49:34.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:49:37.666 [17307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:49:48.742 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-01 04:49:48.742 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423225,ok=423225,error=0, records=41
[INFO ] 2026-06-01 04:49:49.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:49:52.671 [17313] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:49:53.671 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21167/300s
[INFO ] 2026-06-01 04:50:00.731 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21176/300s
[INFO ] 2026-06-01 04:50:03.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 04:50:03.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423226,ok=423226,error=0, records=41
[INFO ] 2026-06-01 04:50:03.747 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21163/300s
[INFO ] 2026-06-01 04:50:04.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:50:07.677 [17313] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:50:18.752 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 04:50:18.752 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423227,ok=423227,error=0, records=41
[INFO ] 2026-06-01 04:50:19.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:50:22.681 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:50:33.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 04:50:33.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423228,ok=423228,error=0, records=41
[INFO ] 2026-06-01 04:50:34.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:50:37.686 [17313] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:50:40.872 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21176/300s
[INFO ] 2026-06-01 04:50:48.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 04:50:48.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423229,ok=423229,error=0, records=41
[INFO ] 2026-06-01 04:50:49.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:50:52.692 [17274] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:50:58.533 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21163/300s
[INFO ] 2026-06-01 04:51:03.769 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 04:51:03.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423230,ok=423230,error=0, records=41
[INFO ] 2026-06-01 04:51:04.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:51:07.697 [17268] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:51:18.775 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 04:51:18.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423231,ok=423231,error=0, records=41
[INFO ] 2026-06-01 04:51:19.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:51:22.702 [17313] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:51:31.550 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21172/300s
[INFO ] 2026-06-01 04:51:33.782 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 04:51:33.782 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423232,ok=423232,error=0, records=41
[INFO ] 2026-06-01 04:51:34.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:51:37.580 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17630/300s
[INFO ] 2026-06-01 04:51:37.582 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20873448},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-01 04:51:37.707 [17307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:51:37.756 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:51:37.757 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 04:51:37.757 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:51:37.757 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:51:37.757 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:51:37.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:51:37.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:51:37.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:51:37.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:51:37.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:51:37.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:51:48.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 04:51:48.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423233,ok=423233,error=0, records=41
[INFO ] 2026-06-01 04:51:50.000 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:51:52.713 [17313] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:52:03.793 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 04:52:03.793 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423234,ok=423234,error=0, records=41
[INFO ] 2026-06-01 04:52:05.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:52:05.001 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21175/300s
[WARN ] 2026-06-01 04:52:07.718 [17268] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:52:18.800 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 04:52:18.800 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423235,ok=423235,error=0, records=41
[INFO ] 2026-06-01 04:52:20.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:52:22.723 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:52:33.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 04:52:33.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423236,ok=423236,error=0, records=41
[INFO ] 2026-06-01 04:52:35.002 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:52:37.727 [17274] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:52:38.724 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21173/300s
[INFO ] 2026-06-01 04:52:40.602 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21173/300s
[INFO ] 2026-06-01 04:52:48.233 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21173/300s
[INFO ] 2026-06-01 04:52:48.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 04:52:48.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423237,ok=423237,error=0, records=41
[INFO ] 2026-06-01 04:52:50.002 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:52:52.732 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:53:03.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 04:53:03.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423238,ok=423238,error=0, records=41
[INFO ] 2026-06-01 04:53:05.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:53:07.736 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:53:18.832 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 04:53:18.832 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423239,ok=423239,error=0, records=41
[INFO ] 2026-06-01 04:53:20.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:53:22.741 [17307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:53:33.837 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 04:53:33.837 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423240,ok=423240,error=0, records=41
[INFO ] 2026-06-01 04:53:35.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 04:53:35.004 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 04:53:37.747 [17307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:53:48.842 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 04:53:48.842 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423241,ok=423241,error=0, records=41
[INFO ] 2026-06-01 04:53:50.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:53:50.005 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 04:53:52.753 [17307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:54:03.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 04:54:03.850 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423242,ok=423242,error=0, records=41
[INFO ] 2026-06-01 04:54:05.006 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:54:07.757 [17307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:54:18.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 04:54:18.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423243,ok=423243,error=0, records=41
[INFO ] 2026-06-01 04:54:20.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:54:22.764 [17274] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:54:33.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 04:54:33.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423244,ok=423244,error=0, records=41
[INFO ] 2026-06-01 04:54:35.008 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:54:37.758 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20873364},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-01 04:54:37.768 [17307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:54:37.917 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:54:37.917 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 04:54:48.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 04:54:48.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423245,ok=423245,error=0, records=41
[INFO ] 2026-06-01 04:54:50.008 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:54:52.773 [17307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:54:53.774 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21168/300s
[INFO ] 2026-06-01 04:55:00.734 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21177/300s
[INFO ] 2026-06-01 04:55:03.873 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 04:55:03.873 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423246,ok=423246,error=0, records=41
[INFO ] 2026-06-01 04:55:03.873 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21164/300s
[INFO ] 2026-06-01 04:55:05.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:55:07.780 [17268] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:55:18.878 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 04:55:18.878 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423247,ok=423247,error=0, records=41
[INFO ] 2026-06-01 04:55:20.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:55:22.785 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:55:33.885 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 04:55:33.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423248,ok=423248,error=0, records=41
[INFO ] 2026-06-01 04:55:35.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:55:37.791 [17268] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:55:40.878 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21177/300s
[INFO ] 2026-06-01 04:55:48.890 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 04:55:48.890 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423249,ok=423249,error=0, records=41
[INFO ] 2026-06-01 04:55:50.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:55:52.795 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:55:58.714 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21164/300s
[INFO ] 2026-06-01 04:56:03.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 04:56:03.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423250,ok=423250,error=0, records=41
[INFO ] 2026-06-01 04:56:05.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:56:07.800 [17274] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:56:18.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 04:56:18.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423251,ok=423251,error=0, records=41
[INFO ] 2026-06-01 04:56:20.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:56:22.805 [17268] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:56:31.600 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21173/300s
[INFO ] 2026-06-01 04:56:33.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 04:56:33.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423252,ok=423252,error=0, records=41
[INFO ] 2026-06-01 04:56:35.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:56:37.810 [17866] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:56:48.912 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 04:56:48.912 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423253,ok=423253,error=0, records=41
[INFO ] 2026-06-01 04:56:50.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:56:52.815 [17872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:57:03.917 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 04:57:03.917 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423254,ok=423254,error=0, records=41
[INFO ] 2026-06-01 04:57:05.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 04:57:05.014 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21176/300s
[WARN ] 2026-06-01 04:57:07.820 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:57:18.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 04:57:18.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423255,ok=423255,error=0, records=41
[INFO ] 2026-06-01 04:57:20.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:57:22.825 [17278] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:57:33.967 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-01 04:57:33.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423256,ok=423256,error=0, records=41
[INFO ] 2026-06-01 04:57:35.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:57:37.830 [17881] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:57:37.917 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17631/300s
[INFO ] 2026-06-01 04:57:37.918 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20873292},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 04:57:38.074 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 04:57:38.074 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 04:57:38.074 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 04:57:38.074 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 04:57:38.074 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 04:57:38.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:57:38.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 04:57:38.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:57:38.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 04:57:38.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:57:38.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 04:57:38.749 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21174/300s
[INFO ] 2026-06-01 04:57:40.650 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21174/300s
[INFO ] 2026-06-01 04:57:48.255 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21174/300s
[INFO ] 2026-06-01 04:57:48.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 04:57:48.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423257,ok=423257,error=0, records=41
[INFO ] 2026-06-01 04:57:50.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:57:52.836 [17872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:58:03.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 04:58:03.981 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423258,ok=423258,error=0, records=41
[INFO ] 2026-06-01 04:58:05.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:58:07.841 [17954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:58:18.986 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 04:58:18.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423259,ok=423259,error=0, records=41
[INFO ] 2026-06-01 04:58:20.017 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:58:22.846 [17307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:58:33.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 04:58:33.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423260,ok=423260,error=0, records=41
[INFO ] 2026-06-01 04:58:35.017 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:58:37.852 [17916] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:58:48.997 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 04:58:48.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423261,ok=423261,error=0, records=41
[INFO ] 2026-06-01 04:58:50.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:58:52.857 [17916] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:59:04.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 04:59:04.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423262,ok=423262,error=0, records=41
[INFO ] 2026-06-01 04:59:05.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:59:07.863 [17954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:59:19.008 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 04:59:19.008 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423263,ok=423263,error=0, records=41
[INFO ] 2026-06-01 04:59:20.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:59:22.868 [17954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:59:34.013 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 04:59:34.013 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423264,ok=423264,error=0, records=41
[INFO ] 2026-06-01 04:59:35.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:59:37.873 [17872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:59:49.018 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 04:59:49.018 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423265,ok=423265,error=0, records=41
[INFO ] 2026-06-01 04:59:50.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 04:59:52.879 [18009] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 04:59:53.879 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21169/300s
[INFO ] 2026-06-01 05:00:00.737 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21178/300s
[INFO ] 2026-06-01 05:00:04.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 05:00:04.023 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423266,ok=423266,error=0, records=41
[INFO ] 2026-06-01 05:00:04.023 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21165/300s
[INFO ] 2026-06-01 05:00:05.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 05:00:07.884 [18054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:00:19.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 05:00:19.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423267,ok=423267,error=0, records=41
[INFO ] 2026-06-01 05:00:20.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 05:00:22.889 [18092] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:00:34.033 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 05:00:34.033 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423268,ok=423268,error=0, records=41
[INFO ] 2026-06-01 05:00:35.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 05:00:37.894 [18092] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:00:38.076 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20873208},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:00:38.223 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:00:38.223 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 05:00:38.223 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:00:38.223 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:00:38.223 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:00:38.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:00:38.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:00:38.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:00:38.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:00:38.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:00:38.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:00:40.884 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21178/300s
[INFO ] 2026-06-01 05:00:49.038 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 05:00:49.038 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423269,ok=423269,error=0, records=41
[INFO ] 2026-06-01 05:00:50.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 05:00:52.899 [18124] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:00:58.890 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21165/300s
[INFO ] 2026-06-01 05:01:04.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 05:01:04.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423270,ok=423270,error=0, records=41
[INFO ] 2026-06-01 05:01:05.023 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 05:01:07.904 [18160] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:01:19.052 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 05:01:19.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423271,ok=423271,error=0, records=41
[INFO ] 2026-06-01 05:01:20.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 05:01:22.909 [18165] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:01:31.653 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21174/300s
[INFO ] 2026-06-01 05:01:34.056 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 05:01:34.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423272,ok=423272,error=0, records=41
[INFO ] 2026-06-01 05:01:35.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 05:01:37.915 [18178] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:01:49.074 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 05:01:49.074 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423273,ok=423273,error=0, records=41
[INFO ] 2026-06-01 05:01:50.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 05:01:52.921 [18165] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:02:04.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 05:02:04.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423274,ok=423274,error=0, records=41
[INFO ] 2026-06-01 05:02:05.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:02:05.025 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21177/300s
[WARN ] 2026-06-01 05:02:07.926 [18187] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:02:19.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 05:02:19.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423275,ok=423275,error=0, records=41
[INFO ] 2026-06-01 05:02:20.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 05:02:22.931 [18220] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:02:35.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-01 05:02:35.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 05:02:35.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423276,ok=423276,error=0, records=41
[WARN ] 2026-06-01 05:02:37.938 [18237] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:02:38.776 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21175/300s
[INFO ] 2026-06-01 05:02:40.677 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21175/300s
[INFO ] 2026-06-01 05:02:48.280 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21175/300s
[INFO ] 2026-06-01 05:02:50.027 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:02:50.151 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11605, records=49
[INFO ] 2026-06-01 05:02:50.151 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423277,ok=423277,error=0, records=49
[WARN ] 2026-06-01 05:02:52.943 [18265] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:03:05.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:03:05.156 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 05:03:05.156 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423278,ok=423278,error=0, records=41
[WARN ] 2026-06-01 05:03:07.948 [18287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:03:20.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:03:20.161 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=12292, records=49
[INFO ] 2026-06-01 05:03:20.161 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423279,ok=423279,error=0, records=49
[WARN ] 2026-06-01 05:03:22.953 [18303] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:03:35.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 05:03:35.029 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 05:03:35.168 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10392, records=41
[INFO ] 2026-06-01 05:03:35.168 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423280,ok=423280,error=0, records=41
[WARN ] 2026-06-01 05:03:37.957 [18288] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:03:38.224 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17632/300s
[INFO ] 2026-06-01 05:03:38.225 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20873136},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:03:38.401 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:03:38.401 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 05:03:38.401 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:03:38.401 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:03:38.401 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:03:38.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:03:38.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:03:38.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:03:38.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:03:38.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:03:38.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:03:50.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:03:50.174 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-01 05:03:50.174 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423281,ok=423281,error=0, records=41
[WARN ] 2026-06-01 05:03:52.962 [18333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:04:05.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:04:05.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 05:04:05.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423282,ok=423282,error=0, records=41
[WARN ] 2026-06-01 05:04:07.967 [18333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:04:20.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:04:20.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-01 05:04:20.186 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423283,ok=423283,error=0, records=41
[WARN ] 2026-06-01 05:04:22.972 [18347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:04:35.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:04:35.192 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 05:04:35.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423284,ok=423284,error=0, records=41
[WARN ] 2026-06-01 05:04:37.977 [18347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:04:50.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:04:50.198 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 05:04:50.198 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423285,ok=423285,error=0, records=41
[WARN ] 2026-06-01 05:04:52.982 [18347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:04:53.982 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21170/300s
[INFO ] 2026-06-01 05:05:00.740 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21179/300s
[INFO ] 2026-06-01 05:05:05.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:05:05.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 05:05:05.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423286,ok=423286,error=0, records=41
[INFO ] 2026-06-01 05:05:05.204 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21166/300s
[WARN ] 2026-06-01 05:05:07.987 [18347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:05:20.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:05:20.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 05:05:20.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423287,ok=423287,error=0, records=41
[WARN ] 2026-06-01 05:05:22.991 [18303] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:05:35.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:05:35.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 05:05:35.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423288,ok=423288,error=0, records=41
[WARN ] 2026-06-01 05:05:37.996 [18389] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:05:40.890 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21179/300s
[INFO ] 2026-06-01 05:05:50.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:05:50.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 05:05:50.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423289,ok=423289,error=0, records=41
[WARN ] 2026-06-01 05:05:53.001 [18347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:05:59.055 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21166/300s
[INFO ] 2026-06-01 05:06:05.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:06:05.223 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 05:06:05.223 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423290,ok=423290,error=0, records=41
[WARN ] 2026-06-01 05:06:08.005 [18417] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:06:20.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:06:20.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 05:06:20.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423291,ok=423291,error=0, records=41
[WARN ] 2026-06-01 05:06:23.010 [18473] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:06:31.700 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21175/300s
[INFO ] 2026-06-01 05:06:35.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:06:35.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 05:06:35.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423292,ok=423292,error=0, records=41
[WARN ] 2026-06-01 05:06:38.015 [18473] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:06:38.403 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20873056},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:06:38.584 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:06:38.585 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 05:06:38.585 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:06:38.585 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:06:38.585 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:06:38.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:06:38.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:06:38.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:06:38.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:06:38.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:06:38.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:06:50.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:06:50.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 05:06:50.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423293,ok=423293,error=0, records=41
[WARN ] 2026-06-01 05:06:53.019 [18487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:07:05.037 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:07:05.037 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21178/300s
[INFO ] 2026-06-01 05:07:05.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 05:07:05.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423294,ok=423294,error=0, records=41
[WARN ] 2026-06-01 05:07:08.024 [18389] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:07:20.037 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:07:20.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 05:07:20.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423295,ok=423295,error=0, records=41
[WARN ] 2026-06-01 05:07:23.029 [18473] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:07:35.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:07:35.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 05:07:35.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423296,ok=423296,error=0, records=41
[WARN ] 2026-06-01 05:07:38.034 [18473] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:07:38.800 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21176/300s
[INFO ] 2026-06-01 05:07:40.701 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21176/300s
[INFO ] 2026-06-01 05:07:48.304 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21176/300s
[INFO ] 2026-06-01 05:07:50.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:07:50.261 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 05:07:50.261 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423297,ok=423297,error=0, records=41
[WARN ] 2026-06-01 05:07:53.038 [18560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:08:05.039 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:08:05.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 05:08:05.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423298,ok=423298,error=0, records=41
[WARN ] 2026-06-01 05:08:08.042 [18544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:08:20.039 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:08:20.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 05:08:20.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423299,ok=423299,error=0, records=41
[WARN ] 2026-06-01 05:08:23.046 [18593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:08:35.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:08:35.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 05:08:35.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423300,ok=423300,error=0, records=41
[WARN ] 2026-06-01 05:08:38.050 [18610] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:08:50.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:08:50.040 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 05:08:50.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 05:08:50.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423301,ok=423301,error=0, records=41
[WARN ] 2026-06-01 05:08:52.554 [18621] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:09:05.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:09:05.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-01 05:09:05.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423302,ok=423302,error=0, records=41
[WARN ] 2026-06-01 05:09:07.558 [18644] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:09:20.042 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:09:20.291 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 05:09:20.291 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423303,ok=423303,error=0, records=41
[WARN ] 2026-06-01 05:09:22.563 [18650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:09:35.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:09:35.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 05:09:35.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423304,ok=423304,error=0, records=41
[WARN ] 2026-06-01 05:09:37.568 [18650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:09:38.585 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17633/300s
[INFO ] 2026-06-01 05:09:38.586 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872984},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:09:38.735 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:09:38.735 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 05:09:38.735 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:09:38.735 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:09:38.735 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:09:38.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:09:38.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:09:38.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:09:38.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:09:38.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:09:38.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:09:50.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:09:50.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-01 05:09:50.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423305,ok=423305,error=0, records=41
[WARN ] 2026-06-01 05:09:52.573 [18685] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:09:54.073 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21171/300s
[INFO ] 2026-06-01 05:10:00.743 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21180/300s
[INFO ] 2026-06-01 05:10:05.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:10:05.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 05:10:05.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423306,ok=423306,error=0, records=41
[INFO ] 2026-06-01 05:10:05.310 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21167/300s
[WARN ] 2026-06-01 05:10:07.578 [18663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:10:20.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:10:20.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 05:10:20.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423307,ok=423307,error=0, records=41
[WARN ] 2026-06-01 05:10:22.583 [18735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:10:35.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:10:35.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 05:10:35.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423308,ok=423308,error=0, records=41
[WARN ] 2026-06-01 05:10:37.587 [18742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:10:40.895 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21180/300s
[INFO ] 2026-06-01 05:10:50.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:10:50.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 05:10:50.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423309,ok=423309,error=0, records=41
[WARN ] 2026-06-01 05:10:52.592 [18747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:10:59.227 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21167/300s
[INFO ] 2026-06-01 05:11:05.046 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:11:05.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 05:11:05.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423310,ok=423310,error=0, records=41
[WARN ] 2026-06-01 05:11:07.597 [18747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:11:20.047 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:11:20.454 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 05:11:20.454 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423311,ok=423311,error=0, records=41
[WARN ] 2026-06-01 05:11:22.602 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:11:31.743 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21176/300s
[INFO ] 2026-06-01 05:11:35.047 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:11:35.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-01 05:11:35.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423312,ok=423312,error=0, records=41
[WARN ] 2026-06-01 05:11:37.607 [18742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:11:50.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:11:50.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 05:11:50.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423313,ok=423313,error=0, records=41
[WARN ] 2026-06-01 05:11:52.612 [18742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:12:05.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:12:05.048 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21179/300s
[INFO ] 2026-06-01 05:12:05.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 05:12:05.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423314,ok=423314,error=0, records=41
[WARN ] 2026-06-01 05:12:07.618 [18774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:12:20.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:12:20.521 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 05:12:20.521 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423315,ok=423315,error=0, records=41
[WARN ] 2026-06-01 05:12:22.623 [18789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:12:35.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:12:35.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 05:12:35.526 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423316,ok=423316,error=0, records=41
[WARN ] 2026-06-01 05:12:37.628 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:12:38.736 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872888},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:12:38.896 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21177/300s
[INFO ] 2026-06-01 05:12:38.928 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:12:38.928 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-01 05:12:38.928 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:12:38.928 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:12:38.928 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:12:38.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:12:38.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:12:38.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:12:38.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:12:38.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:12:38.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:12:40.798 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21177/300s
[INFO ] 2026-06-01 05:12:48.402 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21177/300s
[INFO ] 2026-06-01 05:12:50.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:12:50.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 05:12:50.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423317,ok=423317,error=0, records=41
[WARN ] 2026-06-01 05:12:52.634 [18774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:13:05.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:13:05.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-01 05:13:05.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423318,ok=423318,error=0, records=41
[WARN ] 2026-06-01 05:13:07.639 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:13:20.051 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:13:20.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 05:13:20.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423319,ok=423319,error=0, records=41
[WARN ] 2026-06-01 05:13:22.643 [18747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:13:35.052 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 05:13:35.052 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 05:13:35.551 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 05:13:35.551 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423320,ok=423320,error=0, records=41
[WARN ] 2026-06-01 05:13:37.650 [18742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:13:50.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:13:50.556 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 05:13:50.556 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423321,ok=423321,error=0, records=41
[WARN ] 2026-06-01 05:13:52.654 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:14:05.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:14:05.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 05:14:05.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423322,ok=423322,error=0, records=41
[WARN ] 2026-06-01 05:14:07.661 [18742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:14:20.054 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:14:20.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 05:14:20.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423323,ok=423323,error=0, records=41
[WARN ] 2026-06-01 05:14:22.665 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:14:35.054 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:14:35.576 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 05:14:35.576 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423324,ok=423324,error=0, records=41
[WARN ] 2026-06-01 05:14:37.671 [18742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:14:50.055 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:14:50.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 05:14:50.581 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423325,ok=423325,error=0, records=41
[WARN ] 2026-06-01 05:14:52.676 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:14:54.177 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21172/300s
[INFO ] 2026-06-01 05:15:00.746 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21181/300s
[INFO ] 2026-06-01 05:15:05.056 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:15:05.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 05:15:05.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423326,ok=423326,error=0, records=41
[INFO ] 2026-06-01 05:15:05.603 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21168/300s
[WARN ] 2026-06-01 05:15:07.681 [18742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:15:20.056 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:15:20.674 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 05:15:20.674 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423327,ok=423327,error=0, records=41
[WARN ] 2026-06-01 05:15:22.686 [18742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:15:35.057 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:15:35.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 05:15:35.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423328,ok=423328,error=0, records=41
[WARN ] 2026-06-01 05:15:37.691 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:15:38.928 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17634/300s
[INFO ] 2026-06-01 05:15:38.929 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872812},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:15:39.116 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:15:39.116 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-01 05:15:39.116 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:15:39.116 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:15:39.116 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:15:39.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:15:39.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:15:39.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:15:39.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:15:39.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:15:39.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:15:40.901 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21181/300s
[INFO ] 2026-06-01 05:15:50.057 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:15:50.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 05:15:50.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423329,ok=423329,error=0, records=41
[WARN ] 2026-06-01 05:15:52.696 [18747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:15:59.405 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21168/300s
[INFO ] 2026-06-01 05:16:05.058 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:16:05.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 05:16:05.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423330,ok=423330,error=0, records=41
[WARN ] 2026-06-01 05:16:07.701 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:16:20.060 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:16:20.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 05:16:20.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423331,ok=423331,error=0, records=41
[WARN ] 2026-06-01 05:16:22.706 [18789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:16:31.792 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21177/300s
[INFO ] 2026-06-01 05:16:35.061 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:16:35.702 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 05:16:35.702 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423332,ok=423332,error=0, records=41
[WARN ] 2026-06-01 05:16:37.712 [18789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:16:50.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:16:50.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 05:16:50.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423333,ok=423333,error=0, records=41
[WARN ] 2026-06-01 05:16:52.717 [18747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:17:05.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:17:05.062 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21180/300s
[INFO ] 2026-06-01 05:17:05.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 05:17:05.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423334,ok=423334,error=0, records=41
[WARN ] 2026-06-01 05:17:07.722 [18789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:17:20.063 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:17:20.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 05:17:20.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423335,ok=423335,error=0, records=41
[WARN ] 2026-06-01 05:17:22.729 [18747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:17:35.064 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:17:35.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 05:17:35.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423336,ok=423336,error=0, records=41
[WARN ] 2026-06-01 05:17:37.734 [18747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:17:38.951 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21178/300s
[INFO ] 2026-06-01 05:17:40.852 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21178/300s
[INFO ] 2026-06-01 05:17:48.457 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21178/300s
[INFO ] 2026-06-01 05:17:50.064 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:17:50.730 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 05:17:50.730 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423337,ok=423337,error=0, records=41
[WARN ] 2026-06-01 05:17:52.739 [18774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:18:05.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:18:05.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 05:18:05.737 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423338,ok=423338,error=0, records=41
[WARN ] 2026-06-01 05:18:07.745 [18774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:18:20.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:18:20.749 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 05:18:20.749 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423339,ok=423339,error=0, records=41
[WARN ] 2026-06-01 05:18:22.751 [18774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:18:35.066 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:18:35.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 05:18:35.754 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423340,ok=423340,error=0, records=41
[WARN ] 2026-06-01 05:18:37.756 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:18:39.117 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872736},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:18:39.283 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:18:39.283 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 05:18:39.284 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:18:39.284 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:18:39.284 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:18:39.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:18:39.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:18:39.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:18:39.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:18:39.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:18:39.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:18:50.066 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:18:50.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 05:18:50.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423341,ok=423341,error=0, records=41
[WARN ] 2026-06-01 05:18:52.761 [18774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:19:05.067 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:19:05.859 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 05:19:05.859 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423342,ok=423342,error=0, records=41
[WARN ] 2026-06-01 05:19:07.767 [18742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:19:20.068 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:19:20.869 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 05:19:20.869 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423343,ok=423343,error=0, records=41
[WARN ] 2026-06-01 05:19:22.771 [18774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:19:35.069 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:19:35.873 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 05:19:35.873 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423344,ok=423344,error=0, records=41
[WARN ] 2026-06-01 05:19:37.776 [18747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:19:50.069 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:19:50.878 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 05:19:50.878 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423345,ok=423345,error=0, records=41
[WARN ] 2026-06-01 05:19:52.782 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:19:54.282 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21173/300s
[INFO ] 2026-06-01 05:20:00.749 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21182/300s
[INFO ] 2026-06-01 05:20:05.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:20:05.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 05:20:05.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423346,ok=423346,error=0, records=41
[INFO ] 2026-06-01 05:20:05.904 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21169/300s
[WARN ] 2026-06-01 05:20:07.787 [18789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:20:20.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:20:20.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 05:20:20.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423347,ok=423347,error=0, records=41
[WARN ] 2026-06-01 05:20:22.792 [18789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:20:35.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:20:35.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 05:20:35.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423348,ok=423348,error=0, records=41
[WARN ] 2026-06-01 05:20:37.797 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:20:40.907 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21182/300s
[INFO ] 2026-06-01 05:20:50.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:20:50.921 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 05:20:50.921 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423349,ok=423349,error=0, records=41
[WARN ] 2026-06-01 05:20:52.803 [18742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:20:59.582 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21169/300s
[INFO ] 2026-06-01 05:21:05.073 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:21:05.927 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 05:21:05.927 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423350,ok=423350,error=0, records=41
[WARN ] 2026-06-01 05:21:07.807 [18747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:21:20.074 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:21:20.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 05:21:20.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423351,ok=423351,error=0, records=41
[WARN ] 2026-06-01 05:21:22.812 [18742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:21:31.842 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21178/300s
[INFO ] 2026-06-01 05:21:35.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:21:35.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 05:21:35.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423352,ok=423352,error=0, records=41
[WARN ] 2026-06-01 05:21:37.817 [19326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:21:39.284 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17635/300s
[INFO ] 2026-06-01 05:21:39.285 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872644},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:21:39.447 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:21:39.447 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 05:21:39.447 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:21:39.447 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:21:39.447 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:21:39.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:21:39.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:21:39.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:21:39.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:21:39.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:21:39.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:21:50.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:21:50.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 05:21:50.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423353,ok=423353,error=0, records=41
[WARN ] 2026-06-01 05:21:52.822 [18783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:22:05.076 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:22:05.076 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21181/300s
[INFO ] 2026-06-01 05:22:05.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 05:22:05.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423354,ok=423354,error=0, records=41
[WARN ] 2026-06-01 05:22:07.828 [19326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:22:20.076 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:22:20.953 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 05:22:20.953 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423355,ok=423355,error=0, records=41
[WARN ] 2026-06-01 05:22:22.834 [19384] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:22:35.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:22:35.958 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 05:22:35.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423356,ok=423356,error=0, records=41
[WARN ] 2026-06-01 05:22:37.839 [19370] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:22:38.965 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21179/300s
[INFO ] 2026-06-01 05:22:40.868 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21179/300s
[INFO ] 2026-06-01 05:22:48.471 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21179/300s
[INFO ] 2026-06-01 05:22:50.078 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:22:50.965 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 05:22:50.965 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423357,ok=423357,error=0, records=41
[WARN ] 2026-06-01 05:22:52.845 [19384] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:23:05.078 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:23:05.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 05:23:05.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423358,ok=423358,error=0, records=41
[WARN ] 2026-06-01 05:23:07.851 [19320] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:23:20.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:23:20.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 05:23:20.973 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423359,ok=423359,error=0, records=41
[WARN ] 2026-06-01 05:23:22.857 [19433] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:23:35.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 05:23:35.079 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 05:23:35.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 05:23:35.979 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423360,ok=423360,error=0, records=41
[WARN ] 2026-06-01 05:23:37.862 [19326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:23:50.080 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:23:50.080 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 05:23:50.984 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 05:23:50.984 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423361,ok=423361,error=0, records=41
[WARN ] 2026-06-01 05:23:52.867 [19384] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:24:05.081 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:24:05.990 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 05:24:05.990 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423362,ok=423362,error=0, records=41
[WARN ] 2026-06-01 05:24:07.872 [19489] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:24:20.082 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:24:21.008 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 05:24:21.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423363,ok=423363,error=0, records=41
[WARN ] 2026-06-01 05:24:22.876 [19510] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:24:35.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:24:36.017 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 05:24:36.017 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423364,ok=423364,error=0, records=41
[WARN ] 2026-06-01 05:24:37.882 [19326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:24:39.448 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872564},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:24:39.602 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:24:39.602 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 05:24:39.603 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:24:39.603 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:24:39.603 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:24:39.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:24:39.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:24:39.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:24:39.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:24:39.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:24:39.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:24:50.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:24:51.022 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 05:24:51.022 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423365,ok=423365,error=0, records=41
[WARN ] 2026-06-01 05:24:52.888 [19320] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:24:54.388 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21174/300s
[INFO ] 2026-06-01 05:25:00.752 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21183/300s
[INFO ] 2026-06-01 05:25:05.084 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:25:06.027 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 05:25:06.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423366,ok=423366,error=0, records=41
[INFO ] 2026-06-01 05:25:06.028 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21170/300s
[WARN ] 2026-06-01 05:25:07.894 [19320] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:25:20.084 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:25:21.033 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 05:25:21.033 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423367,ok=423367,error=0, records=41
[WARN ] 2026-06-01 05:25:22.898 [19537] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:25:35.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:25:36.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 05:25:36.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423368,ok=423368,error=0, records=41
[WARN ] 2026-06-01 05:25:37.902 [19587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:25:40.913 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21183/300s
[INFO ] 2026-06-01 05:25:50.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:25:51.051 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 05:25:51.051 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423369,ok=423369,error=0, records=41
[WARN ] 2026-06-01 05:25:52.908 [19594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:25:59.762 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21170/300s
[INFO ] 2026-06-01 05:26:05.086 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:26:06.056 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 05:26:06.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423370,ok=423370,error=0, records=41
[WARN ] 2026-06-01 05:26:07.914 [19586] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:26:20.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:26:21.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 05:26:21.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423371,ok=423371,error=0, records=41
[WARN ] 2026-06-01 05:26:22.919 [19639] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:26:31.893 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21179/300s
[INFO ] 2026-06-01 05:26:35.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:26:36.071 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 05:26:36.071 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423372,ok=423372,error=0, records=41
[WARN ] 2026-06-01 05:26:37.924 [19662] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:26:50.088 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:26:51.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 05:26:51.077 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423373,ok=423373,error=0, records=41
[WARN ] 2026-06-01 05:26:52.931 [19655] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:27:05.088 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:27:05.089 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21182/300s
[INFO ] 2026-06-01 05:27:06.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 05:27:06.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423374,ok=423374,error=0, records=41
[WARN ] 2026-06-01 05:27:07.938 [19696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:27:20.090 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:27:21.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 05:27:21.089 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423375,ok=423375,error=0, records=41
[WARN ] 2026-06-01 05:27:22.943 [19712] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:27:35.090 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:27:36.095 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 05:27:36.095 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423376,ok=423376,error=0, records=41
[WARN ] 2026-06-01 05:27:37.948 [19721] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:27:39.001 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21180/300s
[INFO ] 2026-06-01 05:27:39.603 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17636/300s
[INFO ] 2026-06-01 05:27:39.604 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872492},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:27:39.744 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:27:39.744 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 05:27:39.744 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:27:39.744 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:27:39.744 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:27:39.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:27:39.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:27:39.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:27:39.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:27:39.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:27:39.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:27:40.902 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21180/300s
[INFO ] 2026-06-01 05:27:48.507 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21180/300s
[INFO ] 2026-06-01 05:27:50.091 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:27:51.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 05:27:51.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423377,ok=423377,error=0, records=41
[WARN ] 2026-06-01 05:27:52.953 [19739] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:28:05.091 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:28:06.104 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 05:28:06.104 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423378,ok=423378,error=0, records=41
[WARN ] 2026-06-01 05:28:07.958 [19753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:28:20.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:28:21.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 05:28:21.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423379,ok=423379,error=0, records=41
[WARN ] 2026-06-01 05:28:22.963 [19753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:28:35.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:28:36.116 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 05:28:36.116 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423380,ok=423380,error=0, records=41
[WARN ] 2026-06-01 05:28:37.968 [19673] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:28:50.093 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:28:51.122 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 05:28:51.122 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423381,ok=423381,error=0, records=41
[WARN ] 2026-06-01 05:28:52.973 [19722] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:29:05.093 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:29:06.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 05:29:06.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423382,ok=423382,error=0, records=41
[WARN ] 2026-06-01 05:29:07.978 [19673] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:29:20.094 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:29:21.136 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 05:29:21.136 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423383,ok=423383,error=0, records=41
[WARN ] 2026-06-01 05:29:22.983 [19753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:29:35.094 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:29:36.142 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 05:29:36.142 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423384,ok=423384,error=0, records=41
[WARN ] 2026-06-01 05:29:37.987 [19753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:29:50.095 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:29:51.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 05:29:51.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423385,ok=423385,error=0, records=41
[WARN ] 2026-06-01 05:29:52.992 [19834] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:29:54.493 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21175/300s
[INFO ] 2026-06-01 05:30:00.755 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21184/300s
[INFO ] 2026-06-01 05:30:05.095 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:30:06.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 05:30:06.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423386,ok=423386,error=0, records=41
[INFO ] 2026-06-01 05:30:06.154 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21171/300s
[WARN ] 2026-06-01 05:30:07.998 [19780] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:30:20.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:30:21.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 05:30:21.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423387,ok=423387,error=0, records=41
[WARN ] 2026-06-01 05:30:23.002 [19869] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:30:35.097 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:30:36.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 05:30:36.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423388,ok=423388,error=0, records=41
[WARN ] 2026-06-01 05:30:38.007 [19834] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:30:39.746 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872408},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:30:39.919 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:30:39.919 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 05:30:39.919 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:30:39.919 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:30:39.919 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:30:39.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:30:39.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:30:39.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:30:39.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:30:39.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:30:39.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:30:40.919 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21184/300s
[INFO ] 2026-06-01 05:30:50.097 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:30:51.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 05:30:51.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423389,ok=423389,error=0, records=41
[WARN ] 2026-06-01 05:30:53.012 [19869] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:30:59.939 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21171/300s
[INFO ] 2026-06-01 05:31:05.098 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:31:06.174 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 05:31:06.174 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423390,ok=423390,error=0, records=41
[WARN ] 2026-06-01 05:31:08.017 [19883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:31:20.098 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:31:21.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-01 05:31:21.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423391,ok=423391,error=0, records=41
[WARN ] 2026-06-01 05:31:23.021 [19849] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:31:31.942 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21180/300s
[INFO ] 2026-06-01 05:31:35.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:31:36.185 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 05:31:36.185 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423392,ok=423392,error=0, records=41
[WARN ] 2026-06-01 05:31:38.026 [19834] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:31:50.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:31:51.191 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-01 05:31:51.191 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423393,ok=423393,error=0, records=41
[WARN ] 2026-06-01 05:31:53.032 [19834] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:32:05.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:32:05.100 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21183/300s
[INFO ] 2026-06-01 05:32:06.196 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 05:32:06.196 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423394,ok=423394,error=0, records=41
[WARN ] 2026-06-01 05:32:08.037 [19954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:32:20.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:32:21.202 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 05:32:21.202 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423395,ok=423395,error=0, records=41
[WARN ] 2026-06-01 05:32:23.041 [20007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:32:35.101 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:32:36.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 05:32:36.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423396,ok=423396,error=0, records=41
[WARN ] 2026-06-01 05:32:38.046 [20023] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:32:39.027 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21181/300s
[INFO ] 2026-06-01 05:32:40.929 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21181/300s
[INFO ] 2026-06-01 05:32:48.536 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21181/300s
[INFO ] 2026-06-01 05:32:50.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:32:51.213 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 05:32:51.213 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423397,ok=423397,error=0, records=41
[WARN ] 2026-06-01 05:32:53.050 [20031] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:33:05.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:33:06.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 05:33:06.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423398,ok=423398,error=0, records=41
[WARN ] 2026-06-01 05:33:07.555 [20059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:33:20.103 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:33:21.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 05:33:21.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423399,ok=423399,error=0, records=41
[WARN ] 2026-06-01 05:33:22.560 [20082] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:33:35.104 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 05:33:35.104 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 05:33:36.230 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 05:33:36.230 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423400,ok=423400,error=0, records=41
[WARN ] 2026-06-01 05:33:37.565 [20098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:33:39.919 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17637/300s
[INFO ] 2026-06-01 05:33:39.920 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872320},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:33:40.072 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:33:40.072 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 05:33:40.072 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:33:40.072 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:33:40.072 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:33:40.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:33:40.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:33:40.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:33:40.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:33:40.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:33:40.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:33:50.104 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:33:51.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 05:33:51.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423401,ok=423401,error=0, records=41
[WARN ] 2026-06-01 05:33:52.569 [20059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:34:05.105 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:34:06.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 05:34:06.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423402,ok=423402,error=0, records=41
[WARN ] 2026-06-01 05:34:07.573 [20124] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:34:20.105 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:34:21.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 05:34:21.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423403,ok=423403,error=0, records=41
[WARN ] 2026-06-01 05:34:22.577 [20148] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:34:35.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:34:36.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 05:34:36.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423404,ok=423404,error=0, records=41
[WARN ] 2026-06-01 05:34:37.581 [20160] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:34:50.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:34:51.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 05:34:51.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423405,ok=423405,error=0, records=41
[WARN ] 2026-06-01 05:34:52.585 [20173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:34:54.585 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21176/300s
[INFO ] 2026-06-01 05:35:00.758 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21185/300s
[INFO ] 2026-06-01 05:35:05.107 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:35:06.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10378, records=41
[INFO ] 2026-06-01 05:35:06.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423406,ok=423406,error=0, records=41
[INFO ] 2026-06-01 05:35:06.264 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21172/300s
[WARN ] 2026-06-01 05:35:07.589 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:35:20.107 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:35:21.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-01 05:35:21.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423407,ok=423407,error=0, records=41
[WARN ] 2026-06-01 05:35:22.593 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:35:35.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:35:36.282 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10381, records=41
[INFO ] 2026-06-01 05:35:36.282 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423408,ok=423408,error=0, records=41
[WARN ] 2026-06-01 05:35:37.598 [20225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:35:40.925 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21185/300s
[INFO ] 2026-06-01 05:35:50.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:35:51.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11571, records=45
[INFO ] 2026-06-01 05:35:51.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423409,ok=423409,error=0, records=45
[WARN ] 2026-06-01 05:35:52.603 [20225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:36:00.114 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21172/300s
[INFO ] 2026-06-01 05:36:05.109 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:36:06.294 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 05:36:06.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423410,ok=423410,error=0, records=41
[WARN ] 2026-06-01 05:36:07.608 [20236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:36:20.109 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:36:21.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 05:36:21.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423411,ok=423411,error=0, records=41
[WARN ] 2026-06-01 05:36:22.613 [20183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:36:31.990 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21181/300s
[INFO ] 2026-06-01 05:36:35.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:36:36.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-01 05:36:36.309 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423412,ok=423412,error=0, records=41
[WARN ] 2026-06-01 05:36:37.618 [20236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:36:40.074 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872256},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:36:40.237 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:36:40.237 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 05:36:40.237 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:36:40.237 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:36:40.237 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:36:40.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:36:40.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:36:40.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:36:40.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:36:40.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:36:40.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:36:50.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:36:51.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 05:36:51.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423413,ok=423413,error=0, records=41
[WARN ] 2026-06-01 05:36:52.623 [20236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:37:05.111 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:37:05.111 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21184/300s
[INFO ] 2026-06-01 05:37:06.320 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10400, records=41
[INFO ] 2026-06-01 05:37:06.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423414,ok=423414,error=0, records=41
[WARN ] 2026-06-01 05:37:07.628 [20236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:37:20.112 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:37:21.326 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 05:37:21.326 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423415,ok=423415,error=0, records=41
[WARN ] 2026-06-01 05:37:22.633 [20236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:37:35.112 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:37:36.331 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-01 05:37:36.331 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423416,ok=423416,error=0, records=41
[WARN ] 2026-06-01 05:37:37.638 [20183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:37:39.059 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21182/300s
[INFO ] 2026-06-01 05:37:40.960 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21182/300s
[INFO ] 2026-06-01 05:37:48.565 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21182/300s
[INFO ] 2026-06-01 05:37:50.113 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:37:51.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-01 05:37:51.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423417,ok=423417,error=0, records=41
[WARN ] 2026-06-01 05:37:52.643 [20225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:38:05.113 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:38:06.340 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 05:38:06.340 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423418,ok=423418,error=0, records=41
[WARN ] 2026-06-01 05:38:07.649 [20225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:38:20.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:38:21.346 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 05:38:21.346 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423419,ok=423419,error=0, records=41
[WARN ] 2026-06-01 05:38:22.655 [20225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:38:35.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:38:36.351 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 05:38:36.351 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423420,ok=423420,error=0, records=41
[WARN ] 2026-06-01 05:38:37.659 [20190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:38:50.115 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:38:50.115 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 05:38:51.356 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 05:38:51.356 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423421,ok=423421,error=0, records=41
[WARN ] 2026-06-01 05:38:52.664 [20183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:39:05.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:39:06.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10396, records=41
[INFO ] 2026-06-01 05:39:06.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423422,ok=423422,error=0, records=41
[WARN ] 2026-06-01 05:39:07.668 [20190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:39:20.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:39:21.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-01 05:39:21.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423423,ok=423423,error=0, records=41
[WARN ] 2026-06-01 05:39:22.673 [20236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:39:35.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:39:36.379 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-01 05:39:36.379 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423424,ok=423424,error=0, records=41
[WARN ] 2026-06-01 05:39:37.679 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:39:40.237 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17638/300s
[INFO ] 2026-06-01 05:39:40.239 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872188},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:39:40.399 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:39:40.399 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 05:39:40.399 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:39:40.399 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:39:40.399 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:39:40.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:39:40.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:39:40.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:39:40.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:39:40.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:39:40.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:39:50.119 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:39:51.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-01 05:39:51.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423425,ok=423425,error=0, records=41
[WARN ] 2026-06-01 05:39:52.685 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:39:54.685 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21177/300s
[INFO ] 2026-06-01 05:40:00.761 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21186/300s
[INFO ] 2026-06-01 05:40:05.119 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:40:06.466 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 05:40:06.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423426,ok=423426,error=0, records=41
[INFO ] 2026-06-01 05:40:06.466 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21173/300s
[WARN ] 2026-06-01 05:40:07.691 [20190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:40:20.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:40:21.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 05:40:21.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423427,ok=423427,error=0, records=41
[WARN ] 2026-06-01 05:40:22.696 [20190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:40:35.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:40:36.477 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 05:40:36.477 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423428,ok=423428,error=0, records=41
[WARN ] 2026-06-01 05:40:37.700 [20183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:40:40.932 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21186/300s
[INFO ] 2026-06-01 05:40:50.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:40:51.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 05:40:51.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423429,ok=423429,error=0, records=41
[WARN ] 2026-06-01 05:40:52.706 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:41:00.292 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21173/300s
[INFO ] 2026-06-01 05:41:05.122 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:41:06.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 05:41:06.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423430,ok=423430,error=0, records=41
[WARN ] 2026-06-01 05:41:07.712 [20183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:41:20.122 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:41:21.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 05:41:21.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423431,ok=423431,error=0, records=41
[WARN ] 2026-06-01 05:41:22.718 [20190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:41:32.044 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21182/300s
[INFO ] 2026-06-01 05:41:35.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:41:36.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 05:41:36.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423432,ok=423432,error=0, records=41
[WARN ] 2026-06-01 05:41:37.723 [20183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:41:50.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:41:51.505 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 05:41:51.505 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423433,ok=423433,error=0, records=41
[WARN ] 2026-06-01 05:41:52.729 [20183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:42:05.124 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:42:05.124 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21185/300s
[INFO ] 2026-06-01 05:42:06.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 05:42:06.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423434,ok=423434,error=0, records=41
[WARN ] 2026-06-01 05:42:07.735 [20225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:42:20.124 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:42:21.517 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 05:42:21.517 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423435,ok=423435,error=0, records=41
[WARN ] 2026-06-01 05:42:22.740 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:42:35.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:42:36.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-01 05:42:36.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423436,ok=423436,error=0, records=41
[WARN ] 2026-06-01 05:42:37.745 [20190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:42:39.096 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21183/300s
[INFO ] 2026-06-01 05:42:40.401 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872072},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:42:40.577 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:42:40.577 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 05:42:40.577 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:42:40.577 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:42:40.577 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:42:40.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:42:40.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:42:40.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:42:40.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:42:40.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:42:40.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:42:40.998 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21183/300s
[INFO ] 2026-06-01 05:42:48.600 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21183/300s
[INFO ] 2026-06-01 05:42:50.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:42:51.528 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 05:42:51.528 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423437,ok=423437,error=0, records=41
[WARN ] 2026-06-01 05:42:52.749 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:43:05.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:43:06.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 05:43:06.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423438,ok=423438,error=0, records=41
[WARN ] 2026-06-01 05:43:07.754 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:43:20.127 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:43:21.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 05:43:21.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423439,ok=423439,error=0, records=41
[WARN ] 2026-06-01 05:43:22.758 [20236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:43:35.127 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 05:43:35.127 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 05:43:36.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 05:43:36.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423440,ok=423440,error=0, records=41
[WARN ] 2026-06-01 05:43:37.763 [20183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:43:50.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:43:51.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 05:43:51.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423441,ok=423441,error=0, records=41
[WARN ] 2026-06-01 05:43:52.768 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:44:05.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:44:06.558 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 05:44:06.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423442,ok=423442,error=0, records=41
[WARN ] 2026-06-01 05:44:07.774 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:44:20.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:44:21.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 05:44:21.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423443,ok=423443,error=0, records=41
[WARN ] 2026-06-01 05:44:22.780 [20225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:44:35.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:44:36.568 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 05:44:36.568 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423444,ok=423444,error=0, records=41
[WARN ] 2026-06-01 05:44:37.786 [20225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:44:50.130 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:44:51.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 05:44:51.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423445,ok=423445,error=0, records=41
[WARN ] 2026-06-01 05:44:52.791 [20190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:44:54.791 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21178/300s
[INFO ] 2026-06-01 05:45:00.764 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21187/300s
[INFO ] 2026-06-01 05:45:05.130 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:45:06.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 05:45:06.586 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423446,ok=423446,error=0, records=41
[INFO ] 2026-06-01 05:45:06.586 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21174/300s
[WARN ] 2026-06-01 05:45:07.796 [20183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:45:20.131 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:45:21.592 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 05:45:21.592 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423447,ok=423447,error=0, records=41
[WARN ] 2026-06-01 05:45:22.801 [20236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:45:35.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:45:36.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10354, records=41
[INFO ] 2026-06-01 05:45:36.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423448,ok=423448,error=0, records=41
[WARN ] 2026-06-01 05:45:37.807 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:45:40.577 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17639/300s
[INFO ] 2026-06-01 05:45:40.579 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20872004},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:45:40.743 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:45:40.743 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 05:45:40.744 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:45:40.744 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:45:40.744 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:45:40.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:45:40.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:45:40.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:45:40.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:45:40.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:45:40.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:45:40.938 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21187/300s
[INFO ] 2026-06-01 05:45:50.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:45:51.609 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10403, records=41
[INFO ] 2026-06-01 05:45:51.609 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423449,ok=423449,error=0, records=41
[WARN ] 2026-06-01 05:45:52.812 [20172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:46:00.469 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21174/300s
[INFO ] 2026-06-01 05:46:05.133 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:46:06.614 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 05:46:06.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423450,ok=423450,error=0, records=41
[WARN ] 2026-06-01 05:46:07.817 [20225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:46:20.133 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:46:21.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 05:46:21.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423451,ok=423451,error=0, records=41
[WARN ] 2026-06-01 05:46:22.823 [20225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:46:32.090 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21183/300s
[INFO ] 2026-06-01 05:46:35.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:46:36.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 05:46:36.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423452,ok=423452,error=0, records=41
[WARN ] 2026-06-01 05:46:37.828 [20925] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:46:50.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=28.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:46:51.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 05:46:51.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423453,ok=423453,error=0, records=41
[WARN ] 2026-06-01 05:46:52.834 [20225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:47:05.135 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:47:05.135 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21186/300s
[INFO ] 2026-06-01 05:47:06.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 05:47:06.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423454,ok=423454,error=0, records=41
[WARN ] 2026-06-01 05:47:07.839 [20236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:47:20.135 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:47:21.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 05:47:21.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423455,ok=423455,error=0, records=41
[WARN ] 2026-06-01 05:47:22.845 [20905] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:47:35.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:47:36.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-01 05:47:36.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423456,ok=423456,error=0, records=41
[WARN ] 2026-06-01 05:47:37.851 [20925] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:47:39.102 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21184/300s
[INFO ] 2026-06-01 05:47:41.004 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21184/300s
[INFO ] 2026-06-01 05:47:48.606 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21184/300s
[INFO ] 2026-06-01 05:47:50.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:47:51.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-01 05:47:51.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423457,ok=423457,error=0, records=41
[WARN ] 2026-06-01 05:47:52.856 [20986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:48:05.137 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:48:06.707 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 05:48:06.707 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423458,ok=423458,error=0, records=41
[WARN ] 2026-06-01 05:48:07.862 [21015] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:48:20.138 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:48:21.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 05:48:21.712 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423459,ok=423459,error=0, records=41
[WARN ] 2026-06-01 05:48:22.867 [20236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:48:35.138 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:48:36.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 05:48:36.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423460,ok=423460,error=0, records=41
[WARN ] 2026-06-01 05:48:37.872 [21043] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:48:40.745 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871932},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:48:40.887 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:48:40.887 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 05:48:40.887 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:48:40.887 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:48:40.887 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:48:40.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:48:40.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:48:40.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:48:40.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:48:40.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:48:40.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:48:50.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:48:51.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 05:48:51.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423461,ok=423461,error=0, records=41
[WARN ] 2026-06-01 05:48:52.878 [20905] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:49:05.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:49:06.801 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 05:49:06.801 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423462,ok=423462,error=0, records=41
[WARN ] 2026-06-01 05:49:07.884 [21058] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:49:20.140 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:49:21.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 05:49:21.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423463,ok=423463,error=0, records=41
[WARN ] 2026-06-01 05:49:22.889 [21043] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:49:35.140 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:49:36.811 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 05:49:36.811 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423464,ok=423464,error=0, records=41
[WARN ] 2026-06-01 05:49:37.893 [20905] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:49:50.141 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:49:51.816 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 05:49:51.816 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423465,ok=423465,error=0, records=41
[WARN ] 2026-06-01 05:49:52.897 [21124] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:49:54.898 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21179/300s
[INFO ] 2026-06-01 05:50:00.767 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21188/300s
[INFO ] 2026-06-01 05:50:05.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:50:06.823 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 05:50:06.823 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423466,ok=423466,error=0, records=41
[INFO ] 2026-06-01 05:50:06.823 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21175/300s
[WARN ] 2026-06-01 05:50:07.903 [21145] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:50:20.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:50:21.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 05:50:21.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423467,ok=423467,error=0, records=41
[WARN ] 2026-06-01 05:50:22.908 [21162] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:50:35.143 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:50:36.832 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 05:50:36.832 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423468,ok=423468,error=0, records=41
[WARN ] 2026-06-01 05:50:37.913 [21185] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:50:40.943 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21188/300s
[INFO ] 2026-06-01 05:50:50.144 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:50:51.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 05:50:51.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423469,ok=423469,error=0, records=41
[WARN ] 2026-06-01 05:50:52.918 [21185] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:51:00.643 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21175/300s
[INFO ] 2026-06-01 05:51:05.144 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:51:06.848 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10385, records=41
[INFO ] 2026-06-01 05:51:06.848 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423470,ok=423470,error=0, records=41
[WARN ] 2026-06-01 05:51:07.924 [21219] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:51:20.145 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:51:21.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10383, records=41
[INFO ] 2026-06-01 05:51:21.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423471,ok=423471,error=0, records=41
[WARN ] 2026-06-01 05:51:22.929 [21236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:51:32.140 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21184/300s
[INFO ] 2026-06-01 05:51:35.145 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:51:36.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 05:51:36.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423472,ok=423472,error=0, records=41
[WARN ] 2026-06-01 05:51:37.935 [21241] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:51:40.887 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17640/300s
[INFO ] 2026-06-01 05:51:40.889 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871852},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:51:41.072 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:51:41.072 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 05:51:41.072 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:51:41.072 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:51:41.072 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:51:41.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:51:41.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:51:41.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:51:41.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:51:41.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:51:41.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:51:50.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:51:51.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10408, records=41
[INFO ] 2026-06-01 05:51:51.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423473,ok=423473,error=0, records=41
[WARN ] 2026-06-01 05:51:52.941 [21264] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:52:05.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:52:05.147 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21187/300s
[INFO ] 2026-06-01 05:52:06.870 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 05:52:06.870 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423474,ok=423474,error=0, records=41
[WARN ] 2026-06-01 05:52:07.947 [21258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:52:20.147 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:52:21.874 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 05:52:21.874 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423475,ok=423475,error=0, records=41
[WARN ] 2026-06-01 05:52:22.952 [21247] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:52:35.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:52:36.879 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 05:52:36.879 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423476,ok=423476,error=0, records=41
[WARN ] 2026-06-01 05:52:37.956 [21275] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:52:39.134 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21185/300s
[INFO ] 2026-06-01 05:52:41.036 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21185/300s
[INFO ] 2026-06-01 05:52:48.643 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21185/300s
[INFO ] 2026-06-01 05:52:50.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:52:51.885 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 05:52:51.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423477,ok=423477,error=0, records=41
[WARN ] 2026-06-01 05:52:52.961 [21323] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:53:05.149 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:53:06.893 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10389, records=41
[INFO ] 2026-06-01 05:53:06.893 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423478,ok=423478,error=0, records=41
[WARN ] 2026-06-01 05:53:07.967 [21275] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:53:20.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:53:21.899 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-01 05:53:21.899 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423479,ok=423479,error=0, records=41
[WARN ] 2026-06-01 05:53:22.973 [21241] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:53:35.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 05:53:35.150 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 05:53:36.906 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10382, records=41
[INFO ] 2026-06-01 05:53:36.906 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423480,ok=423480,error=0, records=41
[WARN ] 2026-06-01 05:53:37.978 [21241] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:53:50.151 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:53:50.151 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 05:53:51.912 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-01 05:53:51.912 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423481,ok=423481,error=0, records=41
[WARN ] 2026-06-01 05:53:52.983 [21247] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:54:05.152 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:54:06.918 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 05:54:06.918 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423482,ok=423482,error=0, records=41
[WARN ] 2026-06-01 05:54:07.989 [21393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:54:20.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:54:21.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 05:54:21.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423483,ok=423483,error=0, records=41
[WARN ] 2026-06-01 05:54:22.993 [21393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:54:35.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:54:36.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-01 05:54:36.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423484,ok=423484,error=0, records=41
[WARN ] 2026-06-01 05:54:37.998 [21247] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:54:41.074 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871780},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:54:41.238 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:54:41.238 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 05:54:41.239 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:54:41.239 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:54:41.239 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:54:41.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:54:41.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:54:41.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:54:41.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:54:41.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:54:41.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:54:50.154 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:54:51.939 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 05:54:51.939 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423485,ok=423485,error=0, records=41
[WARN ] 2026-06-01 05:54:53.003 [21241] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:54:55.003 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21180/300s
[INFO ] 2026-06-01 05:55:00.770 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21189/300s
[INFO ] 2026-06-01 05:55:05.154 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:55:06.947 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 05:55:06.947 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423486,ok=423486,error=0, records=41
[INFO ] 2026-06-01 05:55:06.947 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21176/300s
[WARN ] 2026-06-01 05:55:08.008 [21393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:55:20.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:55:21.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 05:55:21.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423487,ok=423487,error=0, records=41
[WARN ] 2026-06-01 05:55:23.012 [21449] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:55:35.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:55:36.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 05:55:36.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423488,ok=423488,error=0, records=41
[WARN ] 2026-06-01 05:55:38.016 [21351] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:55:40.950 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21189/300s
[INFO ] 2026-06-01 05:55:50.156 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:55:51.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 05:55:51.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423489,ok=423489,error=0, records=41
[WARN ] 2026-06-01 05:55:53.021 [21351] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:56:00.823 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21176/300s
[INFO ] 2026-06-01 05:56:05.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:56:06.967 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 05:56:06.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423490,ok=423490,error=0, records=41
[WARN ] 2026-06-01 05:56:08.026 [21247] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:56:20.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:56:21.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 05:56:21.974 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423491,ok=423491,error=0, records=41
[WARN ] 2026-06-01 05:56:23.031 [21492] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:56:32.190 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21185/300s
[INFO ] 2026-06-01 05:56:35.158 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:56:36.980 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 05:56:36.980 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423492,ok=423492,error=0, records=41
[WARN ] 2026-06-01 05:56:38.035 [21507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:56:50.158 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:56:51.986 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 05:56:51.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423493,ok=423493,error=0, records=41
[WARN ] 2026-06-01 05:56:53.039 [21559] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:57:05.159 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:57:05.159 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21188/300s
[INFO ] 2026-06-01 05:57:06.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 05:57:06.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423494,ok=423494,error=0, records=41
[WARN ] 2026-06-01 05:57:08.044 [21542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:57:20.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:57:22.058 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 05:57:22.058 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423495,ok=423495,error=0, records=41
[WARN ] 2026-06-01 05:57:23.049 [21592] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:57:35.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:57:37.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 05:57:37.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423496,ok=423496,error=0, records=41
[WARN ] 2026-06-01 05:57:37.556 [21592] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:57:39.180 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21186/300s
[INFO ] 2026-06-01 05:57:41.081 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21186/300s
[INFO ] 2026-06-01 05:57:41.239 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17641/300s
[INFO ] 2026-06-01 05:57:41.240 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871696},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 05:57:41.393 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 05:57:41.393 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 05:57:41.393 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 05:57:41.393 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 05:57:41.393 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 05:57:41.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:57:41.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 05:57:41.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:57:41.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 05:57:41.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:57:41.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 05:57:48.687 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21186/300s
[INFO ] 2026-06-01 05:57:50.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:57:52.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 05:57:52.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423497,ok=423497,error=0, records=41
[WARN ] 2026-06-01 05:57:52.561 [21619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:58:05.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:58:07.084 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 05:58:07.084 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423498,ok=423498,error=0, records=41
[WARN ] 2026-06-01 05:58:07.565 [21614] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:58:20.162 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:58:22.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 05:58:22.089 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423499,ok=423499,error=0, records=41
[WARN ] 2026-06-01 05:58:22.569 [21664] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:58:35.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:58:37.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 05:58:37.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423500,ok=423500,error=0, records=41
[WARN ] 2026-06-01 05:58:37.574 [21664] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:58:50.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:58:52.106 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 05:58:52.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423501,ok=423501,error=0, records=41
[WARN ] 2026-06-01 05:58:52.580 [21698] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:59:05.164 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:59:07.112 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-01 05:59:07.112 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423502,ok=423502,error=0, records=41
[WARN ] 2026-06-01 05:59:07.585 [21664] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:59:20.164 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:59:22.123 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 05:59:22.123 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423503,ok=423503,error=0, records=41
[WARN ] 2026-06-01 05:59:22.590 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:59:35.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:59:37.129 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-01 05:59:37.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423504,ok=423504,error=0, records=41
[WARN ] 2026-06-01 05:59:37.595 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:59:50.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 05:59:52.134 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 05:59:52.134 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423505,ok=423505,error=0, records=41
[WARN ] 2026-06-01 05:59:52.600 [21755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 05:59:55.100 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21181/300s
[INFO ] 2026-06-01 06:00:00.773 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21190/300s
[INFO ] 2026-06-01 06:00:05.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:00:07.140 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-01 06:00:07.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423506,ok=423506,error=0, records=41
[INFO ] 2026-06-01 06:00:07.140 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21177/300s
[WARN ] 2026-06-01 06:00:07.606 [21755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:00:20.167 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:00:22.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 06:00:22.144 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423507,ok=423507,error=0, records=41
[WARN ] 2026-06-01 06:00:22.611 [21755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:00:35.167 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:00:37.149 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 06:00:37.149 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423508,ok=423508,error=0, records=41
[WARN ] 2026-06-01 06:00:37.617 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:00:40.956 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21190/300s
[INFO ] 2026-06-01 06:00:41.394 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871616},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:00:41.567 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:00:41.567 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 06:00:41.568 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:00:41.568 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:00:41.568 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:00:41.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:00:41.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:00:41.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:00:41.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:00:41.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:00:41.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:00:50.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:00:52.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 06:00:52.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423509,ok=423509,error=0, records=41
[WARN ] 2026-06-01 06:00:52.622 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:01:00.996 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21177/300s
[INFO ] 2026-06-01 06:01:05.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:01:07.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10403, records=41
[INFO ] 2026-06-01 06:01:07.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423510,ok=423510,error=0, records=41
[WARN ] 2026-06-01 06:01:07.628 [21741] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:01:20.169 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:01:22.174 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 06:01:22.174 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423511,ok=423511,error=0, records=41
[WARN ] 2026-06-01 06:01:22.633 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:01:32.243 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21186/300s
[INFO ] 2026-06-01 06:01:35.169 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:01:37.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 06:01:37.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423512,ok=423512,error=0, records=41
[WARN ] 2026-06-01 06:01:37.638 [21755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:01:50.170 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:01:52.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10385, records=41
[INFO ] 2026-06-01 06:01:52.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423513,ok=423513,error=0, records=41
[WARN ] 2026-06-01 06:01:52.644 [21755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:02:05.170 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:02:05.170 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21189/300s
[INFO ] 2026-06-01 06:02:07.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 06:02:07.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423514,ok=423514,error=0, records=41
[WARN ] 2026-06-01 06:02:07.650 [21755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:02:20.171 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:02:22.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 06:02:22.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423515,ok=423515,error=0, records=41
[WARN ] 2026-06-01 06:02:22.655 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:02:35.171 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:02:37.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 06:02:37.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423516,ok=423516,error=0, records=41
[WARN ] 2026-06-01 06:02:37.660 [21725] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:02:39.211 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21187/300s
[INFO ] 2026-06-01 06:02:41.112 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21187/300s
[INFO ] 2026-06-01 06:02:48.720 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21187/300s
[INFO ] 2026-06-01 06:02:50.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:02:52.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 06:02:52.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423517,ok=423517,error=0, records=41
[WARN ] 2026-06-01 06:02:52.666 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:03:05.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:03:07.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-01 06:03:07.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423518,ok=423518,error=0, records=41
[WARN ] 2026-06-01 06:03:07.671 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:03:20.173 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:03:22.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 06:03:22.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423519,ok=423519,error=0, records=41
[WARN ] 2026-06-01 06:03:22.676 [21725] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:03:35.174 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 06:03:35.174 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 06:03:37.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 06:03:37.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423520,ok=423520,error=0, records=41
[WARN ] 2026-06-01 06:03:37.680 [21755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:03:41.568 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17642/300s
[INFO ] 2026-06-01 06:03:41.569 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871536},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:03:41.728 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:03:41.728 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 06:03:41.728 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:03:41.728 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:03:41.728 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:03:41.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:03:41.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:03:41.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:03:41.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:03:41.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:03:41.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:03:50.174 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:03:52.294 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 06:03:52.294 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423521,ok=423521,error=0, records=41
[WARN ] 2026-06-01 06:03:52.686 [21732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:04:05.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:04:07.301 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 06:04:07.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423522,ok=423522,error=0, records=41
[WARN ] 2026-06-01 06:04:07.691 [21725] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:04:20.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:04:22.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 06:04:22.306 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423523,ok=423523,error=0, records=41
[WARN ] 2026-06-01 06:04:22.698 [21732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:04:35.176 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:04:37.313 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 06:04:37.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423524,ok=423524,error=0, records=41
[WARN ] 2026-06-01 06:04:37.703 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:04:50.177 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:04:52.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 06:04:52.326 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423525,ok=423525,error=0, records=41
[WARN ] 2026-06-01 06:04:52.707 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:04:55.208 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21182/300s
[INFO ] 2026-06-01 06:05:00.776 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21191/300s
[INFO ] 2026-06-01 06:05:05.177 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:05:07.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-01 06:05:07.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423526,ok=423526,error=0, records=41
[INFO ] 2026-06-01 06:05:07.335 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21178/300s
[WARN ] 2026-06-01 06:05:07.713 [21732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:05:20.178 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:05:22.340 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-06-01 06:05:22.340 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423527,ok=423527,error=0, records=41
[WARN ] 2026-06-01 06:05:22.718 [21741] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:05:35.178 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:05:37.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-01 06:05:37.368 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423528,ok=423528,error=0, records=41
[WARN ] 2026-06-01 06:05:37.724 [21741] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:05:40.962 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21191/300s
[INFO ] 2026-06-01 06:05:50.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:05:52.374 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-01 06:05:52.374 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423529,ok=423529,error=0, records=41
[WARN ] 2026-06-01 06:05:52.730 [21732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:06:01.175 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21178/300s
[INFO ] 2026-06-01 06:06:05.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:06:07.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 06:06:07.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423530,ok=423530,error=0, records=41
[WARN ] 2026-06-01 06:06:07.736 [21755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:06:20.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:06:22.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 06:06:22.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423531,ok=423531,error=0, records=41
[WARN ] 2026-06-01 06:06:22.741 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:06:32.292 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21187/300s
[INFO ] 2026-06-01 06:06:35.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:06:37.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 06:06:37.390 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423532,ok=423532,error=0, records=41
[WARN ] 2026-06-01 06:06:37.746 [21755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:06:41.729 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871456},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:06:41.891 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:06:41.891 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 06:06:41.892 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:06:41.892 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:06:41.892 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:06:41.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:06:41.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:06:41.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:06:41.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:06:41.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:06:41.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:06:50.181 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:06:52.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 06:06:52.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423533,ok=423533,error=0, records=41
[WARN ] 2026-06-01 06:06:52.751 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:07:05.181 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:07:05.182 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21190/300s
[INFO ] 2026-06-01 06:07:07.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 06:07:07.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423534,ok=423534,error=0, records=41
[WARN ] 2026-06-01 06:07:07.757 [21732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:07:20.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:07:22.404 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 06:07:22.404 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423535,ok=423535,error=0, records=41
[WARN ] 2026-06-01 06:07:22.762 [21725] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:07:35.183 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:07:37.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 06:07:37.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423536,ok=423536,error=0, records=41
[WARN ] 2026-06-01 06:07:37.767 [21732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:07:39.235 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21188/300s
[INFO ] 2026-06-01 06:07:41.136 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21188/300s
[INFO ] 2026-06-01 06:07:48.740 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21188/300s
[INFO ] 2026-06-01 06:07:50.183 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:07:52.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-01 06:07:52.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423537,ok=423537,error=0, records=41
[WARN ] 2026-06-01 06:07:52.773 [21725] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:08:05.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:08:07.416 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-01 06:08:07.416 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423538,ok=423538,error=0, records=41
[WARN ] 2026-06-01 06:08:07.778 [21731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:08:20.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:08:22.421 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 06:08:22.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423539,ok=423539,error=0, records=41
[WARN ] 2026-06-01 06:08:22.784 [21755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:08:35.185 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:08:37.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 06:08:37.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423540,ok=423540,error=0, records=41
[WARN ] 2026-06-01 06:08:37.789 [21755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:08:50.185 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:08:50.185 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 06:08:52.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 06:08:52.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423541,ok=423541,error=0, records=41
[WARN ] 2026-06-01 06:08:52.794 [21732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:09:05.186 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:09:07.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10385, records=41
[INFO ] 2026-06-01 06:09:07.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423542,ok=423542,error=0, records=41
[WARN ] 2026-06-01 06:09:07.799 [21725] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:09:20.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:09:22.445 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 06:09:22.445 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423543,ok=423543,error=0, records=41
[WARN ] 2026-06-01 06:09:22.805 [22295] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:09:35.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:09:37.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 06:09:37.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423544,ok=423544,error=0, records=41
[WARN ] 2026-06-01 06:09:37.810 [22301] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:09:41.892 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17643/300s
[INFO ] 2026-06-01 06:09:41.893 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871380},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:09:42.074 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:09:42.075 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 06:09:42.075 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:09:42.075 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:09:42.075 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:09:42.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:09:42.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:09:42.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:09:42.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:09:42.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:09:42.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:09:50.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:09:52.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 06:09:52.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423545,ok=423545,error=0, records=41
[WARN ] 2026-06-01 06:09:52.816 [21725] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:09:55.316 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21183/300s
[INFO ] 2026-06-01 06:10:00.778 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21192/300s
[INFO ] 2026-06-01 06:10:05.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:10:07.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 06:10:07.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423546,ok=423546,error=0, records=41
[INFO ] 2026-06-01 06:10:07.460 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21179/300s
[WARN ] 2026-06-01 06:10:07.822 [22316] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:10:20.189 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:10:22.467 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 06:10:22.467 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423547,ok=423547,error=0, records=41
[WARN ] 2026-06-01 06:10:22.827 [22332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:10:35.190 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:10:37.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 06:10:37.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423548,ok=423548,error=0, records=41
[WARN ] 2026-06-01 06:10:37.832 [22381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:10:40.967 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21192/300s
[INFO ] 2026-06-01 06:10:50.190 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:10:52.477 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 06:10:52.477 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423549,ok=423549,error=0, records=41
[WARN ] 2026-06-01 06:10:52.837 [21725] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:11:01.345 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21179/300s
[INFO ] 2026-06-01 06:11:05.191 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:11:07.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 06:11:07.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423550,ok=423550,error=0, records=41
[WARN ] 2026-06-01 06:11:07.842 [22395] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:11:20.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:11:22.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 06:11:22.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423551,ok=423551,error=0, records=41
[WARN ] 2026-06-01 06:11:22.846 [22395] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:11:32.343 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21188/300s
[INFO ] 2026-06-01 06:11:35.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:11:37.570 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 06:11:37.570 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423552,ok=423552,error=0, records=41
[WARN ] 2026-06-01 06:11:37.851 [22418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:11:50.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:11:52.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 06:11:52.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423553,ok=423553,error=0, records=41
[WARN ] 2026-06-01 06:11:52.856 [22446] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:12:05.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:12:05.193 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21191/300s
[INFO ] 2026-06-01 06:12:07.641 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 06:12:07.641 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423554,ok=423554,error=0, records=41
[WARN ] 2026-06-01 06:12:07.861 [22461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:12:20.194 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:12:22.647 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 06:12:22.648 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423555,ok=423555,error=0, records=41
[WARN ] 2026-06-01 06:12:22.866 [22461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:12:35.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:12:37.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 06:12:37.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423556,ok=423556,error=0, records=41
[WARN ] 2026-06-01 06:12:37.871 [22367] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:12:39.252 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21189/300s
[INFO ] 2026-06-01 06:12:41.154 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21189/300s
[INFO ] 2026-06-01 06:12:42.076 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871292},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:12:42.243 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:12:42.243 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 06:12:42.243 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:12:42.243 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:12:42.243 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:12:42.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:12:42.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:12:42.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:12:42.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:12:42.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:12:42.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:12:48.761 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21189/300s
[INFO ] 2026-06-01 06:12:50.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:12:52.659 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 06:12:52.659 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423557,ok=423557,error=0, records=41
[WARN ] 2026-06-01 06:12:52.876 [22503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:13:05.196 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:13:07.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 06:13:07.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423558,ok=423558,error=0, records=41
[WARN ] 2026-06-01 06:13:07.881 [22519] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:13:20.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:13:22.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 06:13:22.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423559,ok=423559,error=0, records=41
[WARN ] 2026-06-01 06:13:22.885 [22525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:13:35.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 06:13:35.197 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 06:13:37.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 06:13:37.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423560,ok=423560,error=0, records=41
[WARN ] 2026-06-01 06:13:37.892 [22553] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:13:50.198 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:13:52.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 06:13:52.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423561,ok=423561,error=0, records=41
[WARN ] 2026-06-01 06:13:52.897 [22553] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:14:05.199 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:14:07.686 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 06:14:07.686 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423562,ok=423562,error=0, records=41
[WARN ] 2026-06-01 06:14:07.902 [22580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:14:20.199 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:14:22.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-01 06:14:22.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423563,ok=423563,error=0, records=41
[WARN ] 2026-06-01 06:14:22.907 [22574] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:14:35.200 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:14:37.698 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 06:14:37.698 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423564,ok=423564,error=0, records=41
[WARN ] 2026-06-01 06:14:37.913 [22602] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:14:50.200 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:14:52.706 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-01 06:14:52.706 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423565,ok=423565,error=0, records=41
[WARN ] 2026-06-01 06:14:52.919 [22614] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:14:55.419 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21184/300s
[INFO ] 2026-06-01 06:15:00.781 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21193/300s
[INFO ] 2026-06-01 06:15:05.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:15:07.712 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 06:15:07.712 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423566,ok=423566,error=0, records=41
[INFO ] 2026-06-01 06:15:07.712 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21180/300s
[WARN ] 2026-06-01 06:15:07.926 [22657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:15:20.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:15:22.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 06:15:22.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423567,ok=423567,error=0, records=41
[WARN ] 2026-06-01 06:15:22.931 [22651] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:15:35.202 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:15:37.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 06:15:37.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423568,ok=423568,error=0, records=41
[WARN ] 2026-06-01 06:15:37.936 [22580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:15:40.973 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21193/300s
[INFO ] 2026-06-01 06:15:42.244 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17644/300s
[INFO ] 2026-06-01 06:15:42.245 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871220},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:15:42.433 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:15:42.433 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 06:15:42.434 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:15:42.434 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:15:42.434 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:15:42.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:15:42.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:15:42.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:15:42.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:15:42.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:15:42.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:15:50.202 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:15:52.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 06:15:52.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423569,ok=423569,error=0, records=41
[WARN ] 2026-06-01 06:15:52.942 [22706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:16:01.517 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21180/300s
[INFO ] 2026-06-01 06:16:05.203 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:16:07.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 06:16:07.737 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423570,ok=423570,error=0, records=41
[WARN ] 2026-06-01 06:16:07.948 [22706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:16:20.203 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:16:22.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 06:16:22.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423571,ok=423571,error=0, records=41
[WARN ] 2026-06-01 06:16:22.953 [22733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:16:32.392 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21189/300s
[INFO ] 2026-06-01 06:16:35.204 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:16:37.759 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 06:16:37.759 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423572,ok=423572,error=0, records=41
[WARN ] 2026-06-01 06:16:37.958 [22684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:16:50.205 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:16:52.768 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 06:16:52.768 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423573,ok=423573,error=0, records=41
[WARN ] 2026-06-01 06:16:52.962 [22712] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:17:05.205 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:17:05.205 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21192/300s
[INFO ] 2026-06-01 06:17:07.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 06:17:07.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423574,ok=423574,error=0, records=41
[WARN ] 2026-06-01 06:17:07.967 [22733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:17:20.206 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:17:22.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 06:17:22.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423575,ok=423575,error=0, records=41
[WARN ] 2026-06-01 06:17:22.972 [22774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:17:35.206 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:17:37.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 06:17:37.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423576,ok=423576,error=0, records=41
[WARN ] 2026-06-01 06:17:37.977 [22774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:17:39.274 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21190/300s
[INFO ] 2026-06-01 06:17:41.176 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21190/300s
[INFO ] 2026-06-01 06:17:48.782 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21190/300s
[INFO ] 2026-06-01 06:17:50.207 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:17:52.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 06:17:52.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423577,ok=423577,error=0, records=41
[WARN ] 2026-06-01 06:17:52.982 [22747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:18:05.207 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:18:07.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 06:18:07.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423578,ok=423578,error=0, records=41
[WARN ] 2026-06-01 06:18:07.987 [22815] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:18:20.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:18:22.857 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 06:18:22.857 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423579,ok=423579,error=0, records=41
[WARN ] 2026-06-01 06:18:22.992 [22747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:18:35.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:18:37.862 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 06:18:37.862 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423580,ok=423580,error=0, records=41
[WARN ] 2026-06-01 06:18:37.997 [22774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:18:42.435 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871144},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:18:42.591 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:18:42.591 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 06:18:42.591 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:18:42.591 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:18:42.591 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:18:42.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:18:42.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:18:42.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:18:42.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:18:42.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:18:42.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:18:50.209 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:18:52.871 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-01 06:18:52.871 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423581,ok=423581,error=0, records=41
[WARN ] 2026-06-01 06:18:53.002 [22774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:19:05.209 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:19:07.876 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 06:19:07.876 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423582,ok=423582,error=0, records=41
[WARN ] 2026-06-01 06:19:08.007 [22747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:19:20.210 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:19:22.881 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 06:19:22.881 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423583,ok=423583,error=0, records=41
[WARN ] 2026-06-01 06:19:23.012 [22684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:19:35.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:19:37.886 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 06:19:37.886 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423584,ok=423584,error=0, records=41
[WARN ] 2026-06-01 06:19:38.017 [22857] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:19:50.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:19:52.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 06:19:52.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423585,ok=423585,error=0, records=41
[WARN ] 2026-06-01 06:19:53.022 [22815] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:19:55.523 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21185/300s
[INFO ] 2026-06-01 06:20:00.784 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21194/300s
[INFO ] 2026-06-01 06:20:05.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:20:07.897 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 06:20:07.897 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423586,ok=423586,error=0, records=41
[INFO ] 2026-06-01 06:20:07.897 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21181/300s
[WARN ] 2026-06-01 06:20:08.027 [22912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:20:20.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:20:22.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 06:20:22.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423587,ok=423587,error=0, records=41
[WARN ] 2026-06-01 06:20:23.032 [22815] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:20:35.213 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:20:37.934 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 06:20:37.934 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423588,ok=423588,error=0, records=41
[WARN ] 2026-06-01 06:20:38.037 [22815] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:20:40.978 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21194/300s
[INFO ] 2026-06-01 06:20:50.213 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:20:52.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 06:20:52.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423589,ok=423589,error=0, records=41
[WARN ] 2026-06-01 06:20:53.041 [22815] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:21:01.690 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21181/300s
[INFO ] 2026-06-01 06:21:05.214 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:21:07.944 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 06:21:07.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423590,ok=423590,error=0, records=41
[WARN ] 2026-06-01 06:21:08.045 [23002] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:21:20.214 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:21:22.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 06:21:22.950 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423591,ok=423591,error=0, records=41
[WARN ] 2026-06-01 06:21:23.049 [23030] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:21:32.435 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21190/300s
[INFO ] 2026-06-01 06:21:35.215 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:21:37.554 [23036] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:21:37.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 06:21:37.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423592,ok=423592,error=0, records=41
[INFO ] 2026-06-01 06:21:42.591 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17645/300s
[INFO ] 2026-06-01 06:21:42.593 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20871068},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:21:42.770 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:21:42.770 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 06:21:42.771 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:21:42.771 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:21:42.771 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:21:42.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:21:42.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:21:42.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:21:42.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:21:42.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:21:42.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:21:50.215 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:21:52.558 [23060] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:21:52.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 06:21:52.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423593,ok=423593,error=0, records=41
[INFO ] 2026-06-01 06:22:05.216 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:22:05.216 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21193/300s
[WARN ] 2026-06-01 06:22:07.564 [23078] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:22:07.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 06:22:07.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423594,ok=423594,error=0, records=41
[INFO ] 2026-06-01 06:22:20.216 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:22:22.569 [23078] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:22:22.971 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 06:22:22.971 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423595,ok=423595,error=0, records=41
[INFO ] 2026-06-01 06:22:35.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:22:37.573 [23102] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:22:37.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 06:22:37.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423596,ok=423596,error=0, records=41
[INFO ] 2026-06-01 06:22:39.372 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21191/300s
[INFO ] 2026-06-01 06:22:41.274 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21191/300s
[INFO ] 2026-06-01 06:22:48.877 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21191/300s
[INFO ] 2026-06-01 06:22:50.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:22:52.578 [23113] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:22:52.984 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 06:22:52.984 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423597,ok=423597,error=0, records=41
[INFO ] 2026-06-01 06:23:05.218 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:23:07.583 [23149] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:23:07.989 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 06:23:07.989 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423598,ok=423598,error=0, records=41
[INFO ] 2026-06-01 06:23:20.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:23:22.587 [23172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:23:22.994 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 06:23:22.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423599,ok=423599,error=0, records=41
[INFO ] 2026-06-01 06:23:35.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 06:23:35.220 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 06:23:37.592 [23179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:23:38.001 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 06:23:38.001 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423600,ok=423600,error=0, records=41
[INFO ] 2026-06-01 06:23:50.220 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:23:50.220 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 06:23:52.597 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:23:53.012 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 06:23:53.012 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423601,ok=423601,error=0, records=41
[INFO ] 2026-06-01 06:24:05.221 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:24:07.602 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:24:08.018 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 06:24:08.018 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423602,ok=423602,error=0, records=41
[INFO ] 2026-06-01 06:24:20.222 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:24:22.608 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:24:23.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-01 06:24:23.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423603,ok=423603,error=0, records=41
[INFO ] 2026-06-01 06:24:35.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:24:37.614 [23172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:24:38.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 06:24:38.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423604,ok=423604,error=0, records=41
[INFO ] 2026-06-01 06:24:42.772 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870984},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:24:42.931 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:24:42.931 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 06:24:42.931 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:24:42.931 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:24:42.931 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:24:42.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:24:42.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:24:42.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:24:42.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:24:42.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:24:42.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:24:50.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:24:52.619 [23172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:24:53.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-01 06:24:53.089 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423605,ok=423605,error=0, records=41
[INFO ] 2026-06-01 06:24:55.620 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21186/300s
[INFO ] 2026-06-01 06:25:00.787 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21195/300s
[INFO ] 2026-06-01 06:25:05.224 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:25:07.624 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:25:08.094 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 06:25:08.094 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423606,ok=423606,error=0, records=41
[INFO ] 2026-06-01 06:25:08.094 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21182/300s
[INFO ] 2026-06-01 06:25:20.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:25:22.629 [23119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:25:23.102 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 06:25:23.102 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423607,ok=423607,error=0, records=41
[INFO ] 2026-06-01 06:25:35.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:25:37.634 [23172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:25:38.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 06:25:38.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423608,ok=423608,error=0, records=41
[INFO ] 2026-06-01 06:25:40.984 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21195/300s
[INFO ] 2026-06-01 06:25:50.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:25:52.639 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:25:53.213 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 06:25:53.213 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423609,ok=423609,error=0, records=41
[INFO ] 2026-06-01 06:26:01.870 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21182/300s
[INFO ] 2026-06-01 06:26:05.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:26:07.644 [23199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:26:08.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=12148, records=51
[INFO ] 2026-06-01 06:26:08.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423610,ok=423610,error=0, records=51
[INFO ] 2026-06-01 06:26:20.227 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:26:22.649 [23173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:26:23.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 06:26:23.226 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423611,ok=423611,error=0, records=41
[INFO ] 2026-06-01 06:26:32.487 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21191/300s
[INFO ] 2026-06-01 06:26:35.227 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:26:37.655 [23173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:26:38.232 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10372, records=41
[INFO ] 2026-06-01 06:26:38.232 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423612,ok=423612,error=0, records=41
[INFO ] 2026-06-01 06:26:50.228 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:26:52.661 [23199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:26:53.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 06:26:53.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423613,ok=423613,error=0, records=41
[INFO ] 2026-06-01 06:27:05.228 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:27:05.228 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21194/300s
[WARN ] 2026-06-01 06:27:07.666 [23173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:27:08.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 06:27:08.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423614,ok=423614,error=0, records=41
[INFO ] 2026-06-01 06:27:20.229 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:27:22.672 [23173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:27:23.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 06:27:23.250 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423615,ok=423615,error=0, records=41
[INFO ] 2026-06-01 06:27:35.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:27:37.678 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:27:38.255 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 06:27:38.255 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423616,ok=423616,error=0, records=41
[INFO ] 2026-06-01 06:27:39.401 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21192/300s
[INFO ] 2026-06-01 06:27:41.302 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21192/300s
[INFO ] 2026-06-01 06:27:42.931 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17646/300s
[INFO ] 2026-06-01 06:27:42.933 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870912},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:27:43.111 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:27:43.111 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 06:27:43.111 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:27:43.111 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:27:43.111 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:27:43.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:27:43.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:27:43.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:27:43.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:27:43.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:27:43.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:27:48.909 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21192/300s
[INFO ] 2026-06-01 06:27:50.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:27:52.682 [23199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:27:53.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 06:27:53.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423617,ok=423617,error=0, records=41
[INFO ] 2026-06-01 06:28:05.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:28:07.688 [23173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:28:08.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-01 06:28:08.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423618,ok=423618,error=0, records=41
[INFO ] 2026-06-01 06:28:20.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:28:22.693 [23173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:28:23.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 06:28:23.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423619,ok=423619,error=0, records=41
[INFO ] 2026-06-01 06:28:35.232 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:28:37.698 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:28:38.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 06:28:38.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423620,ok=423620,error=0, records=41
[INFO ] 2026-06-01 06:28:50.232 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:28:52.703 [23119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:28:53.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 06:28:53.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423621,ok=423621,error=0, records=41
[INFO ] 2026-06-01 06:29:05.233 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:29:07.709 [23119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:29:08.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 06:29:08.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423622,ok=423622,error=0, records=41
[INFO ] 2026-06-01 06:29:20.233 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:29:22.714 [23199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:29:23.290 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 06:29:23.290 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423623,ok=423623,error=0, records=41
[INFO ] 2026-06-01 06:29:35.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:29:37.720 [23173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:29:38.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 06:29:38.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423624,ok=423624,error=0, records=41
[INFO ] 2026-06-01 06:29:50.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:29:52.726 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:29:53.341 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 06:29:53.341 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423625,ok=423625,error=0, records=41
[INFO ] 2026-06-01 06:29:55.727 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21187/300s
[INFO ] 2026-06-01 06:30:00.790 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21196/300s
[INFO ] 2026-06-01 06:30:05.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:30:07.731 [23119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:30:08.346 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10400, records=41
[INFO ] 2026-06-01 06:30:08.346 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423626,ok=423626,error=0, records=41
[INFO ] 2026-06-01 06:30:08.346 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21183/300s
[INFO ] 2026-06-01 06:30:20.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:30:22.736 [23173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:30:23.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-01 06:30:23.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423627,ok=423627,error=0, records=41
[INFO ] 2026-06-01 06:30:35.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:30:37.743 [23172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:30:38.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 06:30:38.357 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423628,ok=423628,error=0, records=41
[INFO ] 2026-06-01 06:30:40.990 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21196/300s
[INFO ] 2026-06-01 06:30:43.112 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870828},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:30:43.274 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:30:43.274 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 06:30:43.274 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:30:43.275 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:30:43.275 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:30:43.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:30:43.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:30:43.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:30:43.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:30:43.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:30:43.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:30:50.238 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:30:52.748 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:30:53.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 06:30:53.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423629,ok=423629,error=0, records=41
[INFO ] 2026-06-01 06:31:02.048 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21183/300s
[INFO ] 2026-06-01 06:31:05.238 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:31:07.753 [23172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:31:08.370 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 06:31:08.370 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423630,ok=423630,error=0, records=41
[INFO ] 2026-06-01 06:31:20.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:31:22.759 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:31:23.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 06:31:23.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423631,ok=423631,error=0, records=41
[INFO ] 2026-06-01 06:31:32.536 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21192/300s
[INFO ] 2026-06-01 06:31:35.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:31:37.764 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:31:38.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 06:31:38.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423632,ok=423632,error=0, records=41
[INFO ] 2026-06-01 06:31:50.240 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:31:52.767 [23172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:31:53.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 06:31:53.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423633,ok=423633,error=0, records=41
[INFO ] 2026-06-01 06:32:05.240 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:32:05.241 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21195/300s
[WARN ] 2026-06-01 06:32:07.771 [23119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:32:08.391 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 06:32:08.391 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423634,ok=423634,error=0, records=41
[INFO ] 2026-06-01 06:32:20.241 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:32:22.777 [23119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:32:23.397 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 06:32:23.397 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423635,ok=423635,error=0, records=41
[INFO ] 2026-06-01 06:32:35.242 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:32:37.783 [23119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:32:38.404 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 06:32:38.404 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423636,ok=423636,error=0, records=41
[INFO ] 2026-06-01 06:32:39.435 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21193/300s
[INFO ] 2026-06-01 06:32:41.337 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21193/300s
[INFO ] 2026-06-01 06:32:48.943 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21193/300s
[INFO ] 2026-06-01 06:32:50.242 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:32:52.788 [23199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:32:53.410 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 06:32:53.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423637,ok=423637,error=0, records=41
[INFO ] 2026-06-01 06:33:05.243 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:33:07.795 [23172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:33:08.418 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-01 06:33:08.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423638,ok=423638,error=0, records=41
[INFO ] 2026-06-01 06:33:20.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:33:22.799 [23173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:33:23.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-01 06:33:23.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423639,ok=423639,error=0, records=41
[INFO ] 2026-06-01 06:33:35.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 06:33:35.244 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 06:33:37.804 [23172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:33:38.429 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10139, records=41
[INFO ] 2026-06-01 06:33:38.429 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423640,ok=423640,error=0, records=41
[INFO ] 2026-06-01 06:33:43.275 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17647/300s
[INFO ] 2026-06-01 06:33:43.276 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870756},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:33:43.430 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:33:43.431 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 06:33:43.431 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:33:43.431 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:33:43.431 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:33:43.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:33:43.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:33:43.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:33:43.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:33:43.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:33:43.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:33:50.245 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:33:52.809 [23738] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:33:53.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-01 06:33:53.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423641,ok=423641,error=0, records=41
[INFO ] 2026-06-01 06:34:05.246 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:34:07.814 [23758] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:34:08.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 06:34:08.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423642,ok=423642,error=0, records=41
[INFO ] 2026-06-01 06:34:20.246 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:34:22.819 [23172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:34:23.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 06:34:23.448 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423643,ok=423643,error=0, records=41
[INFO ] 2026-06-01 06:34:35.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:34:37.825 [23753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:34:38.454 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 06:34:38.454 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423644,ok=423644,error=0, records=41
[INFO ] 2026-06-01 06:34:50.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:34:52.830 [23773] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:34:53.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 06:34:53.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423645,ok=423645,error=0, records=41
[INFO ] 2026-06-01 06:34:55.831 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21188/300s
[INFO ] 2026-06-01 06:35:00.793 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21197/300s
[INFO ] 2026-06-01 06:35:05.248 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:35:07.835 [23802] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:35:08.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 06:35:08.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423646,ok=423646,error=0, records=41
[INFO ] 2026-06-01 06:35:08.464 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21184/300s
[INFO ] 2026-06-01 06:35:20.248 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:35:22.841 [23773] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:35:23.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 06:35:23.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423647,ok=423647,error=0, records=41
[INFO ] 2026-06-01 06:35:35.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:35:37.847 [23753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:35:38.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 06:35:38.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423648,ok=423648,error=0, records=41
[INFO ] 2026-06-01 06:35:40.996 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21197/300s
[INFO ] 2026-06-01 06:35:50.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:35:52.853 [23787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:35:53.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 06:35:53.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423649,ok=423649,error=0, records=41
[INFO ] 2026-06-01 06:36:02.225 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21184/300s
[INFO ] 2026-06-01 06:36:05.250 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:36:07.858 [23787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:36:08.485 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-01 06:36:08.485 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423650,ok=423650,error=0, records=41
[INFO ] 2026-06-01 06:36:20.250 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:36:22.864 [23853] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:36:23.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10406, records=41
[INFO ] 2026-06-01 06:36:23.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423651,ok=423651,error=0, records=41
[INFO ] 2026-06-01 06:36:32.583 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21193/300s
[INFO ] 2026-06-01 06:36:35.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:36:37.870 [23853] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:36:38.497 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 06:36:38.497 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423652,ok=423652,error=0, records=41
[INFO ] 2026-06-01 06:36:43.432 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870676},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:36:43.606 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:36:43.606 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 06:36:43.606 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:36:43.606 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:36:43.606 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:36:43.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:36:43.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:36:43.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:36:43.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:36:43.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:36:43.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:36:50.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:36:52.877 [23787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:36:53.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 06:36:53.795 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423653,ok=423653,error=0, records=41
[INFO ] 2026-06-01 06:37:05.252 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:37:05.252 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21196/300s
[WARN ] 2026-06-01 06:37:07.882 [23921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:37:08.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 06:37:08.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423654,ok=423654,error=0, records=41
[INFO ] 2026-06-01 06:37:20.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:37:22.888 [23787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:37:23.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 06:37:23.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423655,ok=423655,error=0, records=41
[INFO ] 2026-06-01 06:37:35.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:37:37.893 [23960] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:37:38.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 06:37:38.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423656,ok=423656,error=0, records=41
[INFO ] 2026-06-01 06:37:39.461 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21194/300s
[INFO ] 2026-06-01 06:37:41.363 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21194/300s
[INFO ] 2026-06-01 06:37:48.969 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21194/300s
[INFO ] 2026-06-01 06:37:50.254 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:37:52.899 [23982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:37:53.849 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 06:37:53.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423657,ok=423657,error=0, records=41
[INFO ] 2026-06-01 06:38:05.255 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:38:07.903 [23988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:38:08.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 06:38:08.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423658,ok=423658,error=0, records=41
[INFO ] 2026-06-01 06:38:20.255 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:38:22.908 [24010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:38:23.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 06:38:23.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423659,ok=423659,error=0, records=41
[INFO ] 2026-06-01 06:38:35.256 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:38:37.914 [23944] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:38:38.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 06:38:38.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423660,ok=423660,error=0, records=41
[INFO ] 2026-06-01 06:38:50.256 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:38:50.256 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 06:38:52.919 [24038] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:38:53.870 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 06:38:53.870 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423661,ok=423661,error=0, records=41
[INFO ] 2026-06-01 06:39:05.257 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:39:07.925 [24055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:39:08.875 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 06:39:08.876 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423662,ok=423662,error=0, records=41
[INFO ] 2026-06-01 06:39:20.258 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:39:22.930 [24049] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:39:23.882 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 06:39:23.882 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423663,ok=423663,error=0, records=41
[INFO ] 2026-06-01 06:39:35.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:39:37.937 [24093] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:39:38.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 06:39:38.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423664,ok=423664,error=0, records=41
[INFO ] 2026-06-01 06:39:43.607 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17648/300s
[INFO ] 2026-06-01 06:39:43.608 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870596},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:39:43.788 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:39:43.789 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 06:39:43.789 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:39:43.789 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:39:43.789 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:39:43.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:39:43.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:39:43.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:39:43.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:39:43.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:39:43.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:39:50.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:39:52.943 [24110] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:39:53.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 06:39:53.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423665,ok=423665,error=0, records=41
[INFO ] 2026-06-01 06:39:55.944 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21189/300s
[INFO ] 2026-06-01 06:40:00.796 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21198/300s
[INFO ] 2026-06-01 06:40:05.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:40:07.949 [24132] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:40:08.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 06:40:08.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423666,ok=423666,error=0, records=41
[INFO ] 2026-06-01 06:40:08.900 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21185/300s
[INFO ] 2026-06-01 06:40:20.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:40:22.954 [24147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:40:23.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 06:40:23.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423667,ok=423667,error=0, records=41
[INFO ] 2026-06-01 06:40:35.261 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:40:37.958 [24115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:40:38.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 06:40:38.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423668,ok=423668,error=0, records=41
[INFO ] 2026-06-01 06:40:41.002 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21198/300s
[INFO ] 2026-06-01 06:40:50.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:40:52.963 [24098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:40:53.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 06:40:53.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423669,ok=423669,error=0, records=41
[INFO ] 2026-06-01 06:41:02.400 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21185/300s
[INFO ] 2026-06-01 06:41:05.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:41:07.967 [24175] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:41:08.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 06:41:08.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423670,ok=423670,error=0, records=41
[INFO ] 2026-06-01 06:41:20.263 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:41:22.971 [24203] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:41:23.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 06:41:23.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423671,ok=423671,error=0, records=41
[INFO ] 2026-06-01 06:41:32.633 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21194/300s
[INFO ] 2026-06-01 06:41:35.263 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:41:37.976 [24175] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:41:38.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 06:41:38.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423672,ok=423672,error=0, records=41
[INFO ] 2026-06-01 06:41:50.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:41:52.982 [24115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:41:53.944 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 06:41:53.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423673,ok=423673,error=0, records=41
[INFO ] 2026-06-01 06:42:05.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:42:05.265 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21197/300s
[WARN ] 2026-06-01 06:42:07.987 [24243] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:42:08.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 06:42:08.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423674,ok=423674,error=0, records=41
[INFO ] 2026-06-01 06:42:20.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:42:22.992 [24115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:42:23.958 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 06:42:23.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423675,ok=423675,error=0, records=41
[INFO ] 2026-06-01 06:42:35.266 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:42:37.996 [24243] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:42:38.965 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 06:42:38.965 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423676,ok=423676,error=0, records=41
[INFO ] 2026-06-01 06:42:39.483 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21195/300s
[INFO ] 2026-06-01 06:42:41.384 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21195/300s
[INFO ] 2026-06-01 06:42:43.790 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870508},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:42:43.957 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:42:43.957 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 06:42:43.957 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:42:43.957 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:42:43.957 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:42:43.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:42:43.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:42:43.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:42:43.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:42:43.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:42:43.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:42:48.988 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21195/300s
[INFO ] 2026-06-01 06:42:50.266 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:42:53.001 [24115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:42:53.970 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 06:42:53.970 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423677,ok=423677,error=0, records=41
[INFO ] 2026-06-01 06:43:05.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:43:08.006 [24304] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:43:08.975 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 06:43:08.975 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423678,ok=423678,error=0, records=41
[INFO ] 2026-06-01 06:43:20.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:43:23.012 [24260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:43:23.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 06:43:23.981 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423679,ok=423679,error=0, records=41
[INFO ] 2026-06-01 06:43:35.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 06:43:35.268 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 06:43:38.017 [24115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:43:38.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-01 06:43:38.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423680,ok=423680,error=0, records=41
[INFO ] 2026-06-01 06:43:50.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:43:53.022 [24260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:43:53.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 06:43:53.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423681,ok=423681,error=0, records=41
[INFO ] 2026-06-01 06:44:05.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:44:08.027 [24289] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:44:08.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 06:44:08.996 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423682,ok=423682,error=0, records=41
[INFO ] 2026-06-01 06:44:20.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:44:23.032 [24376] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:44:24.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 06:44:24.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423683,ok=423683,error=0, records=41
[INFO ] 2026-06-01 06:44:35.271 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:44:38.037 [24361] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:44:39.006 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 06:44:39.006 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423684,ok=423684,error=0, records=41
[INFO ] 2026-06-01 06:44:50.271 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:44:53.043 [24409] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:44:54.012 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 06:44:54.012 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423685,ok=423685,error=0, records=41
[INFO ] 2026-06-01 06:44:56.043 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21190/300s
[INFO ] 2026-06-01 06:45:00.799 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21199/300s
[INFO ] 2026-06-01 06:45:05.273 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:45:08.048 [24420] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:45:09.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 06:45:09.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423686,ok=423686,error=0, records=41
[INFO ] 2026-06-01 06:45:09.019 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21186/300s
[INFO ] 2026-06-01 06:45:20.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:45:23.053 [24442] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:45:24.025 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 06:45:24.025 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423687,ok=423687,error=0, records=41
[INFO ] 2026-06-01 06:45:35.275 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:45:37.557 [24431] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:45:39.030 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 06:45:39.030 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423688,ok=423688,error=0, records=41
[INFO ] 2026-06-01 06:45:41.008 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21199/300s
[INFO ] 2026-06-01 06:45:43.957 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17649/300s
[INFO ] 2026-06-01 06:45:43.959 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870432},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:45:44.111 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:45:44.111 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 06:45:44.112 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:45:44.112 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:45:44.112 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:45:44.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:45:44.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:45:44.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:45:44.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:45:44.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:45:44.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:45:50.275 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:45:52.562 [24471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:45:54.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 06:45:54.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423689,ok=423689,error=0, records=41
[INFO ] 2026-06-01 06:46:02.578 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21186/300s
[INFO ] 2026-06-01 06:46:05.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:46:07.567 [24481] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:46:09.040 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 06:46:09.040 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423690,ok=423690,error=0, records=41
[INFO ] 2026-06-01 06:46:20.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:46:22.571 [24502] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:46:24.046 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 06:46:24.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423691,ok=423691,error=0, records=41
[INFO ] 2026-06-01 06:46:32.683 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21195/300s
[INFO ] 2026-06-01 06:46:35.277 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:46:37.575 [24522] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:46:39.052 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 06:46:39.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423692,ok=423692,error=0, records=41
[INFO ] 2026-06-01 06:46:50.277 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:46:52.579 [24540] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:46:54.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 06:46:54.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423693,ok=423693,error=0, records=41
[INFO ] 2026-06-01 06:47:05.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:47:05.278 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21198/300s
[WARN ] 2026-06-01 06:47:07.583 [24564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:47:09.115 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 06:47:09.115 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423694,ok=423694,error=0, records=41
[INFO ] 2026-06-01 06:47:20.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:47:22.588 [24551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:47:24.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 06:47:24.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423695,ok=423695,error=0, records=41
[INFO ] 2026-06-01 06:47:35.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:47:37.592 [24605] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:47:39.128 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 06:47:39.128 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423696,ok=423696,error=0, records=41
[INFO ] 2026-06-01 06:47:39.509 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21196/300s
[INFO ] 2026-06-01 06:47:41.411 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21196/300s
[INFO ] 2026-06-01 06:47:49.015 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21196/300s
[INFO ] 2026-06-01 06:47:50.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:47:52.597 [24605] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:47:54.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 06:47:54.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423697,ok=423697,error=0, records=41
[INFO ] 2026-06-01 06:48:05.280 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:48:07.603 [24619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:48:09.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 06:48:09.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423698,ok=423698,error=0, records=41
[INFO ] 2026-06-01 06:48:20.280 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:48:22.609 [24551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:48:24.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-01 06:48:24.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423699,ok=423699,error=0, records=41
[INFO ] 2026-06-01 06:48:35.281 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:48:37.615 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:48:39.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-01 06:48:39.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423700,ok=423700,error=0, records=41
[INFO ] 2026-06-01 06:48:44.114 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870360},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:48:44.265 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:48:44.265 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 06:48:44.265 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:48:44.265 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:48:44.265 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:48:44.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:48:44.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:48:44.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:48:44.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:48:44.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:48:44.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:48:50.282 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:48:52.621 [24551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:48:54.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-01 06:48:54.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423701,ok=423701,error=0, records=41
[INFO ] 2026-06-01 06:49:05.282 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:49:07.626 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:49:09.167 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 06:49:09.167 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423702,ok=423702,error=0, records=41
[INFO ] 2026-06-01 06:49:20.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:49:22.631 [24605] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:49:24.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 06:49:24.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423703,ok=423703,error=0, records=41
[INFO ] 2026-06-01 06:49:35.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:49:37.636 [24551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:49:39.178 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 06:49:39.179 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423704,ok=423704,error=0, records=41
[INFO ] 2026-06-01 06:49:50.284 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:49:52.642 [24619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:49:54.190 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 06:49:54.190 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423705,ok=423705,error=0, records=41
[INFO ] 2026-06-01 06:49:56.143 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21191/300s
[INFO ] 2026-06-01 06:50:00.802 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21200/300s
[INFO ] 2026-06-01 06:50:05.284 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:50:07.647 [24564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:50:09.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 06:50:09.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423706,ok=423706,error=0, records=41
[INFO ] 2026-06-01 06:50:09.197 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21187/300s
[INFO ] 2026-06-01 06:50:20.285 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:50:22.654 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:50:24.202 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 06:50:24.202 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423707,ok=423707,error=0, records=41
[INFO ] 2026-06-01 06:50:35.285 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:50:37.659 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:50:39.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 06:50:39.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423708,ok=423708,error=0, records=41
[INFO ] 2026-06-01 06:50:41.014 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21200/300s
[INFO ] 2026-06-01 06:50:50.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:50:52.664 [24551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:50:54.213 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 06:50:54.213 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423709,ok=423709,error=0, records=41
[INFO ] 2026-06-01 06:51:02.754 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21187/300s
[INFO ] 2026-06-01 06:51:05.287 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:51:07.669 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:51:09.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 06:51:09.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423710,ok=423710,error=0, records=41
[INFO ] 2026-06-01 06:51:20.287 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:51:22.674 [24564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:51:24.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 06:51:24.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423711,ok=423711,error=0, records=41
[INFO ] 2026-06-01 06:51:32.733 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21196/300s
[INFO ] 2026-06-01 06:51:35.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:51:37.679 [24564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:51:39.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 06:51:39.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423712,ok=423712,error=0, records=41
[INFO ] 2026-06-01 06:51:44.266 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17650/300s
[INFO ] 2026-06-01 06:51:44.267 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870276},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:51:44.454 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:51:44.454 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 06:51:44.454 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:51:44.454 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:51:44.454 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:51:44.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:51:44.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:51:44.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:51:44.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:51:44.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:51:44.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:51:50.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:51:52.684 [24551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:51:54.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 06:51:54.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423713,ok=423713,error=0, records=41
[INFO ] 2026-06-01 06:52:05.289 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:52:05.289 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21199/300s
[WARN ] 2026-06-01 06:52:07.690 [24605] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:52:09.250 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 06:52:09.250 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423714,ok=423714,error=0, records=41
[INFO ] 2026-06-01 06:52:20.289 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:52:22.697 [24564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:52:24.257 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 06:52:24.257 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423715,ok=423715,error=0, records=41
[INFO ] 2026-06-01 06:52:35.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:52:37.702 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:52:39.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 06:52:39.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423716,ok=423716,error=0, records=41
[INFO ] 2026-06-01 06:52:39.538 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21197/300s
[INFO ] 2026-06-01 06:52:41.439 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21197/300s
[INFO ] 2026-06-01 06:52:49.044 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21197/300s
[INFO ] 2026-06-01 06:52:50.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:52:52.708 [24619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:52:54.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 06:52:54.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423717,ok=423717,error=0, records=41
[INFO ] 2026-06-01 06:53:05.291 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:53:07.714 [24619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:53:09.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 06:53:09.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423718,ok=423718,error=0, records=41
[INFO ] 2026-06-01 06:53:20.292 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:53:22.719 [24619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:53:24.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 06:53:24.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423719,ok=423719,error=0, records=41
[INFO ] 2026-06-01 06:53:35.292 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 06:53:35.292 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 06:53:37.724 [24564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:53:39.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-01 06:53:39.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423720,ok=423720,error=0, records=41
[INFO ] 2026-06-01 06:53:50.293 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:53:50.293 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 06:53:52.729 [24564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:53:54.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 06:53:54.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423721,ok=423721,error=0, records=41
[INFO ] 2026-06-01 06:54:05.294 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:54:07.734 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:54:09.299 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 06:54:09.299 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423722,ok=423722,error=0, records=41
[INFO ] 2026-06-01 06:54:20.295 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:54:22.740 [24551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:54:24.305 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 06:54:24.305 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423723,ok=423723,error=0, records=41
[INFO ] 2026-06-01 06:54:35.295 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:54:37.746 [24551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:54:39.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 06:54:39.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423724,ok=423724,error=0, records=41
[INFO ] 2026-06-01 06:54:44.456 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870204},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:54:44.612 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:54:44.612 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 06:54:44.612 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:54:44.612 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:54:44.612 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:54:44.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:54:44.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:54:44.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:54:44.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:54:44.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:54:44.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:54:50.296 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:54:52.751 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:54:54.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 06:54:54.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423725,ok=423725,error=0, records=41
[INFO ] 2026-06-01 06:54:56.252 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21192/300s
[INFO ] 2026-06-01 06:55:00.805 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21201/300s
[INFO ] 2026-06-01 06:55:05.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:55:07.757 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:55:09.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 06:55:09.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423726,ok=423726,error=0, records=41
[INFO ] 2026-06-01 06:55:09.321 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21188/300s
[INFO ] 2026-06-01 06:55:20.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:55:22.761 [24605] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:55:24.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 06:55:24.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423727,ok=423727,error=0, records=41
[INFO ] 2026-06-01 06:55:35.298 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:55:37.765 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:55:39.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 06:55:39.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423728,ok=423728,error=0, records=41
[INFO ] 2026-06-01 06:55:41.020 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21201/300s
[INFO ] 2026-06-01 06:55:50.298 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:55:52.769 [24619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:55:54.341 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 06:55:54.341 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423729,ok=423729,error=0, records=41
[INFO ] 2026-06-01 06:56:02.933 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21188/300s
[INFO ] 2026-06-01 06:56:05.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:56:07.774 [24564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:56:09.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 06:56:09.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423730,ok=423730,error=0, records=41
[INFO ] 2026-06-01 06:56:20.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:56:22.780 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:56:24.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 06:56:24.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423731,ok=423731,error=0, records=41
[INFO ] 2026-06-01 06:56:32.784 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21197/300s
[INFO ] 2026-06-01 06:56:35.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:56:37.786 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:56:39.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 06:56:39.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423732,ok=423732,error=0, records=41
[INFO ] 2026-06-01 06:56:50.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:56:52.791 [24594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:56:54.362 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 06:56:54.362 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423733,ok=423733,error=0, records=41
[INFO ] 2026-06-01 06:57:05.301 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 06:57:05.301 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21200/300s
[WARN ] 2026-06-01 06:57:07.797 [24605] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:57:09.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 06:57:09.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423734,ok=423734,error=0, records=41
[INFO ] 2026-06-01 06:57:20.302 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:57:22.801 [24619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:57:24.374 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 06:57:24.374 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423735,ok=423735,error=0, records=41
[INFO ] 2026-06-01 06:57:35.302 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:57:37.806 [25146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:57:39.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 06:57:39.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423736,ok=423736,error=0, records=41
[INFO ] 2026-06-01 06:57:39.563 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21198/300s
[INFO ] 2026-06-01 06:57:41.465 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21198/300s
[INFO ] 2026-06-01 06:57:44.612 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17651/300s
[INFO ] 2026-06-01 06:57:44.614 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870128},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 06:57:44.777 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 06:57:44.777 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 06:57:44.777 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 06:57:44.777 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 06:57:44.777 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 06:57:44.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:57:44.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 06:57:44.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:57:44.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 06:57:44.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:57:44.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 06:57:49.070 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21198/300s
[INFO ] 2026-06-01 06:57:50.303 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:57:52.811 [25157] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:57:54.384 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 06:57:54.384 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423737,ok=423737,error=0, records=41
[INFO ] 2026-06-01 06:58:05.304 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:58:07.817 [25177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:58:09.389 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 06:58:09.389 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423738,ok=423738,error=0, records=41
[INFO ] 2026-06-01 06:58:20.304 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:58:22.821 [25157] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:58:24.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 06:58:24.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423739,ok=423739,error=0, records=41
[INFO ] 2026-06-01 06:58:35.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:58:37.826 [25192] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:58:39.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 06:58:39.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423740,ok=423740,error=0, records=41
[INFO ] 2026-06-01 06:58:50.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:58:52.832 [25157] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:58:54.404 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 06:58:54.404 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423741,ok=423741,error=0, records=41
[INFO ] 2026-06-01 06:59:05.306 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:59:07.837 [25177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:59:09.409 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 06:59:09.409 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423742,ok=423742,error=0, records=41
[INFO ] 2026-06-01 06:59:20.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:59:22.841 [25192] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:59:24.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 06:59:24.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423743,ok=423743,error=0, records=41
[INFO ] 2026-06-01 06:59:35.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:59:37.847 [25192] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:59:39.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 06:59:39.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423744,ok=423744,error=0, records=41
[INFO ] 2026-06-01 06:59:50.308 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 06:59:52.851 [25258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 06:59:54.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 06:59:54.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423745,ok=423745,error=0, records=41
[INFO ] 2026-06-01 06:59:56.352 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21193/300s
[INFO ] 2026-06-01 07:00:00.808 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21202/300s
[INFO ] 2026-06-01 07:00:05.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:00:07.856 [25206] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:00:09.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-01 07:00:09.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423746,ok=423746,error=0, records=41
[INFO ] 2026-06-01 07:00:09.432 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21189/300s
[INFO ] 2026-06-01 07:00:20.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:00:22.861 [25258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:00:24.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10384, records=41
[INFO ] 2026-06-01 07:00:24.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423747,ok=423747,error=0, records=41
[INFO ] 2026-06-01 07:00:35.310 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:00:37.865 [25192] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:00:39.443 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 07:00:39.443 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423748,ok=423748,error=0, records=41
[INFO ] 2026-06-01 07:00:41.027 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21202/300s
[INFO ] 2026-06-01 07:00:44.779 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20870048},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:00:44.941 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:00:44.941 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 07:00:44.941 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:00:44.941 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:00:44.941 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:00:44.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:00:44.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:00:44.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:00:44.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:00:44.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:00:44.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:00:50.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:00:52.869 [25258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:00:54.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 07:00:54.448 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423749,ok=423749,error=0, records=41
[INFO ] 2026-06-01 07:01:03.110 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21189/300s
[INFO ] 2026-06-01 07:01:05.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:01:07.874 [25361] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:01:09.454 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 07:01:09.454 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423750,ok=423750,error=0, records=41
[INFO ] 2026-06-01 07:01:20.312 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:01:22.879 [25372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:01:24.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 07:01:24.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423751,ok=423751,error=0, records=41
[INFO ] 2026-06-01 07:01:32.837 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21198/300s
[INFO ] 2026-06-01 07:01:35.312 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:01:37.884 [25378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:01:39.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 07:01:39.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423752,ok=423752,error=0, records=41
[INFO ] 2026-06-01 07:01:50.313 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:01:52.889 [25378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:01:54.473 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 07:01:54.473 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423753,ok=423753,error=0, records=41
[INFO ] 2026-06-01 07:02:05.314 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:02:05.314 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21201/300s
[WARN ] 2026-06-01 07:02:07.895 [25419] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:02:09.478 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 07:02:09.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423754,ok=423754,error=0, records=41
[WARN ] 2026-06-01 07:02:17.400 [25484] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/20663/stat), No such file or directory
[INFO ] 2026-06-01 07:02:20.314 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:02:22.900 [25485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:02:24.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 07:02:24.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423755,ok=423755,error=0, records=41
[WARN ] 2026-06-01 07:02:32.405 [25383] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/20635/stat), No such file or directory
[WARN ] 2026-06-01 07:02:32.405 [25383] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/20663/stat), No such file or directory
[WARN ] 2026-06-01 07:02:32.405 [25383] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17451/stat), No such file or directory
[INFO ] 2026-06-01 07:02:35.315 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:02:37.906 [25503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:02:39.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 07:02:39.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423756,ok=423756,error=0, records=41
[INFO ] 2026-06-01 07:02:39.610 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21199/300s
[INFO ] 2026-06-01 07:02:41.511 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21199/300s
[WARN ] 2026-06-01 07:02:47.410 [25456] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/20635/stat), No such file or directory
[WARN ] 2026-06-01 07:02:47.410 [25456] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/20663/stat), No such file or directory
[WARN ] 2026-06-01 07:02:47.411 [25456] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17451/stat), No such file or directory
[INFO ] 2026-06-01 07:02:49.116 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21199/300s
[INFO ] 2026-06-01 07:02:50.315 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:02:52.912 [25502] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:02:54.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 07:02:54.497 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423757,ok=423757,error=0, records=41
[INFO ] 2026-06-01 07:03:05.316 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:03:07.918 [25491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:03:09.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 07:03:09.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423758,ok=423758,error=0, records=41
[INFO ] 2026-06-01 07:03:20.317 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:03:22.924 [25608] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:03:24.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 07:03:24.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423759,ok=423759,error=0, records=41
[INFO ] 2026-06-01 07:03:35.317 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 07:03:35.317 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 07:03:37.929 [25635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:03:39.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 07:03:39.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423760,ok=423760,error=0, records=41
[INFO ] 2026-06-01 07:03:44.942 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17652/300s
[INFO ] 2026-06-01 07:03:44.943 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869940},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:03:45.113 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:03:45.114 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 07:03:50.318 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:03:52.935 [25651] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:03:54.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 07:03:54.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423761,ok=423761,error=0, records=41
[INFO ] 2026-06-01 07:04:05.318 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:04:07.942 [25668] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:04:09.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 07:04:09.526 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423762,ok=423762,error=0, records=41
[INFO ] 2026-06-01 07:04:20.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:04:22.948 [25668] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:04:24.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 07:04:24.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423763,ok=423763,error=0, records=41
[INFO ] 2026-06-01 07:04:35.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:04:37.952 [25608] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:04:39.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-01 07:04:39.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423764,ok=423764,error=0, records=41
[INFO ] 2026-06-01 07:04:50.320 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:04:52.957 [25668] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:04:54.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 07:04:54.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423765,ok=423765,error=0, records=41
[INFO ] 2026-06-01 07:04:56.458 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21194/300s
[INFO ] 2026-06-01 07:05:00.812 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21203/300s
[INFO ] 2026-06-01 07:05:05.320 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:05:07.962 [25668] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:05:09.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 07:05:09.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423766,ok=423766,error=0, records=41
[INFO ] 2026-06-01 07:05:09.549 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21190/300s
[INFO ] 2026-06-01 07:05:20.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:05:22.967 [25668] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:05:24.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 07:05:24.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423767,ok=423767,error=0, records=41
[INFO ] 2026-06-01 07:05:35.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:05:37.973 [25757] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:05:39.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 07:05:39.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423768,ok=423768,error=0, records=41
[INFO ] 2026-06-01 07:05:41.032 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21203/300s
[INFO ] 2026-06-01 07:05:50.322 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:05:52.979 [25757] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:05:54.567 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 07:05:54.567 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423769,ok=423769,error=0, records=41
[INFO ] 2026-06-01 07:06:03.293 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21190/300s
[INFO ] 2026-06-01 07:06:05.324 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:06:07.984 [25757] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:06:09.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 07:06:09.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423770,ok=423770,error=0, records=41
[INFO ] 2026-06-01 07:06:20.325 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:06:22.989 [25799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:06:24.580 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 07:06:24.580 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423771,ok=423771,error=0, records=41
[INFO ] 2026-06-01 07:06:32.882 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21199/300s
[INFO ] 2026-06-01 07:06:35.325 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:06:37.993 [25799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:06:39.587 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 07:06:39.587 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423772,ok=423772,error=0, records=41
[INFO ] 2026-06-01 07:06:45.115 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869864},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:06:45.317 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:06:45.317 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 07:06:50.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:06:52.998 [25829] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:06:54.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 07:06:54.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423773,ok=423773,error=0, records=41
[INFO ] 2026-06-01 07:07:05.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:07:05.327 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21202/300s
[WARN ] 2026-06-01 07:07:08.003 [25799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:07:09.624 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 07:07:09.624 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423774,ok=423774,error=0, records=41
[INFO ] 2026-06-01 07:07:20.327 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:07:23.009 [25799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:07:24.628 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 07:07:24.628 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423775,ok=423775,error=0, records=41
[INFO ] 2026-06-01 07:07:35.327 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:07:38.014 [25814] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:07:39.611 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21200/300s
[INFO ] 2026-06-01 07:07:39.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 07:07:39.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423776,ok=423776,error=0, records=41
[INFO ] 2026-06-01 07:07:41.513 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21200/300s
[INFO ] 2026-06-01 07:07:49.216 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21200/300s
[INFO ] 2026-06-01 07:07:50.328 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:07:53.019 [25743] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:07:54.640 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 07:07:54.640 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423777,ok=423777,error=0, records=41
[INFO ] 2026-06-01 07:08:05.329 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:08:08.024 [25896] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:08:09.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 07:08:09.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423778,ok=423778,error=0, records=41
[INFO ] 2026-06-01 07:08:20.329 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:08:23.029 [25910] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:08:24.651 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 07:08:24.651 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423779,ok=423779,error=0, records=41
[INFO ] 2026-06-01 07:08:35.330 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:08:38.033 [25910] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:08:39.659 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 07:08:39.659 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423780,ok=423780,error=0, records=41
[INFO ] 2026-06-01 07:08:50.330 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:08:50.330 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 07:08:53.038 [25941] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:08:54.664 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 07:08:54.664 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423781,ok=423781,error=0, records=41
[INFO ] 2026-06-01 07:09:05.331 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:09:08.042 [25946] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:09:09.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 07:09:09.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423782,ok=423782,error=0, records=41
[INFO ] 2026-06-01 07:09:20.332 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:09:23.047 [25963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:09:24.676 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 07:09:24.676 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423783,ok=423783,error=0, records=41
[INFO ] 2026-06-01 07:09:35.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:09:38.053 [25980] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:09:39.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 07:09:39.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423784,ok=423784,error=0, records=41
[INFO ] 2026-06-01 07:09:45.318 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17653/300s
[INFO ] 2026-06-01 07:09:45.319 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869788},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:09:45.475 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:09:45.475 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 07:09:45.475 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:09:45.475 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:09:45.475 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:09:45.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:09:45.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:09:45.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:09:45.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:09:45.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:09:45.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:09:50.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:09:52.558 [25997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:09:54.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 07:09:54.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423785,ok=423785,error=0, records=41
[INFO ] 2026-06-01 07:09:56.558 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21195/300s
[INFO ] 2026-06-01 07:10:00.815 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21204/300s
[INFO ] 2026-06-01 07:10:05.334 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:10:07.563 [25997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:10:09.694 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-01 07:10:09.694 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423786,ok=423786,error=0, records=41
[INFO ] 2026-06-01 07:10:09.694 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21191/300s
[INFO ] 2026-06-01 07:10:20.334 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:10:22.568 [26031] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:10:24.713 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-01 07:10:24.713 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423787,ok=423787,error=0, records=41
[INFO ] 2026-06-01 07:10:35.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:10:37.572 [26056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:10:39.718 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-01 07:10:39.718 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423788,ok=423788,error=0, records=41
[INFO ] 2026-06-01 07:10:41.038 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21204/300s
[INFO ] 2026-06-01 07:10:50.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:10:52.576 [26056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:10:54.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 07:10:54.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423789,ok=423789,error=0, records=41
[INFO ] 2026-06-01 07:11:03.468 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21191/300s
[INFO ] 2026-06-01 07:11:05.336 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:11:07.581 [26076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:11:09.729 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 07:11:09.729 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423790,ok=423790,error=0, records=41
[INFO ] 2026-06-01 07:11:20.336 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:11:22.588 [26104] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:11:24.734 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 07:11:24.734 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423791,ok=423791,error=0, records=41
[INFO ] 2026-06-01 07:11:32.927 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21200/300s
[INFO ] 2026-06-01 07:11:35.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:11:37.592 [26087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:11:39.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 07:11:39.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423792,ok=423792,error=0, records=41
[INFO ] 2026-06-01 07:11:50.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:11:52.596 [26142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:11:54.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 07:11:54.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423793,ok=423793,error=0, records=41
[INFO ] 2026-06-01 07:12:05.338 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:12:05.338 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21203/300s
[WARN ] 2026-06-01 07:12:07.601 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:12:09.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 07:12:09.748 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423794,ok=423794,error=0, records=41
[INFO ] 2026-06-01 07:12:20.338 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:12:22.607 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:12:24.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 07:12:24.755 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423795,ok=423795,error=0, records=41
[INFO ] 2026-06-01 07:12:35.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:12:37.612 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:12:39.621 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21201/300s
[INFO ] 2026-06-01 07:12:39.763 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 07:12:39.763 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423796,ok=423796,error=0, records=41
[INFO ] 2026-06-01 07:12:41.523 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21201/300s
[INFO ] 2026-06-01 07:12:45.476 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869708},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:12:45.634 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:12:45.634 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 07:12:45.634 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:12:45.635 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:12:45.635 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:12:45.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:12:45.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:12:45.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:12:45.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:12:45.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:12:45.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:12:49.225 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21201/300s
[INFO ] 2026-06-01 07:12:50.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:12:52.617 [26156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:12:54.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 07:12:54.767 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423797,ok=423797,error=0, records=41
[INFO ] 2026-06-01 07:13:05.340 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:13:07.624 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:13:09.772 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 07:13:09.772 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423798,ok=423798,error=0, records=41
[INFO ] 2026-06-01 07:13:20.340 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:13:22.629 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:13:24.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 07:13:24.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423799,ok=423799,error=0, records=41
[INFO ] 2026-06-01 07:13:35.341 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 07:13:35.341 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 07:13:37.634 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:13:39.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 07:13:39.789 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423800,ok=423800,error=0, records=41
[INFO ] 2026-06-01 07:13:50.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:13:52.639 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:13:54.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 07:13:54.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423801,ok=423801,error=0, records=41
[INFO ] 2026-06-01 07:14:05.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:14:07.645 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:14:09.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 07:14:09.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423802,ok=423802,error=0, records=41
[INFO ] 2026-06-01 07:14:20.343 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:14:22.650 [26142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:14:24.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 07:14:24.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423803,ok=423803,error=0, records=41
[INFO ] 2026-06-01 07:14:35.343 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:14:37.655 [26142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:14:39.817 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 07:14:39.817 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423804,ok=423804,error=0, records=41
[INFO ] 2026-06-01 07:14:50.344 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:14:52.660 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:14:54.822 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 07:14:54.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423805,ok=423805,error=0, records=41
[INFO ] 2026-06-01 07:14:56.661 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21196/300s
[INFO ] 2026-06-01 07:15:00.818 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21205/300s
[INFO ] 2026-06-01 07:15:05.344 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:15:07.665 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:15:09.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 07:15:09.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423806,ok=423806,error=0, records=41
[INFO ] 2026-06-01 07:15:09.828 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21192/300s
[INFO ] 2026-06-01 07:15:20.345 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:15:22.669 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:15:24.833 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 07:15:24.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423807,ok=423807,error=0, records=41
[INFO ] 2026-06-01 07:15:35.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:15:37.674 [26142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:15:39.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 07:15:39.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423808,ok=423808,error=0, records=41
[INFO ] 2026-06-01 07:15:41.044 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21205/300s
[INFO ] 2026-06-01 07:15:45.635 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17654/300s
[INFO ] 2026-06-01 07:15:45.636 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869632},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:15:45.787 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:15:45.787 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 07:15:45.788 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:15:45.788 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:15:45.788 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:15:45.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:15:45.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:15:45.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:15:45.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:15:45.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:15:45.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:15:50.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:15:52.680 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:15:54.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 07:15:54.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423809,ok=423809,error=0, records=41
[INFO ] 2026-06-01 07:16:03.641 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21192/300s
[INFO ] 2026-06-01 07:16:05.347 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:16:07.685 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:16:09.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10380, records=41
[INFO ] 2026-06-01 07:16:09.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423810,ok=423810,error=0, records=41
[INFO ] 2026-06-01 07:16:20.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:16:22.688 [26087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:16:24.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-01 07:16:24.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423811,ok=423811,error=0, records=41
[INFO ] 2026-06-01 07:16:32.975 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21201/300s
[INFO ] 2026-06-01 07:16:35.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:16:37.693 [26156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:16:39.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10387, records=41
[INFO ] 2026-06-01 07:16:39.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423812,ok=423812,error=0, records=41
[INFO ] 2026-06-01 07:16:50.349 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:16:52.698 [26156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:16:54.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 07:16:54.936 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423813,ok=423813,error=0, records=41
[INFO ] 2026-06-01 07:17:05.349 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:17:05.349 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21204/300s
[WARN ] 2026-06-01 07:17:07.704 [26156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:17:09.941 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 07:17:09.941 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423814,ok=423814,error=0, records=41
[INFO ] 2026-06-01 07:17:20.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:17:22.709 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:17:24.946 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 07:17:24.946 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423815,ok=423815,error=0, records=41
[INFO ] 2026-06-01 07:17:35.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:17:37.715 [26142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:17:39.639 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21202/300s
[INFO ] 2026-06-01 07:17:39.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 07:17:39.950 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423816,ok=423816,error=0, records=41
[INFO ] 2026-06-01 07:17:41.541 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21202/300s
[INFO ] 2026-06-01 07:17:49.244 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21202/300s
[INFO ] 2026-06-01 07:17:50.351 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:17:52.720 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:17:54.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 07:17:54.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423817,ok=423817,error=0, records=41
[INFO ] 2026-06-01 07:18:05.351 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:18:07.726 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:18:09.962 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 07:18:09.962 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423818,ok=423818,error=0, records=41
[INFO ] 2026-06-01 07:18:20.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:18:22.732 [26087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:18:24.967 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 07:18:24.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423819,ok=423819,error=0, records=41
[INFO ] 2026-06-01 07:18:35.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:18:37.738 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:18:39.971 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-01 07:18:39.971 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423820,ok=423820,error=0, records=41
[INFO ] 2026-06-01 07:18:45.789 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869548},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:18:45.934 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:18:45.935 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 07:18:45.935 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:18:45.935 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:18:45.935 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:18:45.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:18:45.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:18:45.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:18:45.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:18:45.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:18:45.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:18:50.353 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:18:52.743 [26087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:18:54.976 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-01 07:18:54.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423821,ok=423821,error=0, records=41
[INFO ] 2026-06-01 07:19:05.353 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:19:07.748 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:19:09.983 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 07:19:09.983 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423822,ok=423822,error=0, records=41
[INFO ] 2026-06-01 07:19:20.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:19:22.753 [26156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:19:24.988 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 07:19:24.989 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423823,ok=423823,error=0, records=41
[INFO ] 2026-06-01 07:19:35.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:19:37.759 [26142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:19:39.997 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 07:19:39.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423824,ok=423824,error=0, records=41
[INFO ] 2026-06-01 07:19:50.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:19:52.766 [26156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:19:55.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-01 07:19:55.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423825,ok=423825,error=0, records=41
[INFO ] 2026-06-01 07:19:56.767 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21197/300s
[INFO ] 2026-06-01 07:20:00.821 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21206/300s
[INFO ] 2026-06-01 07:20:05.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:20:07.771 [26156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:20:10.006 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 07:20:10.006 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423826,ok=423826,error=0, records=41
[INFO ] 2026-06-01 07:20:10.006 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21193/300s
[INFO ] 2026-06-01 07:20:20.356 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:20:22.775 [26156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:20:25.011 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 07:20:25.011 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423827,ok=423827,error=0, records=41
[WARN ] 2026-06-01 07:20:32.780 [26156] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/25540/stat), No such file or directory
[WARN ] 2026-06-01 07:20:32.780 [26156] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/25591/stat), No such file or directory
[INFO ] 2026-06-01 07:20:35.356 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:20:37.780 [26087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:20:40.026 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 07:20:40.026 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423828,ok=423828,error=0, records=41
[INFO ] 2026-06-01 07:20:41.049 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21206/300s
[WARN ] 2026-06-01 07:20:47.785 [26156] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/25540/stat), No such file or directory
[WARN ] 2026-06-01 07:20:47.785 [26156] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/25591/stat), No such file or directory
[WARN ] 2026-06-01 07:20:47.785 [26156] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/25441/stat), No such file or directory
[INFO ] 2026-06-01 07:20:50.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:20:52.786 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:20:55.031 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 07:20:55.031 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423829,ok=423829,error=0, records=41
[INFO ] 2026-06-01 07:21:03.815 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21193/300s
[INFO ] 2026-06-01 07:21:05.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:21:07.791 [26087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:21:10.037 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 07:21:10.037 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423830,ok=423830,error=0, records=41
[INFO ] 2026-06-01 07:21:20.358 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:21:22.795 [26137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:21:25.042 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 07:21:25.042 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423831,ok=423831,error=0, records=41
[INFO ] 2026-06-01 07:21:33.020 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21202/300s
[INFO ] 2026-06-01 07:21:35.358 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:21:37.801 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:21:40.118 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-01 07:21:40.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423832,ok=423832,error=0, records=41
[INFO ] 2026-06-01 07:21:45.935 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17655/300s
[INFO ] 2026-06-01 07:21:45.936 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869452},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:21:46.100 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:21:46.100 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 07:21:46.101 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:21:46.101 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:21:46.101 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:21:46.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:21:46.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:21:46.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:21:46.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:21:46.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:21:46.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:21:50.359 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:21:52.806 [26705] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:21:55.123 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 07:21:55.123 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423833,ok=423833,error=0, records=41
[INFO ] 2026-06-01 07:22:05.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:22:05.360 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21205/300s
[WARN ] 2026-06-01 07:22:07.812 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:22:10.128 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10417, records=41
[INFO ] 2026-06-01 07:22:10.128 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423834,ok=423834,error=0, records=41
[INFO ] 2026-06-01 07:22:20.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:22:22.817 [26738] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:22:25.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 07:22:25.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423835,ok=423835,error=0, records=41
[INFO ] 2026-06-01 07:22:35.361 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:22:37.823 [26738] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:22:39.666 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21203/300s
[INFO ] 2026-06-01 07:22:40.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10405, records=41
[INFO ] 2026-06-01 07:22:40.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423836,ok=423836,error=0, records=41
[INFO ] 2026-06-01 07:22:41.568 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21203/300s
[INFO ] 2026-06-01 07:22:49.274 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21203/300s
[INFO ] 2026-06-01 07:22:50.361 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:22:52.829 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:22:55.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10383, records=41
[INFO ] 2026-06-01 07:22:55.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423837,ok=423837,error=0, records=41
[INFO ] 2026-06-01 07:23:05.362 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:23:07.834 [26738] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:23:10.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 07:23:10.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423838,ok=423838,error=0, records=41
[INFO ] 2026-06-01 07:23:20.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:23:22.839 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:23:25.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 07:23:25.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423839,ok=423839,error=0, records=41
[INFO ] 2026-06-01 07:23:35.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 07:23:35.363 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 07:23:37.844 [26766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:23:40.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 07:23:40.232 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423840,ok=423840,error=0, records=41
[INFO ] 2026-06-01 07:23:50.364 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:23:50.364 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 07:23:52.849 [26119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:23:55.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 07:23:55.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423841,ok=423841,error=0, records=41
[INFO ] 2026-06-01 07:24:05.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:24:07.854 [26830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:24:10.241 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 07:24:10.241 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423842,ok=423842,error=0, records=41
[INFO ] 2026-06-01 07:24:20.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:24:22.858 [26830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:24:25.247 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 07:24:25.247 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423843,ok=423843,error=0, records=41
[INFO ] 2026-06-01 07:24:35.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:24:37.864 [26766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:24:40.252 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 07:24:40.252 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423844,ok=423844,error=0, records=41
[INFO ] 2026-06-01 07:24:46.102 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869372},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:24:46.281 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:24:46.282 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 07:24:46.282 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:24:46.282 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:24:46.282 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:24:46.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:24:46.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:24:46.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:24:46.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:24:46.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:24:46.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:24:50.367 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:24:52.869 [26858] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:24:55.257 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 07:24:55.257 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423845,ok=423845,error=0, records=41
[INFO ] 2026-06-01 07:24:56.870 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21198/300s
[INFO ] 2026-06-01 07:25:00.824 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21207/300s
[INFO ] 2026-06-01 07:25:05.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:25:07.874 [26766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:25:10.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 07:25:10.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423846,ok=423846,error=0, records=41
[INFO ] 2026-06-01 07:25:10.265 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21194/300s
[INFO ] 2026-06-01 07:25:20.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:25:22.879 [26858] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:25:25.271 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 07:25:25.271 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423847,ok=423847,error=0, records=41
[INFO ] 2026-06-01 07:25:35.369 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:25:37.883 [26921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:25:40.276 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 07:25:40.276 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423848,ok=423848,error=0, records=41
[INFO ] 2026-06-01 07:25:41.056 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21207/300s
[INFO ] 2026-06-01 07:25:50.369 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:25:52.888 [26938] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:25:55.282 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 07:25:55.282 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423849,ok=423849,error=0, records=41
[INFO ] 2026-06-01 07:26:03.997 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21194/300s
[INFO ] 2026-06-01 07:26:05.370 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:26:07.892 [26960] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:26:10.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 07:26:10.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423850,ok=423850,error=0, records=41
[INFO ] 2026-06-01 07:26:20.370 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:26:22.897 [26938] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:26:25.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 07:26:25.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423851,ok=423851,error=0, records=41
[INFO ] 2026-06-01 07:26:33.074 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21203/300s
[INFO ] 2026-06-01 07:26:35.371 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:26:37.902 [26987] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:26:40.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 07:26:40.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423852,ok=423852,error=0, records=41
[INFO ] 2026-06-01 07:26:50.372 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:26:52.909 [26988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:26:55.308 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 07:26:55.308 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423853,ok=423853,error=0, records=41
[INFO ] 2026-06-01 07:27:05.372 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:27:05.372 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21206/300s
[WARN ] 2026-06-01 07:27:07.915 [27025] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:27:10.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 07:27:10.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423854,ok=423854,error=0, records=41
[INFO ] 2026-06-01 07:27:20.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:27:22.922 [27019] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:27:25.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 07:27:25.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423855,ok=423855,error=0, records=41
[INFO ] 2026-06-01 07:27:35.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:27:37.927 [27037] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:27:39.715 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21204/300s
[INFO ] 2026-06-01 07:27:40.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 07:27:40.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423856,ok=423856,error=0, records=41
[INFO ] 2026-06-01 07:27:41.616 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21204/300s
[INFO ] 2026-06-01 07:27:46.282 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17656/300s
[INFO ] 2026-06-01 07:27:46.283 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869296},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:27:46.453 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:27:46.454 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 07:27:46.454 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:27:46.454 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:27:46.454 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:27:46.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:27:46.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:27:46.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:27:46.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:27:46.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:27:46.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:27:49.323 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21204/300s
[INFO ] 2026-06-01 07:27:50.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:27:52.933 [27064] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:27:55.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 07:27:55.337 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423857,ok=423857,error=0, records=41
[INFO ] 2026-06-01 07:28:05.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:28:07.938 [27064] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:28:10.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 07:28:10.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423858,ok=423858,error=0, records=41
[INFO ] 2026-06-01 07:28:20.375 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:28:22.943 [27109] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:28:25.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 07:28:25.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423859,ok=423859,error=0, records=41
[INFO ] 2026-06-01 07:28:35.376 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:28:37.947 [27121] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:28:40.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 07:28:40.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423860,ok=423860,error=0, records=41
[INFO ] 2026-06-01 07:28:50.376 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:28:52.953 [27103] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:28:55.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 07:28:55.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423861,ok=423861,error=0, records=41
[INFO ] 2026-06-01 07:29:05.377 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:29:07.958 [27150] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:29:10.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-01 07:29:10.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423862,ok=423862,error=0, records=41
[INFO ] 2026-06-01 07:29:20.377 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:29:22.963 [27164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:29:25.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 07:29:25.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423863,ok=423863,error=0, records=41
[INFO ] 2026-06-01 07:29:35.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:29:37.967 [27150] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:29:40.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 07:29:40.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423864,ok=423864,error=0, records=41
[INFO ] 2026-06-01 07:29:50.379 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:29:52.972 [27136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:29:55.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 07:29:55.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423865,ok=423865,error=0, records=41
[INFO ] 2026-06-01 07:29:56.973 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21199/300s
[INFO ] 2026-06-01 07:30:00.827 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21208/300s
[INFO ] 2026-06-01 07:30:05.379 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:30:07.977 [27103] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:30:10.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 07:30:10.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423866,ok=423866,error=0, records=41
[INFO ] 2026-06-01 07:30:10.385 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21195/300s
[INFO ] 2026-06-01 07:30:20.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:30:22.983 [27136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:30:25.391 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 07:30:25.391 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423867,ok=423867,error=0, records=41
[INFO ] 2026-06-01 07:30:35.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:30:37.988 [27164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:30:40.397 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 07:30:40.397 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423868,ok=423868,error=0, records=41
[INFO ] 2026-06-01 07:30:41.062 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21208/300s
[INFO ] 2026-06-01 07:30:46.455 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869208},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:30:46.631 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:30:46.631 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 07:30:46.631 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:30:46.631 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:30:46.631 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:30:46.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:30:46.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:30:46.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:30:46.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:30:46.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:30:46.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:30:50.381 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:30:52.993 [27238] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:30:55.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 07:30:55.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423869,ok=423869,error=0, records=41
[INFO ] 2026-06-01 07:31:04.177 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21195/300s
[INFO ] 2026-06-01 07:31:05.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:31:07.999 [27254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:31:10.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 07:31:10.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423870,ok=423870,error=0, records=41
[INFO ] 2026-06-01 07:31:20.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:31:23.004 [27164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:31:25.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 07:31:25.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423871,ok=423871,error=0, records=41
[INFO ] 2026-06-01 07:31:33.126 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21204/300s
[INFO ] 2026-06-01 07:31:35.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:31:38.008 [27254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:31:40.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 07:31:40.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423872,ok=423872,error=0, records=41
[INFO ] 2026-06-01 07:31:50.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:31:53.013 [27254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:31:55.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 07:31:55.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423873,ok=423873,error=0, records=41
[INFO ] 2026-06-01 07:32:05.385 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:32:05.385 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21207/300s
[WARN ] 2026-06-01 07:32:08.019 [27224] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:32:10.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 07:32:10.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423874,ok=423874,error=0, records=41
[INFO ] 2026-06-01 07:32:20.385 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:32:23.023 [27296] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:32:25.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 07:32:25.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423875,ok=423875,error=0, records=41
[INFO ] 2026-06-01 07:32:35.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:32:38.028 [27352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:32:39.760 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21205/300s
[INFO ] 2026-06-01 07:32:40.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 07:32:40.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423876,ok=423876,error=0, records=41
[INFO ] 2026-06-01 07:32:41.661 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21205/300s
[INFO ] 2026-06-01 07:32:49.365 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21205/300s
[INFO ] 2026-06-01 07:32:50.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:32:53.032 [27296] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:32:55.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 07:32:55.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423877,ok=423877,error=0, records=41
[INFO ] 2026-06-01 07:33:05.387 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:33:08.037 [27388] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:33:10.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 07:33:10.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423878,ok=423878,error=0, records=41
[INFO ] 2026-06-01 07:33:20.387 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:33:23.041 [27224] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:33:25.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 07:33:25.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423879,ok=423879,error=0, records=41
[INFO ] 2026-06-01 07:33:35.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 07:33:35.388 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 07:33:38.047 [27224] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:33:40.491 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 07:33:40.491 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423880,ok=423880,error=0, records=41
[INFO ] 2026-06-01 07:33:46.632 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17657/300s
[INFO ] 2026-06-01 07:33:46.633 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869132},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:33:46.814 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:33:46.814 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 07:33:46.814 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:33:46.814 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:33:46.814 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:33:46.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:33:46.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:33:46.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:33:46.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:33:46.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:33:46.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:33:50.389 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:33:53.053 [27430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:33:55.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 07:33:55.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423881,ok=423881,error=0, records=41
[INFO ] 2026-06-01 07:34:05.389 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:34:07.558 [27430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:34:10.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-01 07:34:10.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423882,ok=423882,error=0, records=41
[INFO ] 2026-06-01 07:34:20.390 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:34:22.562 [27458] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:34:25.505 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 07:34:25.505 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423883,ok=423883,error=0, records=41
[INFO ] 2026-06-01 07:34:35.390 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:34:37.568 [27551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:34:40.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 07:34:40.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423884,ok=423884,error=0, records=41
[WARN ] 2026-06-01 07:34:47.571 [27540] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/25584/stat), No such file or directory
[INFO ] 2026-06-01 07:34:50.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:34:52.572 [27567] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:34:55.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10402, records=41
[INFO ] 2026-06-01 07:34:55.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423885,ok=423885,error=0, records=41
[INFO ] 2026-06-01 07:34:57.073 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21200/300s
[INFO ] 2026-06-01 07:35:00.830 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21209/300s
[INFO ] 2026-06-01 07:35:05.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:35:07.576 [27573] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:35:10.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 07:35:10.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423886,ok=423886,error=0, records=41
[INFO ] 2026-06-01 07:35:10.520 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21196/300s
[WARN ] 2026-06-01 07:35:17.579 [27573] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/26582/stat), No such file or directory
[INFO ] 2026-06-01 07:35:20.392 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:35:22.580 [27573] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:35:25.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 07:35:25.526 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423887,ok=423887,error=0, records=41
[WARN ] 2026-06-01 07:35:32.584 [27607] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/26582/stat), No such file or directory
[INFO ] 2026-06-01 07:35:35.392 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:35:37.586 [27623] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:35:40.531 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 07:35:40.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423888,ok=423888,error=0, records=41
[INFO ] 2026-06-01 07:35:41.067 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21209/300s
[WARN ] 2026-06-01 07:35:47.590 [27630] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/26582/stat), No such file or directory
[INFO ] 2026-06-01 07:35:50.393 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:35:52.592 [27630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:35:55.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 07:35:55.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423889,ok=423889,error=0, records=41
[INFO ] 2026-06-01 07:36:04.354 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21196/300s
[INFO ] 2026-06-01 07:36:05.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:36:07.598 [27567] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:36:10.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 07:36:10.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423890,ok=423890,error=0, records=41
[INFO ] 2026-06-01 07:36:20.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:36:22.603 [27630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:36:25.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 07:36:25.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423891,ok=423891,error=0, records=41
[INFO ] 2026-06-01 07:36:33.174 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21205/300s
[INFO ] 2026-06-01 07:36:35.395 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:36:37.608 [27567] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:36:40.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 07:36:40.552 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423892,ok=423892,error=0, records=41
[INFO ] 2026-06-01 07:36:46.816 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20869032},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:36:46.991 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:36:46.991 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 07:36:46.991 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:36:46.991 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:36:46.991 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:36:47.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:36:47.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:36:47.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:36:47.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:36:47.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:36:47.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:36:50.395 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:36:52.614 [27567] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:36:55.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 07:36:55.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423893,ok=423893,error=0, records=41
[INFO ] 2026-06-01 07:37:05.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:37:05.396 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21208/300s
[WARN ] 2026-06-01 07:37:07.619 [27650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:37:10.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 07:37:10.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423894,ok=423894,error=0, records=41
[INFO ] 2026-06-01 07:37:20.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:37:22.626 [27630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:37:25.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 07:37:25.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423895,ok=423895,error=0, records=41
[INFO ] 2026-06-01 07:37:35.397 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:37:37.630 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:37:39.783 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21206/300s
[INFO ] 2026-06-01 07:37:40.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 07:37:40.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423896,ok=423896,error=0, records=41
[INFO ] 2026-06-01 07:37:41.684 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21206/300s
[INFO ] 2026-06-01 07:37:49.390 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21206/300s
[INFO ] 2026-06-01 07:37:50.398 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:37:52.636 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:37:55.592 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 07:37:55.592 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423897,ok=423897,error=0, records=41
[INFO ] 2026-06-01 07:38:05.398 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:38:07.641 [27650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:38:10.598 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 07:38:10.598 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423898,ok=423898,error=0, records=41
[INFO ] 2026-06-01 07:38:20.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:38:22.646 [27650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:38:25.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 07:38:25.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423899,ok=423899,error=0, records=41
[INFO ] 2026-06-01 07:38:35.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:38:37.652 [27650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:38:40.614 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-01 07:38:40.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423900,ok=423900,error=0, records=41
[INFO ] 2026-06-01 07:38:50.400 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:38:50.400 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 07:38:52.657 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:38:55.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 07:38:55.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423901,ok=423901,error=0, records=41
[INFO ] 2026-06-01 07:39:05.401 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:39:07.662 [27650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:39:10.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 07:39:10.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423902,ok=423902,error=0, records=41
[INFO ] 2026-06-01 07:39:20.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:39:22.668 [27630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:39:25.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 07:39:25.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423903,ok=423903,error=0, records=41
[INFO ] 2026-06-01 07:39:35.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:39:37.676 [27567] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:39:40.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 07:39:40.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423904,ok=423904,error=0, records=41
[INFO ] 2026-06-01 07:39:46.991 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17658/300s
[INFO ] 2026-06-01 07:39:46.992 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868952},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:39:47.167 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:39:47.167 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 07:39:47.167 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:39:47.167 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:39:47.167 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:39:47.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:39:47.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:39:47.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:39:47.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:39:47.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:39:47.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:39:50.403 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:39:52.682 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:39:55.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 07:39:55.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423905,ok=423905,error=0, records=41
[INFO ] 2026-06-01 07:39:57.183 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21201/300s
[INFO ] 2026-06-01 07:40:00.833 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21210/300s
[INFO ] 2026-06-01 07:40:05.403 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:40:07.687 [27630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:40:10.652 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 07:40:10.652 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423906,ok=423906,error=0, records=41
[INFO ] 2026-06-01 07:40:10.652 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21197/300s
[INFO ] 2026-06-01 07:40:20.404 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:40:22.692 [27655] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:40:25.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 07:40:25.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423907,ok=423907,error=0, records=41
[INFO ] 2026-06-01 07:40:35.404 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:40:37.697 [27567] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:40:40.666 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 07:40:40.666 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423908,ok=423908,error=0, records=41
[INFO ] 2026-06-01 07:40:41.073 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21210/300s
[INFO ] 2026-06-01 07:40:50.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:40:52.702 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:40:55.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 07:40:55.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423909,ok=423909,error=0, records=41
[INFO ] 2026-06-01 07:41:04.531 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21197/300s
[INFO ] 2026-06-01 07:41:05.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:41:07.707 [27655] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:41:10.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 07:41:10.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423910,ok=423910,error=0, records=41
[INFO ] 2026-06-01 07:41:20.406 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:41:22.712 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:41:25.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 07:41:25.691 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423911,ok=423911,error=0, records=41
[INFO ] 2026-06-01 07:41:33.221 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21206/300s
[INFO ] 2026-06-01 07:41:35.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:41:37.717 [27655] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:41:40.697 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 07:41:40.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423912,ok=423912,error=0, records=41
[INFO ] 2026-06-01 07:41:50.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:41:52.722 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:41:55.703 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 07:41:55.703 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423913,ok=423913,error=0, records=41
[INFO ] 2026-06-01 07:42:05.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:42:05.408 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21209/300s
[WARN ] 2026-06-01 07:42:07.728 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:42:10.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-01 07:42:10.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423914,ok=423914,error=0, records=41
[INFO ] 2026-06-01 07:42:20.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:42:22.733 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:42:25.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 07:42:25.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423915,ok=423915,error=0, records=41
[WARN ] 2026-06-01 07:42:32.737 [27655] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/26628/stat), No such file or directory
[INFO ] 2026-06-01 07:42:35.409 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:42:37.737 [27655] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:42:39.812 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21207/300s
[INFO ] 2026-06-01 07:42:40.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 07:42:40.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423916,ok=423916,error=0, records=41
[INFO ] 2026-06-01 07:42:41.714 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21207/300s
[INFO ] 2026-06-01 07:42:47.169 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868868},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:42:47.340 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:42:47.340 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 07:42:47.340 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:42:47.340 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:42:47.340 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:42:47.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:42:47.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:42:47.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:42:47.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:42:47.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:42:47.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 07:42:47.742 [27567] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/26628/stat), No such file or directory
[INFO ] 2026-06-01 07:42:49.419 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21207/300s
[INFO ] 2026-06-01 07:42:50.409 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:42:52.742 [27630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:42:55.729 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 07:42:55.729 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423917,ok=423917,error=0, records=41
[INFO ] 2026-06-01 07:43:05.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:43:07.748 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:43:10.735 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 07:43:10.735 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423918,ok=423918,error=0, records=41
[INFO ] 2026-06-01 07:43:20.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:43:22.753 [27650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:43:25.741 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 07:43:25.741 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423919,ok=423919,error=0, records=41
[INFO ] 2026-06-01 07:43:35.411 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 07:43:35.411 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 07:43:37.758 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:43:40.746 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 07:43:40.746 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423920,ok=423920,error=0, records=41
[WARN ] 2026-06-01 07:43:47.762 [27655] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/27468/stat), No such file or directory
[INFO ] 2026-06-01 07:43:50.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:43:52.763 [27650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:43:55.751 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 07:43:55.751 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423921,ok=423921,error=0, records=41
[INFO ] 2026-06-01 07:44:05.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:44:07.768 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:44:10.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 07:44:10.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423922,ok=423922,error=0, records=41
[INFO ] 2026-06-01 07:44:20.413 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:44:22.773 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:44:25.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 07:44:25.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423923,ok=423923,error=0, records=41
[INFO ] 2026-06-01 07:44:35.413 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:44:37.778 [27630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:44:40.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 07:44:40.767 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423924,ok=423924,error=0, records=41
[INFO ] 2026-06-01 07:44:50.414 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:44:52.783 [27655] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:44:55.779 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 07:44:55.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423925,ok=423925,error=0, records=41
[INFO ] 2026-06-01 07:44:57.284 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21202/300s
[INFO ] 2026-06-01 07:45:00.836 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21211/300s
[INFO ] 2026-06-01 07:45:05.414 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:45:07.788 [27636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:45:10.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 07:45:10.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423926,ok=423926,error=0, records=41
[INFO ] 2026-06-01 07:45:10.784 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21198/300s
[INFO ] 2026-06-01 07:45:20.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:45:22.794 [27630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:45:25.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 07:45:25.789 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423927,ok=423927,error=0, records=41
[INFO ] 2026-06-01 07:45:35.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:45:37.799 [27655] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:45:40.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 07:45:40.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423928,ok=423928,error=0, records=41
[INFO ] 2026-06-01 07:45:41.078 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21211/300s
[INFO ] 2026-06-01 07:45:47.340 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17659/300s
[INFO ] 2026-06-01 07:45:47.342 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868756},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:45:47.505 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:45:47.505 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 07:45:47.505 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:45:47.505 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:45:47.505 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:45:47.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:45:47.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:45:47.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:45:47.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:45:47.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:45:47.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:45:50.416 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:45:52.805 [27650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:45:55.893 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10131, records=41
[INFO ] 2026-06-01 07:45:55.893 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423929,ok=423929,error=0, records=41
[INFO ] 2026-06-01 07:46:04.707 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21198/300s
[INFO ] 2026-06-01 07:46:05.417 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:46:07.810 [27567] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:46:10.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-01 07:46:10.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423930,ok=423930,error=0, records=41
[INFO ] 2026-06-01 07:46:20.417 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:46:22.816 [28233] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:46:25.903 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 07:46:25.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423931,ok=423931,error=0, records=41
[INFO ] 2026-06-01 07:46:33.269 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21207/300s
[INFO ] 2026-06-01 07:46:35.418 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:46:37.822 [27567] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:46:40.910 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 07:46:40.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423932,ok=423932,error=0, records=41
[INFO ] 2026-06-01 07:46:50.418 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:46:52.828 [28248] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:46:55.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 07:46:55.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423933,ok=423933,error=0, records=41
[INFO ] 2026-06-01 07:47:05.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:47:05.419 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21210/300s
[WARN ] 2026-06-01 07:47:07.834 [28238] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:47:10.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 07:47:10.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423934,ok=423934,error=0, records=41
[INFO ] 2026-06-01 07:47:20.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=28.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:47:22.839 [28238] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:47:26.007 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 07:47:26.007 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423935,ok=423935,error=0, records=41
[INFO ] 2026-06-01 07:47:35.420 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:47:37.844 [28316] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:47:39.822 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21208/300s
[INFO ] 2026-06-01 07:47:41.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 07:47:41.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423936,ok=423936,error=0, records=41
[INFO ] 2026-06-01 07:47:41.724 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21208/300s
[INFO ] 2026-06-01 07:47:49.431 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21208/300s
[INFO ] 2026-06-01 07:47:50.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:47:52.849 [28248] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:47:56.021 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 07:47:56.021 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423937,ok=423937,error=0, records=41
[INFO ] 2026-06-01 07:48:05.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:48:07.854 [28280] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:48:11.027 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 07:48:11.027 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423938,ok=423938,error=0, records=41
[INFO ] 2026-06-01 07:48:20.422 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:48:22.859 [28233] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:48:26.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-01 07:48:26.032 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423939,ok=423939,error=0, records=41
[INFO ] 2026-06-01 07:48:35.422 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:48:37.865 [28358] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:48:41.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 07:48:41.036 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423940,ok=423940,error=0, records=41
[INFO ] 2026-06-01 07:48:47.506 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868680},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:48:47.660 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:48:47.660 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 07:48:47.661 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:48:47.661 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:48:47.661 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:48:47.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:48:47.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:48:47.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:48:47.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:48:47.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:48:47.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:48:50.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:48:52.870 [28344] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:48:56.042 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10391, records=41
[INFO ] 2026-06-01 07:48:56.042 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423941,ok=423941,error=0, records=41
[INFO ] 2026-06-01 07:49:05.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:49:07.875 [28401] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:49:11.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 07:49:11.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423942,ok=423942,error=0, records=41
[INFO ] 2026-06-01 07:49:20.424 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:49:22.881 [28423] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:49:26.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 07:49:26.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423943,ok=423943,error=0, records=41
[INFO ] 2026-06-01 07:49:35.424 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:49:37.887 [28412] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:49:41.087 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-01 07:49:41.087 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423944,ok=423944,error=0, records=41
[INFO ] 2026-06-01 07:49:50.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:49:52.892 [28449] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:49:56.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 07:49:56.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423945,ok=423945,error=0, records=41
[INFO ] 2026-06-01 07:49:57.394 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21203/300s
[INFO ] 2026-06-01 07:50:00.839 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21212/300s
[INFO ] 2026-06-01 07:50:05.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:50:07.898 [28412] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:50:11.100 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 07:50:11.100 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423946,ok=423946,error=0, records=41
[INFO ] 2026-06-01 07:50:11.100 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21199/300s
[INFO ] 2026-06-01 07:50:20.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:50:22.904 [28489] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:50:26.149 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 07:50:26.149 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423947,ok=423947,error=0, records=41
[INFO ] 2026-06-01 07:50:35.427 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:50:37.909 [28499] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:50:41.084 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21212/300s
[INFO ] 2026-06-01 07:50:41.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 07:50:41.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423948,ok=423948,error=0, records=41
[INFO ] 2026-06-01 07:50:50.427 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:50:52.914 [28461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:50:56.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 07:50:56.158 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423949,ok=423949,error=0, records=41
[INFO ] 2026-06-01 07:51:04.889 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21199/300s
[INFO ] 2026-06-01 07:51:05.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:51:07.920 [28477] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:51:11.163 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-01 07:51:11.163 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423950,ok=423950,error=0, records=41
[INFO ] 2026-06-01 07:51:20.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:51:22.926 [28504] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:51:26.168 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 07:51:26.168 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423951,ok=423951,error=0, records=41
[INFO ] 2026-06-01 07:51:33.318 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21208/300s
[INFO ] 2026-06-01 07:51:35.429 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=33.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:51:37.931 [28570] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:51:41.173 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 07:51:41.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423952,ok=423952,error=0, records=41
[INFO ] 2026-06-01 07:51:47.661 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17660/300s
[INFO ] 2026-06-01 07:51:47.662 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868588},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:51:47.822 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:51:47.822 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 07:51:47.822 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:51:47.822 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:51:47.822 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:51:47.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:51:47.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:51:47.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:51:47.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:51:47.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:51:47.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:51:50.429 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=33.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:51:52.937 [28504] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:51:56.178 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 07:51:56.179 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423953,ok=423953,error=0, records=41
[INFO ] 2026-06-01 07:52:05.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=33.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:52:05.430 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21211/300s
[WARN ] 2026-06-01 07:52:07.943 [28538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:52:11.184 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 07:52:11.184 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423954,ok=423954,error=0, records=41
[INFO ] 2026-06-01 07:52:20.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=33.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:52:22.948 [28598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:52:26.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-01 07:52:26.189 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423955,ok=423955,error=0, records=41
[INFO ] 2026-06-01 07:52:35.431 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:52:37.953 [28635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:52:39.848 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21209/300s
[INFO ] 2026-06-01 07:52:41.195 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 07:52:41.195 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423956,ok=423956,error=0, records=41
[INFO ] 2026-06-01 07:52:41.750 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21209/300s
[INFO ] 2026-06-01 07:52:49.453 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21209/300s
[INFO ] 2026-06-01 07:52:50.432 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:52:52.958 [28586] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:52:56.200 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10394, records=41
[INFO ] 2026-06-01 07:52:56.200 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423957,ok=423957,error=0, records=41
[INFO ] 2026-06-01 07:53:05.432 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:53:07.963 [28586] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:53:11.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 07:53:11.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423958,ok=423958,error=0, records=41
[INFO ] 2026-06-01 07:53:20.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:53:22.967 [28664] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:53:26.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 07:53:26.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423959,ok=423959,error=0, records=41
[INFO ] 2026-06-01 07:53:35.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 07:53:35.434 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 07:53:37.973 [28650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:53:41.216 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 07:53:41.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423960,ok=423960,error=0, records=41
[INFO ] 2026-06-01 07:53:50.434 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:53:50.434 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 07:53:52.979 [28650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:53:56.221 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 07:53:56.221 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423961,ok=423961,error=0, records=41
[INFO ] 2026-06-01 07:54:05.436 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:54:07.983 [28720] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:54:11.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 07:54:11.226 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423962,ok=423962,error=0, records=41
[INFO ] 2026-06-01 07:54:20.436 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:54:22.990 [28598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:54:26.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 07:54:26.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423963,ok=423963,error=0, records=41
[INFO ] 2026-06-01 07:54:35.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:54:37.994 [28706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:54:41.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 07:54:41.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423964,ok=423964,error=0, records=41
[INFO ] 2026-06-01 07:54:47.824 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868516},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:54:47.995 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:54:47.995 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 07:54:47.996 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:54:47.996 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:54:47.996 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:54:48.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:54:48.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:54:48.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:54:48.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:54:48.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:54:48.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:54:50.438 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:54:52.998 [28650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:54:56.242 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 07:54:56.242 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423965,ok=423965,error=0, records=41
[INFO ] 2026-06-01 07:54:57.500 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21204/300s
[INFO ] 2026-06-01 07:55:00.842 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21213/300s
[INFO ] 2026-06-01 07:55:05.438 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:55:08.004 [28748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:55:11.247 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 07:55:11.247 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423966,ok=423966,error=0, records=41
[INFO ] 2026-06-01 07:55:11.247 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21200/300s
[INFO ] 2026-06-01 07:55:20.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:55:23.009 [28791] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:55:26.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 07:55:26.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423967,ok=423967,error=0, records=41
[INFO ] 2026-06-01 07:55:35.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:55:38.014 [28805] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:55:41.090 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21213/300s
[INFO ] 2026-06-01 07:55:41.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 07:55:41.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423968,ok=423968,error=0, records=41
[INFO ] 2026-06-01 07:55:50.440 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:55:53.018 [28805] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:55:56.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 07:55:56.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423969,ok=423969,error=0, records=41
[INFO ] 2026-06-01 07:56:05.059 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21200/300s
[INFO ] 2026-06-01 07:56:05.441 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:56:08.022 [28706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:56:11.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 07:56:11.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423970,ok=423970,error=0, records=41
[INFO ] 2026-06-01 07:56:20.441 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:56:23.028 [28734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:56:26.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 07:56:26.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423971,ok=423971,error=0, records=41
[INFO ] 2026-06-01 07:56:33.368 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21209/300s
[INFO ] 2026-06-01 07:56:35.442 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:56:38.033 [28598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:56:41.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 07:56:41.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423972,ok=423972,error=0, records=41
[WARN ] 2026-06-01 07:56:47.538 [28910] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/27507/stat), No such file or directory
[INFO ] 2026-06-01 07:56:50.442 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:56:53.039 [28832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:56:56.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-01 07:56:56.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423973,ok=423973,error=0, records=41
[INFO ] 2026-06-01 07:57:05.443 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 07:57:05.443 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21212/300s
[WARN ] 2026-06-01 07:57:08.044 [28932] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:57:11.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 07:57:11.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423974,ok=423974,error=0, records=41
[WARN ] 2026-06-01 07:57:17.547 [28805] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/27523/stat), No such file or directory
[INFO ] 2026-06-01 07:57:20.443 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:57:23.048 [28932] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:57:26.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 07:57:26.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423975,ok=423975,error=0, records=41
[WARN ] 2026-06-01 07:57:32.551 [28960] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/27523/stat), No such file or directory
[WARN ] 2026-06-01 07:57:32.551 [28960] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/28053/stat), No such file or directory
[WARN ] 2026-06-01 07:57:32.551 [28960] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/28065/stat), No such file or directory
[INFO ] 2026-06-01 07:57:35.444 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:57:38.053 [28966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:57:39.905 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21210/300s
[INFO ] 2026-06-01 07:57:41.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 07:57:41.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423976,ok=423976,error=0, records=41
[INFO ] 2026-06-01 07:57:41.806 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21210/300s
[WARN ] 2026-06-01 07:57:47.556 [28965] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/27523/stat), No such file or directory
[WARN ] 2026-06-01 07:57:47.556 [28965] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/28053/stat), No such file or directory
[WARN ] 2026-06-01 07:57:47.556 [28965] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/28065/stat), No such file or directory
[INFO ] 2026-06-01 07:57:47.996 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17661/300s
[INFO ] 2026-06-01 07:57:47.997 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868408},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 07:57:48.165 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 07:57:48.165 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 07:57:48.166 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 07:57:48.166 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 07:57:48.166 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 07:57:48.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:57:48.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 07:57:48.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:57:48.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 07:57:48.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:57:48.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 07:57:49.491 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21210/300s
[INFO ] 2026-06-01 07:57:50.444 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:57:52.557 [28982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:57:56.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 07:57:56.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423977,ok=423977,error=0, records=41
[INFO ] 2026-06-01 07:58:05.445 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:58:07.561 [28995] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:58:11.312 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 07:58:11.312 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423978,ok=423978,error=0, records=41
[INFO ] 2026-06-01 07:58:20.445 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:58:22.565 [28989] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:58:26.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 07:58:26.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423979,ok=423979,error=0, records=41
[INFO ] 2026-06-01 07:58:35.446 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:58:37.569 [29042] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:58:41.323 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 07:58:41.324 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423980,ok=423980,error=0, records=41
[INFO ] 2026-06-01 07:58:50.446 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:58:52.574 [29037] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:58:56.329 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 07:58:56.329 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423981,ok=423981,error=0, records=41
[INFO ] 2026-06-01 07:59:05.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:59:07.578 [29054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:59:11.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 07:59:11.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423982,ok=423982,error=0, records=41
[INFO ] 2026-06-01 07:59:20.448 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:59:22.584 [29093] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:59:26.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 07:59:26.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423983,ok=423983,error=0, records=41
[INFO ] 2026-06-01 07:59:35.448 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:59:37.588 [29117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:59:41.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 07:59:41.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423984,ok=423984,error=0, records=41
[INFO ] 2026-06-01 07:59:50.449 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 07:59:52.594 [29128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 07:59:56.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 07:59:56.355 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423985,ok=423985,error=0, records=41
[INFO ] 2026-06-01 07:59:57.596 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21205/300s
[INFO ] 2026-06-01 08:00:00.845 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21214/300s
[INFO ] 2026-06-01 08:00:05.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:00:07.602 [29117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:00:11.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 08:00:11.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423986,ok=423986,error=0, records=41
[INFO ] 2026-06-01 08:00:11.361 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21201/300s
[INFO ] 2026-06-01 08:00:20.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:00:22.608 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:00:26.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 08:00:26.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423987,ok=423987,error=0, records=41
[INFO ] 2026-06-01 08:00:35.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:00:37.613 [29117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:00:41.096 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21214/300s
[INFO ] 2026-06-01 08:00:41.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 08:00:41.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423988,ok=423988,error=0, records=41
[INFO ] 2026-06-01 08:00:48.167 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868324},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:00:48.339 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:00:48.339 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 08:00:48.339 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:00:48.339 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:00:48.339 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:00:48.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:00:48.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:00:48.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:00:48.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:00:48.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:00:48.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:00:50.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:00:52.618 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:00:56.391 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 08:00:56.391 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423989,ok=423989,error=0, records=41
[INFO ] 2026-06-01 08:01:05.236 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21201/300s
[INFO ] 2026-06-01 08:01:05.452 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:01:07.624 [29118] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:01:11.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 08:01:11.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423990,ok=423990,error=0, records=41
[INFO ] 2026-06-01 08:01:20.452 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:01:22.630 [29118] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:01:26.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 08:01:26.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423991,ok=423991,error=0, records=41
[INFO ] 2026-06-01 08:01:33.419 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21210/300s
[INFO ] 2026-06-01 08:01:35.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:01:37.635 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:01:41.409 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 08:01:41.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423992,ok=423992,error=0, records=41
[INFO ] 2026-06-01 08:01:50.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:01:52.640 [29098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:01:56.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 08:01:56.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423993,ok=423993,error=0, records=41
[INFO ] 2026-06-01 08:02:05.454 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:02:05.454 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21213/300s
[WARN ] 2026-06-01 08:02:07.645 [29117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:02:11.422 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 08:02:11.422 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423994,ok=423994,error=0, records=41
[INFO ] 2026-06-01 08:02:20.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:02:22.649 [29128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:02:26.451 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 08:02:26.451 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423995,ok=423995,error=0, records=41
[INFO ] 2026-06-01 08:02:35.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:02:37.655 [29128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:02:39.962 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21211/300s
[INFO ] 2026-06-01 08:02:41.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 08:02:41.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423996,ok=423996,error=0, records=41
[INFO ] 2026-06-01 08:02:41.863 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21211/300s
[INFO ] 2026-06-01 08:02:49.536 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21211/300s
[INFO ] 2026-06-01 08:02:50.456 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:02:52.659 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:02:56.466 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 08:02:56.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423997,ok=423997,error=0, records=41
[INFO ] 2026-06-01 08:03:05.456 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:03:07.665 [29117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:03:11.478 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 08:03:11.478 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423998,ok=423998,error=0, records=41
[INFO ] 2026-06-01 08:03:20.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:03:22.670 [29117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:03:26.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 08:03:26.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=423999,ok=423999,error=0, records=41
[INFO ] 2026-06-01 08:03:35.458 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 08:03:35.458 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 08:03:37.676 [29118] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:03:41.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 08:03:41.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424000,ok=424000,error=0, records=41
[INFO ] 2026-06-01 08:03:48.340 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17662/300s
[INFO ] 2026-06-01 08:03:48.341 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868252},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:03:48.502 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:03:48.502 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 08:03:48.503 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:03:48.503 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:03:48.503 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:03:48.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:03:48.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:03:48.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:03:48.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:03:48.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:03:48.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:03:50.458 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:03:52.680 [29118] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:03:56.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 08:03:56.504 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424001,ok=424001,error=0, records=41
[INFO ] 2026-06-01 08:04:05.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:04:07.686 [29128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:04:11.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 08:04:11.528 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424002,ok=424002,error=0, records=41
[INFO ] 2026-06-01 08:04:20.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:04:22.692 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:04:26.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-01 08:04:26.533 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424003,ok=424003,error=0, records=41
[INFO ] 2026-06-01 08:04:35.460 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:04:37.696 [29117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:04:41.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 08:04:41.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424004,ok=424004,error=0, records=41
[INFO ] 2026-06-01 08:04:50.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:04:52.701 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:04:56.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-01 08:04:56.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424005,ok=424005,error=0, records=41
[INFO ] 2026-06-01 08:04:57.702 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21206/300s
[INFO ] 2026-06-01 08:05:00.849 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21215/300s
[INFO ] 2026-06-01 08:05:05.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:05:07.706 [29118] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:05:11.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 08:05:11.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424006,ok=424006,error=0, records=41
[INFO ] 2026-06-01 08:05:11.547 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21202/300s
[INFO ] 2026-06-01 08:05:20.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:05:22.710 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:05:26.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 08:05:26.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424007,ok=424007,error=0, records=41
[INFO ] 2026-06-01 08:05:35.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:05:37.714 [29128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:05:41.102 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21215/300s
[INFO ] 2026-06-01 08:05:41.558 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 08:05:41.558 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424008,ok=424008,error=0, records=41
[INFO ] 2026-06-01 08:05:50.463 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:05:52.719 [29128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:05:56.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 08:05:56.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424009,ok=424009,error=0, records=41
[INFO ] 2026-06-01 08:06:05.417 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21202/300s
[INFO ] 2026-06-01 08:06:05.463 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:06:07.724 [29117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:06:11.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 08:06:11.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424010,ok=424010,error=0, records=41
[INFO ] 2026-06-01 08:06:20.464 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:06:22.728 [29117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:06:26.640 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 08:06:26.640 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424011,ok=424011,error=0, records=41
[INFO ] 2026-06-01 08:06:33.472 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21211/300s
[INFO ] 2026-06-01 08:06:35.465 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:06:37.734 [29098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:06:41.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 08:06:41.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424012,ok=424012,error=0, records=41
[INFO ] 2026-06-01 08:06:48.504 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868172},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:06:48.672 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:06:48.673 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 08:06:48.673 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:06:48.673 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:06:48.673 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:06:48.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:06:48.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:06:48.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:06:48.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:06:48.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:06:48.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:06:50.465 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:06:52.739 [29128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:06:56.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 08:06:56.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424013,ok=424013,error=0, records=41
[INFO ] 2026-06-01 08:07:05.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:07:05.466 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21214/300s
[WARN ] 2026-06-01 08:07:07.744 [29098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:07:11.655 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 08:07:11.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424014,ok=424014,error=0, records=41
[INFO ] 2026-06-01 08:07:20.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:07:22.749 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:07:26.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 08:07:26.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424015,ok=424015,error=0, records=41
[INFO ] 2026-06-01 08:07:35.467 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:07:37.754 [29118] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:07:40.016 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21212/300s
[INFO ] 2026-06-01 08:07:41.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 08:07:41.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424016,ok=424016,error=0, records=41
[INFO ] 2026-06-01 08:07:41.918 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21212/300s
[INFO ] 2026-06-01 08:07:49.577 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21212/300s
[INFO ] 2026-06-01 08:07:50.468 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:07:52.760 [29098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:07:56.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-01 08:07:56.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424017,ok=424017,error=0, records=41
[INFO ] 2026-06-01 08:08:05.468 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:08:07.766 [29098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:08:11.676 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 08:08:11.676 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424018,ok=424018,error=0, records=41
[INFO ] 2026-06-01 08:08:20.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:08:22.771 [29098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:08:26.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 08:08:26.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424019,ok=424019,error=0, records=41
[INFO ] 2026-06-01 08:08:35.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:08:37.776 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:08:41.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-01 08:08:41.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424020,ok=424020,error=0, records=41
[INFO ] 2026-06-01 08:08:50.470 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:08:50.470 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 08:08:52.783 [29128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:08:56.766 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 08:08:56.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424021,ok=424021,error=0, records=41
[INFO ] 2026-06-01 08:09:05.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:09:07.788 [29128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:09:11.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 08:09:11.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424022,ok=424022,error=0, records=41
[INFO ] 2026-06-01 08:09:20.472 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:09:22.793 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:09:26.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 08:09:26.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424023,ok=424023,error=0, records=41
[INFO ] 2026-06-01 08:09:35.472 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:09:37.799 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:09:41.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 08:09:41.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424024,ok=424024,error=0, records=41
[INFO ] 2026-06-01 08:09:48.673 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17663/300s
[INFO ] 2026-06-01 08:09:48.674 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868092},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:09:48.857 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:09:48.857 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 08:09:48.858 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:09:48.858 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:09:48.858 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:09:48.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:09:48.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:09:48.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:09:48.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:09:48.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:09:48.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:09:50.473 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:09:52.804 [29114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:09:56.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 08:09:56.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424025,ok=424025,error=0, records=41
[INFO ] 2026-06-01 08:09:57.806 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21207/300s
[INFO ] 2026-06-01 08:10:00.852 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21216/300s
[INFO ] 2026-06-01 08:10:05.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:10:07.810 [29687] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:10:11.801 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 08:10:11.801 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424026,ok=424026,error=0, records=41
[INFO ] 2026-06-01 08:10:11.801 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21203/300s
[INFO ] 2026-06-01 08:10:20.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:10:22.815 [29717] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:10:26.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 08:10:26.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424027,ok=424027,error=0, records=41
[INFO ] 2026-06-01 08:10:35.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:10:37.820 [29118] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:10:41.108 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21216/300s
[INFO ] 2026-06-01 08:10:41.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 08:10:41.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424028,ok=424028,error=0, records=41
[INFO ] 2026-06-01 08:10:50.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:10:52.825 [29707] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:10:56.817 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 08:10:56.817 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424029,ok=424029,error=0, records=41
[INFO ] 2026-06-01 08:11:05.476 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:11:05.594 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21203/300s
[WARN ] 2026-06-01 08:11:07.830 [29750] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:11:11.823 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 08:11:11.823 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424030,ok=424030,error=0, records=41
[INFO ] 2026-06-01 08:11:20.477 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:11:22.835 [29687] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:11:26.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 08:11:26.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424031,ok=424031,error=0, records=41
[INFO ] 2026-06-01 08:11:33.522 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21212/300s
[INFO ] 2026-06-01 08:11:35.477 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:11:37.841 [29765] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:11:41.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 08:11:41.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424032,ok=424032,error=0, records=41
[INFO ] 2026-06-01 08:11:50.478 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:11:52.846 [29687] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:11:56.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 08:11:56.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424033,ok=424033,error=0, records=41
[INFO ] 2026-06-01 08:12:05.478 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:12:05.479 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21215/300s
[WARN ] 2026-06-01 08:12:07.851 [29765] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:12:11.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 08:12:11.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424034,ok=424034,error=0, records=41
[INFO ] 2026-06-01 08:12:20.479 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:12:22.855 [29816] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:12:26.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 08:12:26.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424035,ok=424035,error=0, records=41
[INFO ] 2026-06-01 08:12:35.480 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:12:37.860 [29830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:12:40.080 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21213/300s
[INFO ] 2026-06-01 08:12:41.860 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 08:12:41.860 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424036,ok=424036,error=0, records=41
[INFO ] 2026-06-01 08:12:41.981 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21213/300s
[INFO ] 2026-06-01 08:12:48.859 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20868012},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:12:49.006 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:12:49.006 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 08:12:49.006 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:12:49.006 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:12:49.006 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:12:49.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:12:49.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:12:49.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:12:49.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:12:49.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:12:49.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:12:49.618 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21213/300s
[INFO ] 2026-06-01 08:12:50.480 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:12:52.864 [29830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:12:56.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 08:12:56.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424037,ok=424037,error=0, records=41
[INFO ] 2026-06-01 08:13:05.481 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:13:07.870 [29830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:13:11.869 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-01 08:13:11.869 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424038,ok=424038,error=0, records=41
[INFO ] 2026-06-01 08:13:20.482 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:13:22.876 [29874] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:13:26.874 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 08:13:26.874 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424039,ok=424039,error=0, records=41
[INFO ] 2026-06-01 08:13:35.482 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 08:13:35.482 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 08:13:37.882 [29888] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:13:41.881 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 08:13:41.881 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424040,ok=424040,error=0, records=41
[INFO ] 2026-06-01 08:13:50.483 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:13:52.887 [29874] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:13:56.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 08:13:56.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424041,ok=424041,error=0, records=41
[INFO ] 2026-06-01 08:14:05.483 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:14:07.893 [29921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:14:11.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 08:14:11.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424042,ok=424042,error=0, records=41
[INFO ] 2026-06-01 08:14:20.484 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:14:22.899 [29954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:14:26.961 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 08:14:26.961 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424043,ok=424043,error=0, records=41
[INFO ] 2026-06-01 08:14:35.484 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:14:37.905 [29954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:14:41.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 08:14:41.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424044,ok=424044,error=0, records=41
[INFO ] 2026-06-01 08:14:50.485 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:14:52.910 [29972] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:14:56.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 08:14:56.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424045,ok=424045,error=0, records=41
[INFO ] 2026-06-01 08:14:57.912 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21208/300s
[INFO ] 2026-06-01 08:15:00.855 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21217/300s
[INFO ] 2026-06-01 08:15:05.485 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:15:07.916 [29993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:15:11.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 08:15:11.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424046,ok=424046,error=0, records=41
[INFO ] 2026-06-01 08:15:11.977 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21204/300s
[INFO ] 2026-06-01 08:15:20.486 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:15:22.921 [30027] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:15:26.984 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 08:15:26.984 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424047,ok=424047,error=0, records=41
[INFO ] 2026-06-01 08:15:35.487 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:15:37.928 [30043] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:15:41.114 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21217/300s
[INFO ] 2026-06-01 08:15:41.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 08:15:41.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424048,ok=424048,error=0, records=41
[INFO ] 2026-06-01 08:15:49.006 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17664/300s
[INFO ] 2026-06-01 08:15:49.008 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867940},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:15:49.168 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:15:49.168 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 08:15:49.168 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:15:49.169 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:15:49.169 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:15:49.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:15:49.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:15:49.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:15:49.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:15:49.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:15:49.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:15:50.487 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:15:52.933 [30054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:15:57.001 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 08:15:57.001 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424049,ok=424049,error=0, records=41
[INFO ] 2026-06-01 08:16:05.488 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:16:05.774 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21204/300s
[WARN ] 2026-06-01 08:16:07.938 [30076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:16:12.007 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 08:16:12.007 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424050,ok=424050,error=0, records=41
[INFO ] 2026-06-01 08:16:20.489 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:16:22.943 [30087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:16:27.038 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 08:16:27.038 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424051,ok=424051,error=0, records=41
[INFO ] 2026-06-01 08:16:33.573 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21213/300s
[INFO ] 2026-06-01 08:16:35.489 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:16:37.949 [30076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:16:42.044 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 08:16:42.044 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424052,ok=424052,error=0, records=41
[INFO ] 2026-06-01 08:16:50.490 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:16:52.953 [30054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:16:57.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 08:16:57.050 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424053,ok=424053,error=0, records=41
[INFO ] 2026-06-01 08:17:05.490 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:17:05.490 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21216/300s
[WARN ] 2026-06-01 08:17:07.957 [30087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:17:12.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10139, records=41
[INFO ] 2026-06-01 08:17:12.055 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424054,ok=424054,error=0, records=41
[INFO ] 2026-06-01 08:17:20.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:17:22.962 [30130] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:17:27.061 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10077, records=41
[INFO ] 2026-06-01 08:17:27.061 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424055,ok=424055,error=0, records=41
[INFO ] 2026-06-01 08:17:35.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:17:37.966 [30053] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:17:40.098 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21214/300s
[INFO ] 2026-06-01 08:17:42.026 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21214/300s
[INFO ] 2026-06-01 08:17:42.066 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10093, records=41
[INFO ] 2026-06-01 08:17:42.066 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424056,ok=424056,error=0, records=41
[INFO ] 2026-06-01 08:17:49.638 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21214/300s
[INFO ] 2026-06-01 08:17:50.492 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:17:52.971 [30130] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:17:57.071 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10083, records=41
[INFO ] 2026-06-01 08:17:57.071 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424057,ok=424057,error=0, records=41
[INFO ] 2026-06-01 08:18:05.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:18:07.976 [30158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:18:12.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 08:18:12.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424058,ok=424058,error=0, records=41
[INFO ] 2026-06-01 08:18:20.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:18:22.982 [30054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:18:27.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 08:18:27.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424059,ok=424059,error=0, records=41
[INFO ] 2026-06-01 08:18:35.494 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:18:37.986 [30054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:18:42.116 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 08:18:42.116 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424060,ok=424060,error=0, records=41
[INFO ] 2026-06-01 08:18:49.170 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867864},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:18:49.336 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:18:49.337 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 08:18:49.337 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:18:49.337 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:18:49.337 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:18:49.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:18:49.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:18:49.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:18:49.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:18:49.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:18:49.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:18:50.494 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:18:52.990 [30054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:18:57.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 08:18:57.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424061,ok=424061,error=0, records=41
[INFO ] 2026-06-01 08:19:05.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:19:07.995 [30172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:19:12.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-01 08:19:12.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424062,ok=424062,error=0, records=41
[INFO ] 2026-06-01 08:19:20.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:19:23.000 [30258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:19:27.198 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 08:19:27.198 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424063,ok=424063,error=0, records=41
[INFO ] 2026-06-01 08:19:35.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:19:38.006 [30172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:19:42.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 08:19:42.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424064,ok=424064,error=0, records=41
[INFO ] 2026-06-01 08:19:50.497 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:19:53.011 [30158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:19:57.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 08:19:57.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424065,ok=424065,error=0, records=41
[INFO ] 2026-06-01 08:19:58.013 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21209/300s
[INFO ] 2026-06-01 08:20:00.858 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21218/300s
[INFO ] 2026-06-01 08:20:05.497 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:20:08.017 [30272] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:20:12.216 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 08:20:12.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424066,ok=424066,error=0, records=41
[INFO ] 2026-06-01 08:20:12.216 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21205/300s
[INFO ] 2026-06-01 08:20:20.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:20:23.022 [30200] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:20:27.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 08:20:27.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424067,ok=424067,error=0, records=41
[INFO ] 2026-06-01 08:20:35.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:20:38.027 [30200] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:20:41.120 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21218/300s
[INFO ] 2026-06-01 08:20:42.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 08:20:42.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424068,ok=424068,error=0, records=41
[INFO ] 2026-06-01 08:20:50.499 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:20:53.032 [30158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:20:57.354 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 08:20:57.354 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424069,ok=424069,error=0, records=41
[INFO ] 2026-06-01 08:21:05.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=33.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:21:05.955 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21205/300s
[WARN ] 2026-06-01 08:21:08.037 [30367] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:21:12.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 08:21:12.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424070,ok=424070,error=0, records=41
[INFO ] 2026-06-01 08:21:20.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=33.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:21:23.042 [30367] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:21:27.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 08:21:27.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424071,ok=424071,error=0, records=41
[INFO ] 2026-06-01 08:21:33.625 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21214/300s
[INFO ] 2026-06-01 08:21:35.501 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:21:38.047 [30367] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:21:42.370 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 08:21:42.370 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424072,ok=424072,error=0, records=41
[INFO ] 2026-06-01 08:21:49.337 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17665/300s
[INFO ] 2026-06-01 08:21:49.338 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867792},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:21:49.496 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:21:49.496 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 08:21:49.496 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:21:49.496 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:21:49.496 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:21:49.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:21:49.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:21:49.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:21:49.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:21:49.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:21:49.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:21:50.501 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:21:53.050 [30388] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:21:57.377 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 08:21:57.377 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424073,ok=424073,error=0, records=41
[INFO ] 2026-06-01 08:22:05.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=33.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:22:05.502 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21217/300s
[WARN ] 2026-06-01 08:22:07.556 [30429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:22:12.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 08:22:12.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424074,ok=424074,error=0, records=41
[INFO ] 2026-06-01 08:22:20.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:22:22.563 [30448] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:22:27.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 08:22:27.390 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424075,ok=424075,error=0, records=41
[INFO ] 2026-06-01 08:22:35.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=33.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:22:37.568 [30448] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:22:40.148 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21215/300s
[INFO ] 2026-06-01 08:22:42.059 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21215/300s
[INFO ] 2026-06-01 08:22:42.395 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 08:22:42.395 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424076,ok=424076,error=0, records=41
[INFO ] 2026-06-01 08:22:49.661 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21215/300s
[INFO ] 2026-06-01 08:22:50.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=33.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:22:52.572 [30460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:22:57.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 08:22:57.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424077,ok=424077,error=0, records=41
[INFO ] 2026-06-01 08:23:05.504 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=33.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:23:07.576 [30472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:23:12.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 08:23:12.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424078,ok=424078,error=0, records=41
[INFO ] 2026-06-01 08:23:20.504 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=33.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:23:22.580 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:23:27.411 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 08:23:27.411 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424079,ok=424079,error=0, records=41
[INFO ] 2026-06-01 08:23:35.505 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=33.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 08:23:35.505 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 08:23:37.584 [30478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:23:42.416 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 08:23:42.416 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424080,ok=424080,error=0, records=41
[INFO ] 2026-06-01 08:23:50.506 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=33.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:23:50.506 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 08:23:52.588 [30536] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:23:57.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 08:23:57.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424081,ok=424081,error=0, records=41
[INFO ] 2026-06-01 08:24:05.507 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:24:07.594 [30536] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:24:12.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 08:24:12.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424082,ok=424082,error=0, records=41
[INFO ] 2026-06-01 08:24:20.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:24:22.599 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:24:27.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 08:24:27.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424083,ok=424083,error=0, records=41
[INFO ] 2026-06-01 08:24:35.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:24:37.605 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:24:42.453 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 08:24:42.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424084,ok=424084,error=0, records=41
[INFO ] 2026-06-01 08:24:49.498 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867728},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:24:49.663 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:24:49.663 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 08:24:49.663 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:24:49.663 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:24:49.664 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:24:49.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:24:49.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:24:49.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:24:49.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:24:49.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:24:49.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:24:50.509 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:24:52.610 [30569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:24:57.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 08:24:57.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424085,ok=424085,error=0, records=41
[INFO ] 2026-06-01 08:24:58.113 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21210/300s
[INFO ] 2026-06-01 08:25:00.861 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21219/300s
[INFO ] 2026-06-01 08:25:05.510 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:25:07.618 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:25:12.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 08:25:12.465 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424086,ok=424086,error=0, records=41
[INFO ] 2026-06-01 08:25:12.466 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21206/300s
[INFO ] 2026-06-01 08:25:20.510 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:25:22.624 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:25:27.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 08:25:27.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424087,ok=424087,error=0, records=41
[INFO ] 2026-06-01 08:25:35.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:25:37.629 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:25:41.125 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21219/300s
[INFO ] 2026-06-01 08:25:42.483 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 08:25:42.483 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424088,ok=424088,error=0, records=41
[INFO ] 2026-06-01 08:25:50.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:25:52.635 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:25:57.487 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 08:25:57.487 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424089,ok=424089,error=0, records=41
[INFO ] 2026-06-01 08:26:05.512 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:26:06.136 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21206/300s
[WARN ] 2026-06-01 08:26:07.639 [30569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:26:12.493 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 08:26:12.493 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424090,ok=424090,error=0, records=41
[INFO ] 2026-06-01 08:26:20.513 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:26:22.643 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:26:27.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 08:26:27.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424091,ok=424091,error=0, records=41
[INFO ] 2026-06-01 08:26:33.674 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21215/300s
[INFO ] 2026-06-01 08:26:35.513 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:26:37.648 [30579] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:26:42.504 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 08:26:42.504 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424092,ok=424092,error=0, records=41
[INFO ] 2026-06-01 08:26:50.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:26:52.654 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:26:57.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 08:26:57.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424093,ok=424093,error=0, records=41
[INFO ] 2026-06-01 08:27:05.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:27:05.514 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21218/300s
[WARN ] 2026-06-01 08:27:07.659 [30579] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:27:12.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 08:27:12.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424094,ok=424094,error=0, records=41
[INFO ] 2026-06-01 08:27:20.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:27:22.663 [30569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:27:27.519 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 08:27:27.519 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424095,ok=424095,error=0, records=41
[INFO ] 2026-06-01 08:27:35.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:27:37.668 [30579] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:27:40.193 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21216/300s
[INFO ] 2026-06-01 08:27:42.095 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21216/300s
[INFO ] 2026-06-01 08:27:42.524 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 08:27:42.524 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424096,ok=424096,error=0, records=41
[INFO ] 2026-06-01 08:27:49.664 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17666/300s
[INFO ] 2026-06-01 08:27:49.665 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867656},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:27:49.702 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21216/300s
[INFO ] 2026-06-01 08:27:49.829 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:27:49.829 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 08:27:49.830 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:27:49.830 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:27:49.830 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:27:49.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:27:49.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:27:49.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:27:49.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:27:49.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:27:49.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:27:50.516 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:27:52.672 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:27:57.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 08:27:57.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424097,ok=424097,error=0, records=41
[INFO ] 2026-06-01 08:28:05.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:28:07.678 [30579] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:28:12.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-01 08:28:12.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424098,ok=424098,error=0, records=41
[INFO ] 2026-06-01 08:28:20.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:28:22.683 [30569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:28:27.541 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 08:28:27.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424099,ok=424099,error=0, records=41
[INFO ] 2026-06-01 08:28:35.518 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:28:37.688 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:28:42.546 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 08:28:42.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424100,ok=424100,error=0, records=41
[INFO ] 2026-06-01 08:28:50.518 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:28:52.692 [30579] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:28:57.551 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 08:28:57.551 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424101,ok=424101,error=0, records=41
[INFO ] 2026-06-01 08:29:05.519 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:29:07.697 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:29:12.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 08:29:12.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424102,ok=424102,error=0, records=41
[INFO ] 2026-06-01 08:29:20.519 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:29:22.703 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:29:27.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 08:29:27.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424103,ok=424103,error=0, records=41
[INFO ] 2026-06-01 08:29:35.520 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:29:37.708 [30569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:29:42.636 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 08:29:42.636 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424104,ok=424104,error=0, records=41
[INFO ] 2026-06-01 08:29:50.521 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:29:52.713 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:29:57.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 08:29:57.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424105,ok=424105,error=0, records=41
[INFO ] 2026-06-01 08:29:58.215 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21211/300s
[INFO ] 2026-06-01 08:30:00.864 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21220/300s
[INFO ] 2026-06-01 08:30:05.521 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:30:07.718 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:30:12.664 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 08:30:12.664 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424106,ok=424106,error=0, records=41
[INFO ] 2026-06-01 08:30:12.664 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21207/300s
[INFO ] 2026-06-01 08:30:20.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:30:22.722 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:30:27.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 08:30:27.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424107,ok=424107,error=0, records=41
[INFO ] 2026-06-01 08:30:35.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:30:37.728 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:30:41.131 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21220/300s
[INFO ] 2026-06-01 08:30:42.676 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 08:30:42.676 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424108,ok=424108,error=0, records=41
[INFO ] 2026-06-01 08:30:49.831 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867568},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:30:50.005 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:30:50.005 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 08:30:50.005 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:30:50.005 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:30:50.005 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:30:50.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:30:50.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:30:50.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:30:50.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:30:50.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:30:50.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:30:50.523 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:30:52.732 [30569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:30:57.681 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 08:30:57.681 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424109,ok=424109,error=0, records=41
[INFO ] 2026-06-01 08:31:05.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:31:06.290 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21207/300s
[WARN ] 2026-06-01 08:31:07.736 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:31:12.686 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 08:31:12.686 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424110,ok=424110,error=0, records=41
[INFO ] 2026-06-01 08:31:20.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:31:22.742 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:31:27.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 08:31:27.691 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424111,ok=424111,error=0, records=41
[INFO ] 2026-06-01 08:31:33.737 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21216/300s
[INFO ] 2026-06-01 08:31:35.525 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:31:37.747 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:31:42.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 08:31:42.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424112,ok=424112,error=0, records=41
[INFO ] 2026-06-01 08:31:50.525 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:31:52.753 [30569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:31:57.701 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 08:31:57.701 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424113,ok=424113,error=0, records=41
[INFO ] 2026-06-01 08:32:05.526 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:32:05.526 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21219/300s
[WARN ] 2026-06-01 08:32:07.757 [30579] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:32:12.707 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-01 08:32:12.707 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424114,ok=424114,error=0, records=41
[INFO ] 2026-06-01 08:32:20.527 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:32:22.763 [30569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:32:27.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 08:32:27.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424115,ok=424115,error=0, records=41
[INFO ] 2026-06-01 08:32:35.527 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:32:37.768 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:32:40.244 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21217/300s
[INFO ] 2026-06-01 08:32:42.146 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21217/300s
[INFO ] 2026-06-01 08:32:42.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 08:32:42.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424116,ok=424116,error=0, records=41
[INFO ] 2026-06-01 08:32:49.753 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21217/300s
[INFO ] 2026-06-01 08:32:50.528 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:32:52.773 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:32:57.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 08:32:57.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424117,ok=424117,error=0, records=41
[INFO ] 2026-06-01 08:33:05.529 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:33:07.778 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:33:12.731 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 08:33:12.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424118,ok=424118,error=0, records=41
[INFO ] 2026-06-01 08:33:20.529 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:33:22.783 [30569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:33:27.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 08:33:27.737 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424119,ok=424119,error=0, records=41
[INFO ] 2026-06-01 08:33:35.530 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 08:33:35.530 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 08:33:37.788 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:33:42.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 08:33:42.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424120,ok=424120,error=0, records=41
[INFO ] 2026-06-01 08:33:50.006 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17667/300s
[INFO ] 2026-06-01 08:33:50.007 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867496},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:33:50.171 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:33:50.171 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 08:33:50.172 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:33:50.172 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:33:50.172 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:33:50.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:33:50.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:33:50.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:33:50.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:33:50.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:33:50.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:33:50.530 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:33:52.794 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:33:57.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 08:33:57.748 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424121,ok=424121,error=0, records=41
[INFO ] 2026-06-01 08:34:05.531 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:34:07.799 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:34:12.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 08:34:12.754 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424122,ok=424122,error=0, records=41
[INFO ] 2026-06-01 08:34:20.531 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:34:22.805 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:34:27.759 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-01 08:34:27.759 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424123,ok=424123,error=0, records=41
[INFO ] 2026-06-01 08:34:35.532 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:34:37.811 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:34:42.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 08:34:42.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424124,ok=424124,error=0, records=41
[INFO ] 2026-06-01 08:34:50.533 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:34:52.815 [31156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:34:57.769 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 08:34:57.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424125,ok=424125,error=0, records=41
[INFO ] 2026-06-01 08:34:58.317 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21212/300s
[INFO ] 2026-06-01 08:35:00.867 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21221/300s
[INFO ] 2026-06-01 08:35:05.533 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:35:07.821 [31204] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:35:12.774 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 08:35:12.774 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424126,ok=424126,error=0, records=41
[INFO ] 2026-06-01 08:35:12.774 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21208/300s
[INFO ] 2026-06-01 08:35:20.534 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:35:22.826 [31204] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:35:27.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 08:35:27.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424127,ok=424127,error=0, records=41
[INFO ] 2026-06-01 08:35:35.534 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:35:37.831 [31190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:35:41.137 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21221/300s
[INFO ] 2026-06-01 08:35:42.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 08:35:42.850 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424128,ok=424128,error=0, records=41
[INFO ] 2026-06-01 08:35:50.535 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:35:52.837 [31219] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:35:57.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 08:35:57.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424129,ok=424129,error=0, records=41
[INFO ] 2026-06-01 08:36:05.535 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:36:06.466 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21208/300s
[WARN ] 2026-06-01 08:36:07.843 [31156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:36:12.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 08:36:12.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424130,ok=424130,error=0, records=41
[INFO ] 2026-06-01 08:36:20.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:36:22.849 [31156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:36:27.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 08:36:27.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424131,ok=424131,error=0, records=41
[INFO ] 2026-06-01 08:36:33.785 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21217/300s
[INFO ] 2026-06-01 08:36:35.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:36:37.855 [31247] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:36:42.871 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 08:36:42.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424132,ok=424132,error=0, records=41
[INFO ] 2026-06-01 08:36:50.173 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867412},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:36:50.337 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:36:50.337 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 08:36:50.337 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:36:50.337 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:36:50.337 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:36:50.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:36:50.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:36:50.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:36:50.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:36:50.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:36:50.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:36:50.537 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:36:52.860 [31257] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:36:57.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 08:36:57.877 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424133,ok=424133,error=0, records=41
[INFO ] 2026-06-01 08:37:05.537 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:37:05.538 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21220/300s
[WARN ] 2026-06-01 08:37:07.864 [31156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:37:12.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10379, records=41
[INFO ] 2026-06-01 08:37:12.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424134,ok=424134,error=0, records=41
[INFO ] 2026-06-01 08:37:20.538 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:37:22.869 [31312] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:37:27.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10394, records=41
[INFO ] 2026-06-01 08:37:27.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424135,ok=424135,error=0, records=41
[INFO ] 2026-06-01 08:37:35.539 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:37:37.875 [31156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:37:40.257 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21218/300s
[INFO ] 2026-06-01 08:37:42.161 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21218/300s
[INFO ] 2026-06-01 08:37:42.918 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-01 08:37:42.918 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424136,ok=424136,error=0, records=41
[INFO ] 2026-06-01 08:37:49.765 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21218/300s
[INFO ] 2026-06-01 08:37:50.539 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:37:52.880 [31156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:37:57.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 08:37:57.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424137,ok=424137,error=0, records=41
[INFO ] 2026-06-01 08:38:05.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:38:07.886 [31372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:38:12.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 08:38:12.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424138,ok=424138,error=0, records=41
[INFO ] 2026-06-01 08:38:20.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:38:22.892 [31389] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:38:27.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 08:38:27.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424139,ok=424139,error=0, records=41
[INFO ] 2026-06-01 08:38:35.541 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:38:37.898 [31372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:38:42.944 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 08:38:42.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424140,ok=424140,error=0, records=41
[INFO ] 2026-06-01 08:38:50.541 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:38:50.541 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 08:38:52.903 [31424] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:38:57.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 08:38:57.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424141,ok=424141,error=0, records=41
[INFO ] 2026-06-01 08:39:05.543 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:39:07.909 [31366] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:39:12.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 08:39:12.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424142,ok=424142,error=0, records=41
[INFO ] 2026-06-01 08:39:20.543 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:39:22.915 [31366] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:39:27.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-06-01 08:39:27.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424143,ok=424143,error=0, records=41
[INFO ] 2026-06-01 08:39:35.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:39:37.920 [31458] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:39:42.965 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 08:39:42.965 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424144,ok=424144,error=0, records=41
[INFO ] 2026-06-01 08:39:50.337 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17668/300s
[INFO ] 2026-06-01 08:39:50.339 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867272},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:39:50.491 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:39:50.491 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 08:39:50.491 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:39:50.491 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:39:50.491 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:39:50.545 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:39:50.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:39:50.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:39:50.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:39:50.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:39:50.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:39:50.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 08:39:52.925 [31480] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:39:57.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 08:39:57.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424145,ok=424145,error=0, records=41
[INFO ] 2026-06-01 08:39:58.426 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21213/300s
[INFO ] 2026-06-01 08:40:00.869 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21222/300s
[INFO ] 2026-06-01 08:40:05.545 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:40:07.930 [31508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:40:12.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 08:40:12.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424146,ok=424146,error=0, records=41
[INFO ] 2026-06-01 08:40:12.978 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21209/300s
[INFO ] 2026-06-01 08:40:20.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:40:22.937 [31491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:40:27.982 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 08:40:27.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424147,ok=424147,error=0, records=41
[INFO ] 2026-06-01 08:40:35.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:40:37.942 [31527] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:40:41.144 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21222/300s
[INFO ] 2026-06-01 08:40:42.988 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 08:40:42.988 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424148,ok=424148,error=0, records=41
[INFO ] 2026-06-01 08:40:50.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:40:52.947 [31556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:40:57.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 08:40:57.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424149,ok=424149,error=0, records=41
[INFO ] 2026-06-01 08:41:05.548 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:41:06.648 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21209/300s
[WARN ] 2026-06-01 08:41:07.952 [31544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:41:12.999 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 08:41:12.999 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424150,ok=424150,error=0, records=41
[INFO ] 2026-06-01 08:41:20.548 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:41:22.957 [31549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:41:28.006 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 08:41:28.006 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424151,ok=424151,error=0, records=41
[INFO ] 2026-06-01 08:41:33.837 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21218/300s
[INFO ] 2026-06-01 08:41:35.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:41:37.962 [31556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:41:43.011 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 08:41:43.011 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424152,ok=424152,error=0, records=41
[INFO ] 2026-06-01 08:41:50.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:41:52.967 [31561] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:41:58.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 08:41:58.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424153,ok=424153,error=0, records=41
[INFO ] 2026-06-01 08:42:05.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:42:05.550 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21221/300s
[WARN ] 2026-06-01 08:42:07.972 [31561] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:42:13.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 08:42:13.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424154,ok=424154,error=0, records=41
[INFO ] 2026-06-01 08:42:20.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:42:22.976 [31561] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:42:28.025 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 08:42:28.025 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424155,ok=424155,error=0, records=41
[INFO ] 2026-06-01 08:42:35.551 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:42:37.981 [31549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:42:40.287 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21219/300s
[INFO ] 2026-06-01 08:42:42.188 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21219/300s
[INFO ] 2026-06-01 08:42:43.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 08:42:43.055 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424156,ok=424156,error=0, records=41
[INFO ] 2026-06-01 08:42:49.793 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21219/300s
[INFO ] 2026-06-01 08:42:50.492 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867160},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:42:50.551 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-01 08:42:50.645 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:42:50.645 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 08:42:50.645 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:42:50.645 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:42:50.645 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:42:50.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:42:50.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:42:50.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:42:50.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:42:50.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:42:50.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 08:42:52.987 [31549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:42:58.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 08:42:58.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424157,ok=424157,error=0, records=41
[INFO ] 2026-06-01 08:43:05.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:43:07.992 [31666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:43:13.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 08:43:13.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424158,ok=424158,error=0, records=41
[INFO ] 2026-06-01 08:43:20.553 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:43:22.997 [31681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:43:28.098 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-01 08:43:28.098 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424159,ok=424159,error=0, records=41
[INFO ] 2026-06-01 08:43:35.553 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 08:43:35.553 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 08:43:38.001 [31586] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:43:43.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 08:43:43.106 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424160,ok=424160,error=0, records=41
[INFO ] 2026-06-01 08:43:50.554 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:43:53.007 [31709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:43:58.111 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 08:43:58.111 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424161,ok=424161,error=0, records=41
[INFO ] 2026-06-01 08:44:05.554 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:44:08.012 [31666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:44:13.116 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 08:44:13.116 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424162,ok=424162,error=0, records=41
[INFO ] 2026-06-01 08:44:20.555 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:44:23.017 [31556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:44:28.123 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 08:44:28.123 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424163,ok=424163,error=0, records=41
[INFO ] 2026-06-01 08:44:35.555 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:44:38.022 [31763] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:44:43.129 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 08:44:43.129 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424164,ok=424164,error=0, records=41
[INFO ] 2026-06-01 08:44:50.556 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:44:53.027 [31681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:44:58.133 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 08:44:58.133 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424165,ok=424165,error=0, records=41
[INFO ] 2026-06-01 08:44:58.528 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21214/300s
[INFO ] 2026-06-01 08:45:00.872 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21223/300s
[INFO ] 2026-06-01 08:45:05.556 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:45:08.032 [31777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:45:13.139 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-01 08:45:13.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424166,ok=424166,error=0, records=41
[INFO ] 2026-06-01 08:45:13.139 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21210/300s
[INFO ] 2026-06-01 08:45:20.557 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:45:23.036 [31666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:45:28.145 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10405, records=41
[INFO ] 2026-06-01 08:45:28.145 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424167,ok=424167,error=0, records=41
[INFO ] 2026-06-01 08:45:35.557 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:45:38.041 [31806] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:45:41.149 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21223/300s
[INFO ] 2026-06-01 08:45:43.151 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 08:45:43.151 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424168,ok=424168,error=0, records=41
[INFO ] 2026-06-01 08:45:50.558 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:45:50.645 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17669/300s
[INFO ] 2026-06-01 08:45:50.647 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867080},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:45:50.787 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:45:50.787 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 08:45:50.787 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:45:50.788 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:45:50.788 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:45:50.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:45:50.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:45:50.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:45:50.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:45:50.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:45:50.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 08:45:53.046 [31836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:45:58.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-01 08:45:58.158 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424169,ok=424169,error=0, records=41
[INFO ] 2026-06-01 08:46:05.558 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:46:06.821 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21210/300s
[WARN ] 2026-06-01 08:46:08.052 [31854] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:46:13.166 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 08:46:13.166 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424170,ok=424170,error=0, records=41
[INFO ] 2026-06-01 08:46:20.559 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:46:22.557 [31871] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:46:28.173 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 08:46:28.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424171,ok=424171,error=0, records=41
[INFO ] 2026-06-01 08:46:33.880 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21219/300s
[INFO ] 2026-06-01 08:46:35.559 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:46:37.563 [31891] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:46:43.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 08:46:43.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424172,ok=424172,error=0, records=41
[INFO ] 2026-06-01 08:46:50.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:46:52.568 [31896] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:46:58.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 08:46:58.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424173,ok=424173,error=0, records=41
[INFO ] 2026-06-01 08:47:05.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:47:05.560 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21222/300s
[WARN ] 2026-06-01 08:47:07.572 [31926] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:47:13.192 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 08:47:13.192 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424174,ok=424174,error=0, records=41
[INFO ] 2026-06-01 08:47:20.561 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:47:22.578 [31909] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:47:28.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 08:47:28.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424175,ok=424175,error=0, records=41
[INFO ] 2026-06-01 08:47:35.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:47:37.584 [31956] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:47:40.298 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21220/300s
[INFO ] 2026-06-01 08:47:42.200 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21220/300s
[INFO ] 2026-06-01 08:47:43.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 08:47:43.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424176,ok=424176,error=0, records=41
[INFO ] 2026-06-01 08:47:49.806 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21220/300s
[INFO ] 2026-06-01 08:47:50.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:47:52.589 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:47:58.223 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 08:47:58.223 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424177,ok=424177,error=0, records=41
[INFO ] 2026-06-01 08:48:05.563 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:48:07.594 [31956] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:48:13.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 08:48:13.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424178,ok=424178,error=0, records=41
[INFO ] 2026-06-01 08:48:20.563 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:48:22.599 [31944] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:48:28.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 08:48:28.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424179,ok=424179,error=0, records=41
[INFO ] 2026-06-01 08:48:35.564 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:48:37.604 [31944] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:48:43.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 08:48:43.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424180,ok=424180,error=0, records=41
[INFO ] 2026-06-01 08:48:50.564 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:48:50.789 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867008},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:48:50.962 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:48:50.962 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 08:48:50.962 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:48:50.962 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:48:50.962 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:48:51.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:48:51.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:48:51.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:48:51.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:48:51.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:48:51.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 08:48:52.611 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:48:58.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 08:48:58.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424181,ok=424181,error=0, records=41
[INFO ] 2026-06-01 08:49:05.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:49:07.616 [32003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:49:13.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 08:49:13.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424182,ok=424182,error=0, records=41
[INFO ] 2026-06-01 08:49:20.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:49:22.622 [32003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:49:28.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-01 08:49:28.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424183,ok=424183,error=0, records=41
[INFO ] 2026-06-01 08:49:35.566 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:49:37.627 [31944] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:49:43.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 08:49:43.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424184,ok=424184,error=0, records=41
[INFO ] 2026-06-01 08:49:50.567 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:49:52.632 [31993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:49:58.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 08:49:58.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424185,ok=424185,error=0, records=41
[INFO ] 2026-06-01 08:49:58.633 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21215/300s
[INFO ] 2026-06-01 08:50:00.875 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21224/300s
[INFO ] 2026-06-01 08:50:05.567 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:50:07.636 [31988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:50:13.278 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 08:50:13.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424186,ok=424186,error=0, records=41
[INFO ] 2026-06-01 08:50:13.279 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21211/300s
[INFO ] 2026-06-01 08:50:20.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:50:22.640 [31993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:50:28.284 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 08:50:28.284 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424187,ok=424187,error=0, records=41
[INFO ] 2026-06-01 08:50:35.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:50:37.645 [31944] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:50:41.156 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21224/300s
[INFO ] 2026-06-01 08:50:43.289 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 08:50:43.289 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424188,ok=424188,error=0, records=41
[INFO ] 2026-06-01 08:50:50.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:50:52.650 [31944] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:50:58.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 08:50:58.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424189,ok=424189,error=0, records=41
[INFO ] 2026-06-01 08:51:05.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:51:07.004 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21211/300s
[WARN ] 2026-06-01 08:51:07.655 [31988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:51:13.313 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 08:51:13.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424190,ok=424190,error=0, records=41
[INFO ] 2026-06-01 08:51:20.570 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:51:22.660 [32003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:51:28.319 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 08:51:28.319 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424191,ok=424191,error=0, records=41
[INFO ] 2026-06-01 08:51:33.936 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21220/300s
[INFO ] 2026-06-01 08:51:35.570 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:51:37.664 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:51:43.327 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 08:51:43.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424192,ok=424192,error=0, records=41
[INFO ] 2026-06-01 08:51:50.571 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:51:50.962 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17670/300s
[INFO ] 2026-06-01 08:51:50.964 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866920},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:51:51.122 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:51:51.123 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 08:51:51.123 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:51:51.123 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:51:51.123 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:51:51.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:51:51.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:51:51.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:51:51.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:51:51.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:51:51.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 08:51:52.669 [31993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:51:58.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 08:51:58.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424193,ok=424193,error=0, records=41
[INFO ] 2026-06-01 08:52:05.572 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:52:05.572 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21223/300s
[WARN ] 2026-06-01 08:52:07.674 [31988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:52:13.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 08:52:13.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424194,ok=424194,error=0, records=41
[INFO ] 2026-06-01 08:52:20.572 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:52:22.680 [31988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:52:28.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 08:52:28.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424195,ok=424195,error=0, records=41
[INFO ] 2026-06-01 08:52:35.573 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:52:37.685 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:52:40.346 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21221/300s
[INFO ] 2026-06-01 08:52:42.248 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21221/300s
[INFO ] 2026-06-01 08:52:43.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-01 08:52:43.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424196,ok=424196,error=0, records=41
[INFO ] 2026-06-01 08:52:49.855 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21221/300s
[INFO ] 2026-06-01 08:52:50.573 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:52:52.690 [31993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:52:58.354 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 08:52:58.354 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424197,ok=424197,error=0, records=41
[INFO ] 2026-06-01 08:53:05.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:53:07.694 [31993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:53:13.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 08:53:13.362 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424198,ok=424198,error=0, records=41
[INFO ] 2026-06-01 08:53:20.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:53:22.699 [32003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:53:28.407 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 08:53:28.407 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424199,ok=424199,error=0, records=41
[INFO ] 2026-06-01 08:53:35.575 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 08:53:35.575 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 08:53:37.703 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:53:43.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 08:53:43.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424200,ok=424200,error=0, records=41
[INFO ] 2026-06-01 08:53:50.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:53:50.576 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 08:53:52.709 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:53:58.482 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 08:53:58.482 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424201,ok=424201,error=0, records=41
[INFO ] 2026-06-01 08:54:05.577 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:54:07.714 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:54:13.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 08:54:13.489 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424202,ok=424202,error=0, records=41
[INFO ] 2026-06-01 08:54:20.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:54:22.720 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:54:28.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 08:54:28.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424203,ok=424203,error=0, records=41
[INFO ] 2026-06-01 08:54:35.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:54:37.725 [31988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:54:43.501 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 08:54:43.501 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424204,ok=424204,error=0, records=41
[INFO ] 2026-06-01 08:54:50.579 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:54:51.124 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866848},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:54:51.277 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:54:51.277 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 08:54:51.277 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:54:51.277 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:54:51.277 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:54:51.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:54:51.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:54:51.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:54:51.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:54:51.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:54:51.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 08:54:52.731 [31988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:54:58.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 08:54:58.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424205,ok=424205,error=0, records=41
[INFO ] 2026-06-01 08:54:58.733 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21216/300s
[INFO ] 2026-06-01 08:55:00.878 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21225/300s
[INFO ] 2026-06-01 08:55:05.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:55:07.736 [31993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:55:13.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 08:55:13.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424206,ok=424206,error=0, records=41
[INFO ] 2026-06-01 08:55:13.512 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21212/300s
[INFO ] 2026-06-01 08:55:20.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:55:22.741 [31988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:55:28.518 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 08:55:28.518 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424207,ok=424207,error=0, records=41
[INFO ] 2026-06-01 08:55:35.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:55:37.746 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:55:41.162 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21225/300s
[INFO ] 2026-06-01 08:55:43.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 08:55:43.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424208,ok=424208,error=0, records=41
[INFO ] 2026-06-01 08:55:50.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:55:52.753 [31993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:55:58.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 08:55:58.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424209,ok=424209,error=0, records=41
[INFO ] 2026-06-01 08:56:05.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:56:07.184 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21212/300s
[WARN ] 2026-06-01 08:56:07.759 [31988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:56:13.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 08:56:13.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424210,ok=424210,error=0, records=41
[INFO ] 2026-06-01 08:56:20.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:56:22.765 [31944] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:56:28.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 08:56:28.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424211,ok=424211,error=0, records=41
[INFO ] 2026-06-01 08:56:33.987 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21221/300s
[INFO ] 2026-06-01 08:56:35.583 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:56:37.770 [31944] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:56:43.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 08:56:43.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424212,ok=424212,error=0, records=41
[INFO ] 2026-06-01 08:56:50.584 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:56:52.775 [32003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:56:58.558 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 08:56:58.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424213,ok=424213,error=0, records=41
[INFO ] 2026-06-01 08:57:05.584 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:57:05.584 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21224/300s
[WARN ] 2026-06-01 08:57:07.780 [31993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:57:13.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10408, records=41
[INFO ] 2026-06-01 08:57:13.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424214,ok=424214,error=0, records=41
[INFO ] 2026-06-01 08:57:20.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:57:22.785 [31988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:57:28.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10391, records=41
[INFO ] 2026-06-01 08:57:28.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424215,ok=424215,error=0, records=41
[INFO ] 2026-06-01 08:57:35.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:57:37.791 [32003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:57:40.391 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21222/300s
[INFO ] 2026-06-01 08:57:42.292 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21222/300s
[INFO ] 2026-06-01 08:57:43.573 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 08:57:43.573 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424216,ok=424216,error=0, records=41
[INFO ] 2026-06-01 08:57:49.896 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21222/300s
[INFO ] 2026-06-01 08:57:50.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 08:57:51.277 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17671/300s
[INFO ] 2026-06-01 08:57:51.279 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866760},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 08:57:51.445 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 08:57:51.445 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 08:57:51.446 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 08:57:51.446 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 08:57:51.446 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 08:57:51.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:57:51.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 08:57:51.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:57:51.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 08:57:51.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 08:57:51.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 08:57:52.796 [31988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:57:58.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10382, records=41
[INFO ] 2026-06-01 08:57:58.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424217,ok=424217,error=0, records=41
[INFO ] 2026-06-01 08:58:05.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:58:07.801 [31993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:58:13.583 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-01 08:58:13.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424218,ok=424218,error=0, records=41
[WARN ] 2026-06-01 08:58:17.805 [31993] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/28884/stat), No such file or directory
[INFO ] 2026-06-01 08:58:20.587 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:58:22.806 [31988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:58:28.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 08:58:28.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424219,ok=424219,error=0, records=41
[WARN ] 2026-06-01 08:58:32.811 [32547] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/28884/stat), No such file or directory
[INFO ] 2026-06-01 08:58:35.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:58:37.811 [32547] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:58:43.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10135, records=41
[INFO ] 2026-06-01 08:58:43.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424220,ok=424220,error=0, records=41
[WARN ] 2026-06-01 08:58:47.816 [32566] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/28884/stat), No such file or directory
[INFO ] 2026-06-01 08:58:50.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:58:52.817 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:58:58.602 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-01 08:58:58.602 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424221,ok=424221,error=0, records=41
[INFO ] 2026-06-01 08:59:05.589 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:59:07.822 [32581] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:59:13.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 08:59:13.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424222,ok=424222,error=0, records=41
[INFO ] 2026-06-01 08:59:20.589 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:59:22.827 [32561] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:59:28.612 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 08:59:28.612 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424223,ok=424223,error=0, records=41
[INFO ] 2026-06-01 08:59:35.590 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:59:37.831 [32566] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:59:43.617 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 08:59:43.617 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424224,ok=424224,error=0, records=41
[INFO ] 2026-06-01 08:59:50.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 08:59:52.839 [32623] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 08:59:58.622 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 08:59:58.622 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424225,ok=424225,error=0, records=41
[INFO ] 2026-06-01 08:59:58.841 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21217/300s
[INFO ] 2026-06-01 09:00:00.882 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21226/300s
[INFO ] 2026-06-01 09:00:05.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:00:07.845 [32595] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:00:13.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 09:00:13.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424226,ok=424226,error=0, records=41
[INFO ] 2026-06-01 09:00:13.627 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21213/300s
[INFO ] 2026-06-01 09:00:20.592 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:00:22.850 [32623] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:00:28.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 09:00:28.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424227,ok=424227,error=0, records=41
[INFO ] 2026-06-01 09:00:35.592 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:00:37.856 [32623] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:00:41.168 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21226/300s
[INFO ] 2026-06-01 09:00:43.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 09:00:43.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424228,ok=424228,error=0, records=41
[INFO ] 2026-06-01 09:00:50.593 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:00:51.447 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866676},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:00:51.610 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:00:51.610 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 09:00:51.610 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:00:51.610 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:00:51.610 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:00:51.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:00:51.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:00:51.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:00:51.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:00:51.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:00:51.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 09:00:52.861 [32595] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:00:58.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 09:00:58.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424229,ok=424229,error=0, records=41
[INFO ] 2026-06-01 09:01:05.594 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:01:07.366 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21213/300s
[WARN ] 2026-06-01 09:01:07.867 [32623] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:01:13.648 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 09:01:13.648 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424230,ok=424230,error=0, records=41
[INFO ] 2026-06-01 09:01:20.594 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:01:22.873 [32595] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:01:28.655 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 09:01:28.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424231,ok=424231,error=0, records=41
[WARN ] 2026-06-01 09:01:32.379 [307  ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/28887/stat), No such file or directory
[INFO ] 2026-06-01 09:01:34.040 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21222/300s
[INFO ] 2026-06-01 09:01:35.595 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:01:37.878 [325  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:01:43.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 09:01:43.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424232,ok=424232,error=0, records=41
[WARN ] 2026-06-01 09:01:47.382 [32679] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/28909/stat), No such file or directory
[WARN ] 2026-06-01 09:01:47.383 [32679] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/28902/stat), No such file or directory
[WARN ] 2026-06-01 09:01:47.383 [32679] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/28887/stat), No such file or directory
[INFO ] 2026-06-01 09:01:50.595 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:01:52.885 [307  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:01:58.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 09:01:58.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424233,ok=424233,error=0, records=41
[INFO ] 2026-06-01 09:02:05.596 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:02:05.596 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21225/300s
[WARN ] 2026-06-01 09:02:07.890 [361  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:02:13.674 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 09:02:13.674 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424234,ok=424234,error=0, records=41
[INFO ] 2026-06-01 09:02:20.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:02:22.894 [379  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:02:28.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 09:02:28.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424235,ok=424235,error=0, records=41
[INFO ] 2026-06-01 09:02:35.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:02:37.900 [356  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:02:40.449 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21223/300s
[INFO ] 2026-06-01 09:02:42.351 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21223/300s
[INFO ] 2026-06-01 09:02:43.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 09:02:43.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424236,ok=424236,error=0, records=41
[INFO ] 2026-06-01 09:02:49.957 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21223/300s
[INFO ] 2026-06-01 09:02:50.598 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:02:52.905 [406  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:02:58.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 09:02:58.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424237,ok=424237,error=0, records=41
[INFO ] 2026-06-01 09:03:05.598 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:03:07.911 [423  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:03:13.703 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-01 09:03:13.703 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424238,ok=424238,error=0, records=41
[INFO ] 2026-06-01 09:03:20.599 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:03:22.916 [446  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:03:28.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 09:03:28.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424239,ok=424239,error=0, records=41
[INFO ] 2026-06-01 09:03:35.600 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 09:03:35.600 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 09:03:37.921 [446  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:03:43.717 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-01 09:03:43.717 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424240,ok=424240,error=0, records=41
[INFO ] 2026-06-01 09:03:50.600 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:03:51.610 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17672/300s
[INFO ] 2026-06-01 09:03:51.612 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866580},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:03:51.758 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:03:51.758 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 09:03:51.758 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:03:51.758 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:03:51.758 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:03:51.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:03:51.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:03:51.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:03:51.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:03:51.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:03:51.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 09:03:52.928 [484  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:03:58.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 09:03:58.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424241,ok=424241,error=0, records=41
[INFO ] 2026-06-01 09:04:05.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.73%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:04:07.933 [484  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:04:13.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 09:04:13.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424242,ok=424242,error=0, records=41
[INFO ] 2026-06-01 09:04:20.602 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:04:22.939 [517  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:04:28.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 09:04:28.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424243,ok=424243,error=0, records=41
[INFO ] 2026-06-01 09:04:35.602 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:04:37.945 [537  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:04:43.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 09:04:43.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424244,ok=424244,error=0, records=41
[INFO ] 2026-06-01 09:04:50.603 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:04:52.951 [544  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:04:58.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 09:04:58.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424245,ok=424245,error=0, records=41
[INFO ] 2026-06-01 09:04:58.952 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21218/300s
[INFO ] 2026-06-01 09:05:00.885 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21227/300s
[INFO ] 2026-06-01 09:05:05.604 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:05:07.956 [572  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:05:13.757 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 09:05:13.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424246,ok=424246,error=0, records=41
[INFO ] 2026-06-01 09:05:13.758 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21214/300s
[INFO ] 2026-06-01 09:05:20.604 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:05:22.960 [544  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:05:28.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 09:05:28.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424247,ok=424247,error=0, records=41
[INFO ] 2026-06-01 09:05:35.605 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:05:37.965 [555  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:05:41.174 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21227/300s
[INFO ] 2026-06-01 09:05:43.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 09:05:43.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424248,ok=424248,error=0, records=41
[INFO ] 2026-06-01 09:05:50.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:05:52.971 [586  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:05:58.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 09:05:58.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424249,ok=424249,error=0, records=41
[INFO ] 2026-06-01 09:06:05.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:06:07.538 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21214/300s
[WARN ] 2026-06-01 09:06:07.976 [586  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:06:13.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 09:06:13.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424250,ok=424250,error=0, records=41
[INFO ] 2026-06-01 09:06:20.607 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:06:22.981 [586  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:06:28.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 09:06:28.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424251,ok=424251,error=0, records=41
[INFO ] 2026-06-01 09:06:34.095 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21223/300s
[INFO ] 2026-06-01 09:06:35.607 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:06:37.987 [642  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:06:43.793 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 09:06:43.793 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424252,ok=424252,error=0, records=41
[INFO ] 2026-06-01 09:06:50.608 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:06:51.760 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866504},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:06:51.933 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:06:51.933 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 09:06:51.933 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:06:51.933 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:06:51.933 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:06:51.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:06:51.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:06:51.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:06:51.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:06:51.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:06:51.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 09:06:52.991 [586  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:06:58.799 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 09:06:58.799 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424253,ok=424253,error=0, records=41
[INFO ] 2026-06-01 09:07:05.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:07:05.609 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21226/300s
[WARN ] 2026-06-01 09:07:07.996 [670  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:07:13.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 09:07:13.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424254,ok=424254,error=0, records=41
[INFO ] 2026-06-01 09:07:20.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:07:23.002 [642  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:07:28.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 09:07:28.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424255,ok=424255,error=0, records=41
[INFO ] 2026-06-01 09:07:35.610 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:07:38.007 [670  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:07:40.516 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21224/300s
[INFO ] 2026-06-01 09:07:42.417 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21224/300s
[INFO ] 2026-06-01 09:07:43.817 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 09:07:43.817 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424256,ok=424256,error=0, records=41
[INFO ] 2026-06-01 09:07:50.023 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21224/300s
[INFO ] 2026-06-01 09:07:50.610 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:07:53.013 [714  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:07:58.822 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 09:07:58.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424257,ok=424257,error=0, records=41
[INFO ] 2026-06-01 09:08:05.611 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:08:08.017 [699  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:08:13.831 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 09:08:13.831 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424258,ok=424258,error=0, records=41
[INFO ] 2026-06-01 09:08:20.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:08:23.022 [714  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:08:28.837 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 09:08:28.837 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424259,ok=424259,error=0, records=41
[INFO ] 2026-06-01 09:08:35.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:08:38.027 [742  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:08:43.843 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 09:08:43.843 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424260,ok=424260,error=0, records=41
[INFO ] 2026-06-01 09:08:50.613 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:08:50.613 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 09:08:53.033 [756  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:08:58.848 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 09:08:58.848 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424261,ok=424261,error=0, records=41
[INFO ] 2026-06-01 09:09:05.614 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:09:08.038 [742  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:09:13.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-01 09:09:13.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424262,ok=424262,error=0, records=41
[INFO ] 2026-06-01 09:09:20.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:09:23.044 [714  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:09:28.859 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 09:09:28.859 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424263,ok=424263,error=0, records=41
[INFO ] 2026-06-01 09:09:35.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:09:38.049 [685  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:09:43.881 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 09:09:43.881 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424264,ok=424264,error=0, records=41
[INFO ] 2026-06-01 09:09:50.616 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:09:51.934 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17673/300s
[INFO ] 2026-06-01 09:09:51.935 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866424},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:09:52.095 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:09:52.095 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 09:09:52.095 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:09:52.095 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:09:52.095 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:09:52.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:09:52.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:09:52.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:09:52.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:09:52.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:09:52.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 09:09:52.554 [853  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:09:58.886 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-01 09:09:58.886 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424265,ok=424265,error=0, records=41
[INFO ] 2026-06-01 09:09:59.055 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21219/300s
[INFO ] 2026-06-01 09:10:00.888 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21228/300s
[INFO ] 2026-06-01 09:10:05.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:10:07.558 [871  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:10:13.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 09:10:13.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424266,ok=424266,error=0, records=41
[INFO ] 2026-06-01 09:10:13.892 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21215/300s
[INFO ] 2026-06-01 09:10:20.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:10:22.564 [871  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:10:28.897 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-01 09:10:28.897 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424267,ok=424267,error=0, records=41
[INFO ] 2026-06-01 09:10:35.618 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:10:37.568 [927  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:10:41.181 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21228/300s
[INFO ] 2026-06-01 09:10:43.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 09:10:43.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424268,ok=424268,error=0, records=41
[INFO ] 2026-06-01 09:10:50.618 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:10:52.573 [954  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:10:58.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-01 09:10:58.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424269,ok=424269,error=0, records=41
[INFO ] 2026-06-01 09:11:05.619 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:11:07.577 [984  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:11:07.720 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21215/300s
[INFO ] 2026-06-01 09:11:13.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 09:11:13.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424270,ok=424270,error=0, records=41
[INFO ] 2026-06-01 09:11:20.620 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:11:22.583 [997  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:11:28.918 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 09:11:28.918 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424271,ok=424271,error=0, records=41
[INFO ] 2026-06-01 09:11:34.150 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21224/300s
[INFO ] 2026-06-01 09:11:35.620 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:11:37.588 [984  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:11:43.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 09:11:43.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424272,ok=424272,error=0, records=41
[INFO ] 2026-06-01 09:11:50.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:11:52.592 [1053 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:11:58.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 09:11:58.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424273,ok=424273,error=0, records=41
[INFO ] 2026-06-01 09:12:05.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:12:05.621 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21227/300s
[WARN ] 2026-06-01 09:12:07.598 [1053 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:12:13.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 09:12:13.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424274,ok=424274,error=0, records=41
[INFO ] 2026-06-01 09:12:20.622 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:12:22.604 [1037 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:12:28.942 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-01 09:12:28.942 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424275,ok=424275,error=0, records=41
[INFO ] 2026-06-01 09:12:35.623 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:12:37.610 [1048 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:12:40.577 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21225/300s
[INFO ] 2026-06-01 09:12:42.479 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21225/300s
[INFO ] 2026-06-01 09:12:43.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 09:12:43.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424276,ok=424276,error=0, records=41
[INFO ] 2026-06-01 09:12:50.086 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21225/300s
[INFO ] 2026-06-01 09:12:50.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:12:52.097 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866340},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:12:52.267 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:12:52.267 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 09:12:52.267 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:12:52.267 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:12:52.267 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:12:52.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:12:52.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:12:52.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:12:52.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:12:52.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:12:52.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 09:12:52.615 [1037 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:12:58.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-01 09:12:58.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424277,ok=424277,error=0, records=41
[INFO ] 2026-06-01 09:13:05.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.73%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:13:07.621 [1067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:13:13.961 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 09:13:13.961 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424278,ok=424278,error=0, records=41
[INFO ] 2026-06-01 09:13:20.625 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:13:22.625 [1048 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:13:28.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 09:13:28.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424279,ok=424279,error=0, records=41
[INFO ] 2026-06-01 09:13:35.626 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 09:13:35.626 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 09:13:37.630 [1037 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:13:44.052 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 09:13:44.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424280,ok=424280,error=0, records=41
[INFO ] 2026-06-01 09:13:50.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:13:52.635 [1053 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:13:59.057 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 09:13:59.057 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424281,ok=424281,error=0, records=41
[INFO ] 2026-06-01 09:14:05.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:14:07.641 [1053 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:14:14.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 09:14:14.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424282,ok=424282,error=0, records=41
[INFO ] 2026-06-01 09:14:20.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:14:22.646 [1048 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:14:29.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 09:14:29.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424283,ok=424283,error=0, records=41
[INFO ] 2026-06-01 09:14:35.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:14:37.651 [1048 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:14:44.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 09:14:44.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424284,ok=424284,error=0, records=41
[INFO ] 2026-06-01 09:14:50.629 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:14:52.656 [1053 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:14:59.087 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 09:14:59.087 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424285,ok=424285,error=0, records=41
[INFO ] 2026-06-01 09:14:59.158 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21220/300s
[INFO ] 2026-06-01 09:15:00.892 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21229/300s
[INFO ] 2026-06-01 09:15:05.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:15:07.662 [1067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:15:14.095 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 09:15:14.095 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424286,ok=424286,error=0, records=41
[INFO ] 2026-06-01 09:15:14.095 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21216/300s
[INFO ] 2026-06-01 09:15:20.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:15:22.668 [997  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:15:29.100 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 09:15:29.100 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424287,ok=424287,error=0, records=41
[INFO ] 2026-06-01 09:15:35.631 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:15:37.674 [1048 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:15:41.188 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21229/300s
[INFO ] 2026-06-01 09:15:44.106 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 09:15:44.106 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424288,ok=424288,error=0, records=41
[INFO ] 2026-06-01 09:15:50.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:15:52.267 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17674/300s
[INFO ] 2026-06-01 09:15:52.269 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866260},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:15:52.417 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:15:52.417 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 09:15:52.417 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:15:52.417 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:15:52.417 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:15:52.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:15:52.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:15:52.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:15:52.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:15:52.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:15:52.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 09:15:52.679 [997  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:15:59.112 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 09:15:59.112 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424289,ok=424289,error=0, records=41
[INFO ] 2026-06-01 09:16:05.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:16:07.686 [997  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:16:07.904 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21216/300s
[INFO ] 2026-06-01 09:16:14.118 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-01 09:16:14.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424290,ok=424290,error=0, records=41
[INFO ] 2026-06-01 09:16:20.633 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:16:22.690 [1053 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:16:29.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 09:16:29.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424291,ok=424291,error=0, records=41
[INFO ] 2026-06-01 09:16:34.204 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21225/300s
[INFO ] 2026-06-01 09:16:35.633 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:16:37.696 [997  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:16:44.129 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 09:16:44.129 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424292,ok=424292,error=0, records=41
[INFO ] 2026-06-01 09:16:50.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:16:52.702 [1067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:16:59.134 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 09:16:59.134 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424293,ok=424293,error=0, records=41
[INFO ] 2026-06-01 09:17:05.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:17:05.635 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21228/300s
[WARN ] 2026-06-01 09:17:07.707 [1048 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:17:14.139 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 09:17:14.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424294,ok=424294,error=0, records=41
[INFO ] 2026-06-01 09:17:20.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:17:22.711 [997  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:17:29.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 09:17:29.144 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424295,ok=424295,error=0, records=41
[INFO ] 2026-06-01 09:17:35.636 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:17:37.716 [1067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:17:40.626 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21226/300s
[INFO ] 2026-06-01 09:17:42.527 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21226/300s
[INFO ] 2026-06-01 09:17:44.149 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-06-01 09:17:44.149 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424296,ok=424296,error=0, records=41
[INFO ] 2026-06-01 09:17:50.131 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21226/300s
[INFO ] 2026-06-01 09:17:50.636 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:17:52.721 [1048 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:17:59.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 09:17:59.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424297,ok=424297,error=0, records=41
[INFO ] 2026-06-01 09:18:05.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:18:07.727 [997  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:18:14.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 09:18:14.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424298,ok=424298,error=0, records=41
[INFO ] 2026-06-01 09:18:20.638 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:18:22.732 [1067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:18:29.167 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 09:18:29.167 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424299,ok=424299,error=0, records=41
[INFO ] 2026-06-01 09:18:35.638 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:18:37.736 [1037 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:18:44.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 09:18:44.172 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424300,ok=424300,error=0, records=41
[INFO ] 2026-06-01 09:18:50.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:18:52.419 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866188},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:18:52.581 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:18:52.581 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 09:18:52.581 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:18:52.581 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:18:52.581 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:18:52.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:18:52.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:18:52.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:18:52.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:18:52.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:18:52.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 09:18:52.741 [1053 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:18:59.178 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 09:18:59.178 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424301,ok=424301,error=0, records=41
[INFO ] 2026-06-01 09:19:05.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:19:07.747 [1067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:19:14.184 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 09:19:14.184 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424302,ok=424302,error=0, records=41
[INFO ] 2026-06-01 09:19:20.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:19:22.752 [1048 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:19:29.190 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 09:19:29.190 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424303,ok=424303,error=0, records=41
[INFO ] 2026-06-01 09:19:35.641 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:19:37.757 [1037 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:19:44.196 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 09:19:44.196 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424304,ok=424304,error=0, records=41
[INFO ] 2026-06-01 09:19:50.641 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:19:52.762 [997  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:19:59.200 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 09:19:59.200 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424305,ok=424305,error=0, records=41
[INFO ] 2026-06-01 09:19:59.264 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21221/300s
[INFO ] 2026-06-01 09:20:00.895 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21230/300s
[INFO ] 2026-06-01 09:20:05.642 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:20:07.767 [1067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:20:14.206 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 09:20:14.206 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424306,ok=424306,error=0, records=41
[INFO ] 2026-06-01 09:20:14.206 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21217/300s
[INFO ] 2026-06-01 09:20:20.642 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:20:22.773 [1037 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:20:29.213 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 09:20:29.213 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424307,ok=424307,error=0, records=41
[INFO ] 2026-06-01 09:20:35.643 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:20:37.779 [1048 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:20:41.194 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21230/300s
[INFO ] 2026-06-01 09:20:44.220 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 09:20:44.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424308,ok=424308,error=0, records=41
[INFO ] 2026-06-01 09:20:50.644 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:20:52.783 [1067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:20:59.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 09:20:59.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424309,ok=424309,error=0, records=41
[INFO ] 2026-06-01 09:21:05.644 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:21:07.789 [1053 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:21:08.084 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21217/300s
[INFO ] 2026-06-01 09:21:14.230 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 09:21:14.230 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424310,ok=424310,error=0, records=41
[INFO ] 2026-06-01 09:21:20.645 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:21:22.793 [1053 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:21:29.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 09:21:29.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424311,ok=424311,error=0, records=41
[INFO ] 2026-06-01 09:21:34.262 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21226/300s
[INFO ] 2026-06-01 09:21:35.645 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:21:37.798 [997  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:21:44.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 09:21:44.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424312,ok=424312,error=0, records=41
[INFO ] 2026-06-01 09:21:50.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:21:52.581 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17675/300s
[INFO ] 2026-06-01 09:21:52.583 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866108},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:21:52.766 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:21:52.766 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 09:21:52.766 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:21:52.766 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:21:52.766 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[WARN ] 2026-06-01 09:21:52.804 [1596 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:21:52.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:21:52.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:21:52.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:21:52.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:21:52.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:21:52.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:21:59.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 09:21:59.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424313,ok=424313,error=0, records=41
[INFO ] 2026-06-01 09:22:05.647 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:22:05.647 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21229/300s
[WARN ] 2026-06-01 09:22:07.809 [1048 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:22:14.250 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 09:22:14.250 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424314,ok=424314,error=0, records=41
[INFO ] 2026-06-01 09:22:20.647 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:22:22.815 [1626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:22:29.261 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 09:22:29.261 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424315,ok=424315,error=0, records=41
[INFO ] 2026-06-01 09:22:35.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:22:37.820 [1590 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:22:40.683 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21227/300s
[INFO ] 2026-06-01 09:22:42.584 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21227/300s
[INFO ] 2026-06-01 09:22:44.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 09:22:44.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424316,ok=424316,error=0, records=41
[INFO ] 2026-06-01 09:22:50.191 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21227/300s
[INFO ] 2026-06-01 09:22:50.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:22:52.825 [1653 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:22:59.272 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 09:22:59.272 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424317,ok=424317,error=0, records=41
[INFO ] 2026-06-01 09:23:05.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:23:07.831 [1590 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:23:14.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 09:23:14.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424318,ok=424318,error=0, records=41
[INFO ] 2026-06-01 09:23:20.650 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:23:22.836 [1680 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:23:29.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 09:23:29.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424319,ok=424319,error=0, records=41
[INFO ] 2026-06-01 09:23:35.650 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 09:23:35.650 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 09:23:37.841 [1691 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:23:44.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 09:23:44.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424320,ok=424320,error=0, records=41
[INFO ] 2026-06-01 09:23:50.651 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:23:50.651 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 09:23:52.846 [1691 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:23:59.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 09:23:59.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424321,ok=424321,error=0, records=41
[INFO ] 2026-06-01 09:24:05.653 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:24:07.851 [1680 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:24:14.312 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 09:24:14.312 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424322,ok=424322,error=0, records=41
[INFO ] 2026-06-01 09:24:20.653 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:24:22.856 [1707 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:24:29.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 09:24:29.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424323,ok=424323,error=0, records=41
[INFO ] 2026-06-01 09:24:35.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:24:37.861 [1707 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:24:44.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 09:24:44.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424324,ok=424324,error=0, records=41
[INFO ] 2026-06-01 09:24:50.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:24:52.767 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866024},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-01 09:24:52.867 [1721 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:24:52.944 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:24:52.944 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 09:24:52.944 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:24:52.944 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:24:52.944 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:24:52.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:24:52.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:24:52.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:24:52.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:24:52.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:24:52.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:24:59.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 09:24:59.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424325,ok=424325,error=0, records=41
[INFO ] 2026-06-01 09:24:59.369 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21222/300s
[INFO ] 2026-06-01 09:25:00.899 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21231/300s
[INFO ] 2026-06-01 09:25:05.655 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:25:07.872 [1707 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:25:14.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-06-01 09:25:14.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424326,ok=424326,error=0, records=41
[INFO ] 2026-06-01 09:25:14.333 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21218/300s
[INFO ] 2026-06-01 09:25:20.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:25:22.878 [1707 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:25:29.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 09:25:29.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424327,ok=424327,error=0, records=41
[INFO ] 2026-06-01 09:25:35.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:25:37.884 [1833 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:25:41.201 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21231/300s
[INFO ] 2026-06-01 09:25:44.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 09:25:44.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424328,ok=424328,error=0, records=41
[WARN ] 2026-06-01 09:25:47.388 [1866 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/32735/stat), No such file or directory
[WARN ] 2026-06-01 09:25:47.388 [1866 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/32734/stat), No such file or directory
[WARN ] 2026-06-01 09:25:47.388 [1866 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/32733/stat), No such file or directory
[INFO ] 2026-06-01 09:25:50.657 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:25:52.889 [1866 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:25:59.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 09:25:59.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424329,ok=424329,error=0, records=41
[INFO ] 2026-06-01 09:26:05.658 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:26:07.894 [1883 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:26:08.259 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21218/300s
[INFO ] 2026-06-01 09:26:14.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 09:26:14.355 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424330,ok=424330,error=0, records=41
[INFO ] 2026-06-01 09:26:20.658 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:26:22.900 [1883 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:26:29.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 09:26:29.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424331,ok=424331,error=0, records=41
[INFO ] 2026-06-01 09:26:34.316 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21227/300s
[INFO ] 2026-06-01 09:26:35.659 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:26:37.906 [1883 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:26:44.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 09:26:44.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424332,ok=424332,error=0, records=41
[INFO ] 2026-06-01 09:26:50.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:26:52.911 [1877 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:26:59.370 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 09:26:59.370 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424333,ok=424333,error=0, records=41
[INFO ] 2026-06-01 09:27:05.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:27:05.660 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21230/300s
[WARN ] 2026-06-01 09:27:07.918 [1929 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:27:14.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 09:27:14.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424334,ok=424334,error=0, records=41
[INFO ] 2026-06-01 09:27:20.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:27:22.924 [1919 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:27:29.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 09:27:29.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424335,ok=424335,error=0, records=41
[INFO ] 2026-06-01 09:27:35.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:27:37.929 [1954 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:27:40.740 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21228/300s
[INFO ] 2026-06-01 09:27:42.642 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21228/300s
[INFO ] 2026-06-01 09:27:44.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 09:27:44.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424336,ok=424336,error=0, records=41
[INFO ] 2026-06-01 09:27:50.246 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21228/300s
[INFO ] 2026-06-01 09:27:50.662 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:27:52.934 [1971 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:27:52.944 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17676/300s
[INFO ] 2026-06-01 09:27:52.945 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865932},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:27:53.099 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:27:53.099 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 09:27:53.099 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:27:53.099 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:27:53.099 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:27:53.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:27:53.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:27:53.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:27:53.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:27:53.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:27:53.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:27:59.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 09:27:59.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424337,ok=424337,error=0, records=41
[INFO ] 2026-06-01 09:28:05.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:28:07.940 [2012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:28:14.409 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 09:28:14.409 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424338,ok=424338,error=0, records=41
[INFO ] 2026-06-01 09:28:20.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:28:22.945 [1995 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:28:29.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 09:28:29.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424339,ok=424339,error=0, records=41
[INFO ] 2026-06-01 09:28:35.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:28:37.951 [1944 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:28:44.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 09:28:44.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424340,ok=424340,error=0, records=41
[INFO ] 2026-06-01 09:28:50.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:28:52.957 [2045 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:28:59.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 09:28:59.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424341,ok=424341,error=0, records=41
[INFO ] 2026-06-01 09:29:05.665 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:29:07.962 [2029 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:29:14.436 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 09:29:14.436 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424342,ok=424342,error=0, records=41
[INFO ] 2026-06-01 09:29:20.666 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:29:22.967 [2086 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:29:29.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 09:29:29.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424343,ok=424343,error=0, records=41
[INFO ] 2026-06-01 09:29:35.666 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:29:37.972 [2072 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:29:44.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 09:29:44.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424344,ok=424344,error=0, records=41
[INFO ] 2026-06-01 09:29:50.667 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:29:52.977 [2072 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:29:59.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 09:29:59.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424345,ok=424345,error=0, records=41
[INFO ] 2026-06-01 09:29:59.479 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21223/300s
[INFO ] 2026-06-01 09:30:00.902 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21232/300s
[INFO ] 2026-06-01 09:30:05.667 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:30:07.983 [2045 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:30:14.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 09:30:14.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424346,ok=424346,error=0, records=41
[INFO ] 2026-06-01 09:30:14.458 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21219/300s
[INFO ] 2026-06-01 09:30:20.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:30:22.989 [2039 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:30:29.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 09:30:29.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424347,ok=424347,error=0, records=41
[INFO ] 2026-06-01 09:30:35.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:30:37.994 [2158 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:30:41.208 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21232/300s
[INFO ] 2026-06-01 09:30:44.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 09:30:44.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424348,ok=424348,error=0, records=41
[INFO ] 2026-06-01 09:30:50.669 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:30:53.000 [2028 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:30:53.101 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865852},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:30:53.263 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:30:53.263 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 09:30:53.263 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:30:53.264 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:30:53.264 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:30:53.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:30:53.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:30:53.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:30:53.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:30:53.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:30:53.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:30:59.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 09:30:59.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424349,ok=424349,error=0, records=41
[INFO ] 2026-06-01 09:31:05.670 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:31:08.005 [2187 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:31:08.441 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21219/300s
[INFO ] 2026-06-01 09:31:14.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 09:31:14.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424350,ok=424350,error=0, records=41
[INFO ] 2026-06-01 09:31:20.670 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:31:23.012 [2086 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:31:29.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 09:31:29.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424351,ok=424351,error=0, records=41
[INFO ] 2026-06-01 09:31:34.372 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21228/300s
[INFO ] 2026-06-01 09:31:35.671 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:31:38.017 [2028 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:31:44.491 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 09:31:44.491 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424352,ok=424352,error=0, records=41
[INFO ] 2026-06-01 09:31:50.671 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:31:53.022 [2172 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:31:59.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 09:31:59.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424353,ok=424353,error=0, records=41
[INFO ] 2026-06-01 09:32:05.672 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:32:05.672 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21231/300s
[WARN ] 2026-06-01 09:32:08.027 [2242 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:32:14.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 09:32:14.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424354,ok=424354,error=0, records=41
[INFO ] 2026-06-01 09:32:20.672 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:32:23.032 [2086 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:32:29.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 09:32:29.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424355,ok=424355,error=0, records=41
[INFO ] 2026-06-01 09:32:35.673 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:32:38.038 [2270 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:32:40.791 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21229/300s
[INFO ] 2026-06-01 09:32:42.692 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21229/300s
[INFO ] 2026-06-01 09:32:44.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 09:32:44.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424356,ok=424356,error=0, records=41
[INFO ] 2026-06-01 09:32:50.299 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21229/300s
[INFO ] 2026-06-01 09:32:50.674 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:32:53.043 [2270 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:32:59.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 09:32:59.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424357,ok=424357,error=0, records=41
[INFO ] 2026-06-01 09:33:05.674 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:33:08.048 [2286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:33:14.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 09:33:14.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424358,ok=424358,error=0, records=41
[INFO ] 2026-06-01 09:33:20.675 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:33:22.554 [2320 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:33:29.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 09:33:29.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424359,ok=424359,error=0, records=41
[INFO ] 2026-06-01 09:33:35.676 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 09:33:35.676 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 09:33:37.559 [2336 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:33:44.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 09:33:44.631 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424360,ok=424360,error=0, records=41
[INFO ] 2026-06-01 09:33:50.676 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:33:52.564 [2309 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:33:53.264 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17677/300s
[INFO ] 2026-06-01 09:33:53.265 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865772},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:33:53.428 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:33:53.428 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 09:33:53.428 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:33:53.428 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:33:53.428 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:33:53.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:33:53.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:33:53.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:33:53.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:33:53.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:33:53.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:33:59.636 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 09:33:59.636 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424361,ok=424361,error=0, records=41
[INFO ] 2026-06-01 09:34:05.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:34:07.568 [2336 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:34:14.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 09:34:14.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424362,ok=424362,error=0, records=41
[INFO ] 2026-06-01 09:34:20.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:34:22.573 [2366 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:34:29.647 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 09:34:29.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424363,ok=424363,error=0, records=41
[INFO ] 2026-06-01 09:34:35.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:34:37.577 [2410 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:34:44.652 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 09:34:44.652 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424364,ok=424364,error=0, records=41
[INFO ] 2026-06-01 09:34:50.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:34:52.581 [2402 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:34:59.583 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21224/300s
[INFO ] 2026-06-01 09:34:59.708 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 09:34:59.708 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424365,ok=424365,error=0, records=41
[INFO ] 2026-06-01 09:35:00.905 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21233/300s
[INFO ] 2026-06-01 09:35:05.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:35:07.587 [2421 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:35:14.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 09:35:14.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424366,ok=424366,error=0, records=41
[INFO ] 2026-06-01 09:35:14.715 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21220/300s
[INFO ] 2026-06-01 09:35:20.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:35:22.593 [2460 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:35:29.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 09:35:29.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424367,ok=424367,error=0, records=41
[INFO ] 2026-06-01 09:35:35.680 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:35:37.598 [2460 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:35:41.214 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21233/300s
[INFO ] 2026-06-01 09:35:44.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 09:35:44.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424368,ok=424368,error=0, records=41
[INFO ] 2026-06-01 09:35:50.681 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:35:52.603 [2459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:35:59.734 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 09:35:59.734 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424369,ok=424369,error=0, records=41
[INFO ] 2026-06-01 09:36:05.681 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:36:07.608 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:36:08.621 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21220/300s
[INFO ] 2026-06-01 09:36:14.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 09:36:14.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424370,ok=424370,error=0, records=41
[INFO ] 2026-06-01 09:36:20.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:36:22.613 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:36:29.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10145, records=41
[INFO ] 2026-06-01 09:36:29.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424371,ok=424371,error=0, records=41
[INFO ] 2026-06-01 09:36:34.420 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21229/300s
[INFO ] 2026-06-01 09:36:35.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:36:37.618 [2465 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:36:44.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-01 09:36:44.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424372,ok=424372,error=0, records=41
[INFO ] 2026-06-01 09:36:50.683 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:36:52.623 [2465 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:36:53.430 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865696},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:36:53.584 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:36:53.584 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 09:36:53.584 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:36:53.584 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:36:53.584 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:36:53.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:36:53.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:36:53.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:36:53.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:36:53.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:36:53.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:36:59.759 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-01 09:36:59.759 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424373,ok=424373,error=0, records=41
[INFO ] 2026-06-01 09:37:05.683 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:37:05.683 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21232/300s
[WARN ] 2026-06-01 09:37:07.629 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:37:14.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 09:37:14.765 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424374,ok=424374,error=0, records=41
[INFO ] 2026-06-01 09:37:20.684 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:37:22.634 [2460 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:37:29.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 09:37:29.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424375,ok=424375,error=0, records=41
[INFO ] 2026-06-01 09:37:35.685 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:37:37.639 [2465 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:37:40.823 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21230/300s
[INFO ] 2026-06-01 09:37:42.724 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21230/300s
[INFO ] 2026-06-01 09:37:44.775 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 09:37:44.775 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424376,ok=424376,error=0, records=41
[INFO ] 2026-06-01 09:37:50.329 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21230/300s
[INFO ] 2026-06-01 09:37:50.685 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:37:52.644 [2460 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:37:59.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 09:37:59.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424377,ok=424377,error=0, records=41
[INFO ] 2026-06-01 09:38:05.686 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:38:07.651 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:38:14.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 09:38:14.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424378,ok=424378,error=0, records=41
[INFO ] 2026-06-01 09:38:20.686 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:38:22.655 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:38:29.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 09:38:29.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424379,ok=424379,error=0, records=41
[INFO ] 2026-06-01 09:38:35.687 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:38:37.661 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:38:44.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 09:38:44.877 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424380,ok=424380,error=0, records=41
[INFO ] 2026-06-01 09:38:50.687 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:38:50.688 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 09:38:52.666 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:38:59.883 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 09:38:59.883 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424381,ok=424381,error=0, records=41
[INFO ] 2026-06-01 09:39:05.689 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:39:07.671 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:39:14.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 09:39:14.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424382,ok=424382,error=0, records=41
[INFO ] 2026-06-01 09:39:20.689 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:39:22.676 [2459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:39:29.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 09:39:29.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424383,ok=424383,error=0, records=41
[INFO ] 2026-06-01 09:39:35.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:39:37.680 [2465 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:39:44.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 09:39:44.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424384,ok=424384,error=0, records=41
[INFO ] 2026-06-01 09:39:50.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:39:52.685 [2460 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:39:53.584 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17678/300s
[INFO ] 2026-06-01 09:39:53.586 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865620},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:39:53.738 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:39:53.738 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 09:39:53.738 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:39:53.738 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:39:53.738 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:39:53.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:39:53.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:39:53.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:39:53.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:39:53.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:39:53.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:39:59.687 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21225/300s
[INFO ] 2026-06-01 09:39:59.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 09:39:59.905 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424385,ok=424385,error=0, records=41
[INFO ] 2026-06-01 09:40:00.908 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21234/300s
[INFO ] 2026-06-01 09:40:05.691 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:40:07.690 [2465 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:40:14.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 09:40:14.927 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424386,ok=424386,error=0, records=41
[INFO ] 2026-06-01 09:40:14.927 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21221/300s
[INFO ] 2026-06-01 09:40:20.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:40:22.696 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:40:29.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 09:40:29.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424387,ok=424387,error=0, records=41
[INFO ] 2026-06-01 09:40:35.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:40:37.702 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:40:41.220 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21234/300s
[INFO ] 2026-06-01 09:40:44.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 09:40:44.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424388,ok=424388,error=0, records=41
[INFO ] 2026-06-01 09:40:50.693 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:40:52.709 [2460 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:40:59.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 09:40:59.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424389,ok=424389,error=0, records=41
[INFO ] 2026-06-01 09:41:05.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:41:07.713 [2465 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:41:08.803 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21221/300s
[INFO ] 2026-06-01 09:41:14.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 09:41:14.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424390,ok=424390,error=0, records=41
[INFO ] 2026-06-01 09:41:20.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:41:22.717 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:41:29.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 09:41:29.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424391,ok=424391,error=0, records=41
[INFO ] 2026-06-01 09:41:34.478 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21230/300s
[INFO ] 2026-06-01 09:41:35.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:41:37.723 [2465 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:41:44.959 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-01 09:41:44.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424392,ok=424392,error=0, records=41
[INFO ] 2026-06-01 09:41:50.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:41:52.728 [2459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:41:59.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 09:41:59.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424393,ok=424393,error=0, records=41
[INFO ] 2026-06-01 09:42:05.696 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:42:05.696 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21233/300s
[WARN ] 2026-06-01 09:42:07.733 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:42:14.971 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 09:42:14.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424394,ok=424394,error=0, records=41
[INFO ] 2026-06-01 09:42:20.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:42:22.740 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:42:29.976 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 09:42:29.976 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424395,ok=424395,error=0, records=41
[INFO ] 2026-06-01 09:42:35.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:42:37.745 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:42:40.873 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21231/300s
[INFO ] 2026-06-01 09:42:42.774 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21231/300s
[INFO ] 2026-06-01 09:42:44.982 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 09:42:44.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424396,ok=424396,error=0, records=41
[INFO ] 2026-06-01 09:42:50.380 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21231/300s
[INFO ] 2026-06-01 09:42:50.698 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:42:52.751 [2465 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:42:53.740 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865540},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:42:53.897 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:42:53.897 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 09:42:53.897 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:42:53.897 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:42:53.897 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:42:53.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:42:53.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:42:53.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:42:53.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:42:53.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:42:53.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:42:59.992 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 09:42:59.992 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424397,ok=424397,error=0, records=41
[INFO ] 2026-06-01 09:43:05.698 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:43:07.755 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:43:14.997 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 09:43:14.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424398,ok=424398,error=0, records=41
[INFO ] 2026-06-01 09:43:20.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:43:22.760 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:43:30.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-01 09:43:30.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424399,ok=424399,error=0, records=41
[INFO ] 2026-06-01 09:43:35.700 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 09:43:35.700 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 09:43:37.765 [2459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:43:45.008 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 09:43:45.008 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424400,ok=424400,error=0, records=41
[INFO ] 2026-06-01 09:43:50.700 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:43:52.770 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:44:00.013 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 09:44:00.013 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424401,ok=424401,error=0, records=41
[INFO ] 2026-06-01 09:44:05.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:44:07.776 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:44:15.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 09:44:15.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424402,ok=424402,error=0, records=41
[INFO ] 2026-06-01 09:44:20.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:44:22.781 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:44:30.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 09:44:30.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424403,ok=424403,error=0, records=41
[INFO ] 2026-06-01 09:44:35.702 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:44:37.786 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:44:45.098 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 09:44:45.098 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424404,ok=424404,error=0, records=41
[INFO ] 2026-06-01 09:44:50.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:44:52.790 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:44:59.792 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21226/300s
[INFO ] 2026-06-01 09:45:00.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 09:45:00.103 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424405,ok=424405,error=0, records=41
[INFO ] 2026-06-01 09:45:00.911 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21235/300s
[INFO ] 2026-06-01 09:45:05.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:45:07.795 [2465 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:45:15.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 09:45:15.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424406,ok=424406,error=0, records=41
[INFO ] 2026-06-01 09:45:15.109 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21222/300s
[INFO ] 2026-06-01 09:45:20.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:45:22.799 [2459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:45:30.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 09:45:30.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424407,ok=424407,error=0, records=41
[INFO ] 2026-06-01 09:45:35.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:45:37.804 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:45:41.226 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21235/300s
[INFO ] 2026-06-01 09:45:45.122 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 09:45:45.122 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424408,ok=424408,error=0, records=41
[INFO ] 2026-06-01 09:45:50.705 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:45:52.808 [3026 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:45:53.897 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17679/300s
[INFO ] 2026-06-01 09:45:53.898 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865460},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:45:54.058 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:45:54.058 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 09:45:54.058 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:45:54.058 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:45:54.058 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:45:54.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:45:54.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:45:54.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:45:54.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:45:54.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:45:54.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:46:00.127 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 09:46:00.127 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424409,ok=424409,error=0, records=41
[INFO ] 2026-06-01 09:46:05.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:46:07.814 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:46:08.983 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21222/300s
[INFO ] 2026-06-01 09:46:15.132 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 09:46:15.132 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424410,ok=424410,error=0, records=41
[INFO ] 2026-06-01 09:46:20.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:46:22.820 [3031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:46:30.139 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 09:46:30.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424411,ok=424411,error=0, records=41
[INFO ] 2026-06-01 09:46:34.532 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21231/300s
[INFO ] 2026-06-01 09:46:35.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:46:37.826 [3031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:46:45.146 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 09:46:45.146 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424412,ok=424412,error=0, records=41
[INFO ] 2026-06-01 09:46:50.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:46:52.831 [3057 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:47:00.152 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 09:47:00.152 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424413,ok=424413,error=0, records=41
[INFO ] 2026-06-01 09:47:05.708 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:47:05.708 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21234/300s
[WARN ] 2026-06-01 09:47:07.836 [3103 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:47:15.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 09:47:15.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424414,ok=424414,error=0, records=41
[INFO ] 2026-06-01 09:47:20.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:47:22.841 [3031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:47:30.163 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 09:47:30.163 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424415,ok=424415,error=0, records=41
[INFO ] 2026-06-01 09:47:35.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:47:37.845 [3113 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:47:40.926 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21232/300s
[INFO ] 2026-06-01 09:47:42.827 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21232/300s
[INFO ] 2026-06-01 09:47:45.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 09:47:45.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424416,ok=424416,error=0, records=41
[INFO ] 2026-06-01 09:47:50.442 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21232/300s
[INFO ] 2026-06-01 09:47:50.710 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:47:52.850 [2448 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:48:00.174 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 09:48:00.174 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424417,ok=424417,error=0, records=41
[INFO ] 2026-06-01 09:48:05.710 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:48:07.856 [3143 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:48:15.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 09:48:15.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424418,ok=424418,error=0, records=41
[INFO ] 2026-06-01 09:48:20.711 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:48:22.861 [3057 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:48:30.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 09:48:30.186 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424419,ok=424419,error=0, records=41
[INFO ] 2026-06-01 09:48:35.712 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:48:37.866 [3143 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:48:45.191 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 09:48:45.191 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424420,ok=424420,error=0, records=41
[INFO ] 2026-06-01 09:48:50.712 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:48:52.870 [3157 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:48:54.059 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865380},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:48:54.227 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:48:54.228 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 09:48:54.228 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:48:54.228 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:48:54.228 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:48:54.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:48:54.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:48:54.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:48:54.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:48:54.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:48:54.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:49:00.196 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 09:49:00.196 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424421,ok=424421,error=0, records=41
[INFO ] 2026-06-01 09:49:05.713 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:49:07.874 [3218 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:49:15.202 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 09:49:15.202 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424422,ok=424422,error=0, records=41
[INFO ] 2026-06-01 09:49:20.713 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:49:22.881 [3199 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:49:30.208 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 09:49:30.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424423,ok=424423,error=0, records=41
[INFO ] 2026-06-01 09:49:35.714 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:49:37.886 [3245 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:49:45.213 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 09:49:45.213 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424424,ok=424424,error=0, records=41
[INFO ] 2026-06-01 09:49:50.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:49:52.891 [3263 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:49:59.894 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21227/300s
[INFO ] 2026-06-01 09:50:00.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 09:50:00.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424425,ok=424425,error=0, records=41
[INFO ] 2026-06-01 09:50:00.915 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21236/300s
[INFO ] 2026-06-01 09:50:05.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:50:07.897 [3289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:50:15.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 09:50:15.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424426,ok=424426,error=0, records=41
[INFO ] 2026-06-01 09:50:15.224 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21223/300s
[INFO ] 2026-06-01 09:50:20.716 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:50:22.903 [3294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:50:30.230 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 09:50:30.230 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424427,ok=424427,error=0, records=41
[INFO ] 2026-06-01 09:50:35.717 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:50:37.908 [3263 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:50:41.233 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21236/300s
[INFO ] 2026-06-01 09:50:45.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-01 09:50:45.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424428,ok=424428,error=0, records=41
[INFO ] 2026-06-01 09:50:50.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:50:52.913 [3333 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:51:00.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-01 09:51:00.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424429,ok=424429,error=0, records=41
[INFO ] 2026-06-01 09:51:05.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:51:07.919 [3322 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:51:09.167 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21223/300s
[INFO ] 2026-06-01 09:51:15.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 09:51:15.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424430,ok=424430,error=0, records=41
[INFO ] 2026-06-01 09:51:20.719 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:51:22.923 [3316 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:51:30.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 09:51:30.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424431,ok=424431,error=0, records=41
[INFO ] 2026-06-01 09:51:34.588 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21232/300s
[INFO ] 2026-06-01 09:51:35.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:51:37.929 [3388 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:51:45.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 09:51:45.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424432,ok=424432,error=0, records=41
[INFO ] 2026-06-01 09:51:50.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:51:52.935 [3400 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:51:54.228 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17680/300s
[INFO ] 2026-06-01 09:51:54.229 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865300},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:51:54.381 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:51:54.381 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 09:51:54.381 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:51:54.381 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:51:54.381 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:51:54.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:51:54.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:51:54.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:51:54.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:51:54.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:51:54.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:52:00.272 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 09:52:00.272 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424433,ok=424433,error=0, records=41
[INFO ] 2026-06-01 09:52:05.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:52:05.721 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21235/300s
[WARN ] 2026-06-01 09:52:07.941 [3401 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:52:15.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 09:52:15.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424434,ok=424434,error=0, records=41
[INFO ] 2026-06-01 09:52:20.723 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:52:22.947 [3416 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:52:30.290 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 09:52:30.290 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424435,ok=424435,error=0, records=41
[INFO ] 2026-06-01 09:52:35.723 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:52:37.952 [3417 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:52:40.988 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21233/300s
[INFO ] 2026-06-01 09:52:42.890 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21233/300s
[INFO ] 2026-06-01 09:52:45.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 09:52:45.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424436,ok=424436,error=0, records=41
[INFO ] 2026-06-01 09:52:50.497 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21233/300s
[INFO ] 2026-06-01 09:52:50.724 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:52:52.958 [3417 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:53:00.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 09:53:00.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424437,ok=424437,error=0, records=41
[INFO ] 2026-06-01 09:53:05.725 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:53:07.963 [3442 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:53:15.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 09:53:15.308 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424438,ok=424438,error=0, records=41
[INFO ] 2026-06-01 09:53:20.725 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:53:22.967 [3427 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:53:30.313 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 09:53:30.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424439,ok=424439,error=0, records=41
[INFO ] 2026-06-01 09:53:35.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 09:53:35.726 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 09:53:37.972 [3417 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:53:45.319 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 09:53:45.319 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424440,ok=424440,error=0, records=41
[INFO ] 2026-06-01 09:53:50.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:53:50.727 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 09:53:52.976 [3417 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:54:00.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 09:54:00.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424441,ok=424441,error=0, records=41
[INFO ] 2026-06-01 09:54:05.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:54:07.981 [3427 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:54:15.331 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 09:54:15.331 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424442,ok=424442,error=0, records=41
[INFO ] 2026-06-01 09:54:20.729 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:54:22.986 [3546 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:54:30.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 09:54:30.337 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424443,ok=424443,error=0, records=41
[INFO ] 2026-06-01 09:54:35.729 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:54:37.990 [3503 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:54:45.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 09:54:45.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424444,ok=424444,error=0, records=41
[INFO ] 2026-06-01 09:54:50.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:54:52.996 [3560 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:54:54.383 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865220},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:54:54.534 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:54:54.534 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 09:54:54.534 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:54:54.534 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:54:54.534 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:54:54.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:54:54.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:54:54.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:54:54.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:54:54.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:54:54.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:54:59.998 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21228/300s
[INFO ] 2026-06-01 09:55:00.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 09:55:00.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424445,ok=424445,error=0, records=41
[INFO ] 2026-06-01 09:55:00.918 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21237/300s
[INFO ] 2026-06-01 09:55:05.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:55:08.001 [3560 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:55:15.359 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 09:55:15.359 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424446,ok=424446,error=0, records=41
[INFO ] 2026-06-01 09:55:15.359 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21224/300s
[INFO ] 2026-06-01 09:55:20.731 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:55:23.005 [3560 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:55:30.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-01 09:55:30.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424447,ok=424447,error=0, records=41
[INFO ] 2026-06-01 09:55:35.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:55:38.009 [3617 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:55:41.239 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21237/300s
[INFO ] 2026-06-01 09:55:45.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-01 09:55:45.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424448,ok=424448,error=0, records=41
[INFO ] 2026-06-01 09:55:50.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:55:53.014 [3560 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:56:00.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 09:56:00.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424449,ok=424449,error=0, records=41
[INFO ] 2026-06-01 09:56:05.733 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:56:08.019 [3631 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:56:09.349 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21224/300s
[INFO ] 2026-06-01 09:56:15.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 09:56:15.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424450,ok=424450,error=0, records=41
[INFO ] 2026-06-01 09:56:20.733 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:56:23.024 [3631 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:56:30.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 09:56:30.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424451,ok=424451,error=0, records=41
[INFO ] 2026-06-01 09:56:34.644 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21233/300s
[INFO ] 2026-06-01 09:56:35.734 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:56:38.029 [3660 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:56:45.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 09:56:45.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424452,ok=424452,error=0, records=41
[INFO ] 2026-06-01 09:56:50.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:56:53.035 [3660 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:57:00.482 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 09:57:00.482 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424453,ok=424453,error=0, records=41
[INFO ] 2026-06-01 09:57:05.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 09:57:05.735 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21236/300s
[WARN ] 2026-06-01 09:57:08.039 [3693 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:57:15.487 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 09:57:15.487 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424454,ok=424454,error=0, records=41
[INFO ] 2026-06-01 09:57:20.736 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:57:23.044 [3710 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:57:30.493 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 09:57:30.493 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424455,ok=424455,error=0, records=41
[INFO ] 2026-06-01 09:57:35.736 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:57:38.048 [3735 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:57:41.042 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21234/300s
[INFO ] 2026-06-01 09:57:42.943 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21234/300s
[INFO ] 2026-06-01 09:57:45.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 09:57:45.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424456,ok=424456,error=0, records=41
[INFO ] 2026-06-01 09:57:50.550 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21234/300s
[INFO ] 2026-06-01 09:57:50.737 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:57:52.554 [3762 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:57:54.534 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17681/300s
[INFO ] 2026-06-01 09:57:54.536 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865136},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 09:57:54.670 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 09:57:54.670 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 09:57:54.670 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 09:57:54.670 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 09:57:54.670 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 09:57:54.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:57:54.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 09:57:54.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:57:54.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 09:57:54.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:57:54.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 09:58:00.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 09:58:00.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424457,ok=424457,error=0, records=41
[INFO ] 2026-06-01 09:58:05.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:58:07.558 [3779 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:58:15.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 09:58:15.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424458,ok=424458,error=0, records=41
[INFO ] 2026-06-01 09:58:20.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:58:22.563 [3801 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:58:30.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-01 09:58:30.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424459,ok=424459,error=0, records=41
[INFO ] 2026-06-01 09:58:35.739 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:58:37.568 [3812 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:58:45.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 09:58:45.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424460,ok=424460,error=0, records=41
[INFO ] 2026-06-01 09:58:50.739 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:58:52.572 [3818 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:59:00.560 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-01 09:59:00.560 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424461,ok=424461,error=0, records=41
[INFO ] 2026-06-01 09:59:05.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:59:07.577 [3849 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:59:15.566 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 09:59:15.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424462,ok=424462,error=0, records=41
[INFO ] 2026-06-01 09:59:20.741 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:59:22.581 [3852 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:59:30.574 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 09:59:30.574 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424463,ok=424463,error=0, records=41
[INFO ] 2026-06-01 09:59:35.741 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:59:37.586 [3886 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 09:59:45.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 09:59:45.581 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424464,ok=424464,error=0, records=41
[INFO ] 2026-06-01 09:59:50.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 09:59:52.592 [3835 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:00:00.095 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21229/300s
[INFO ] 2026-06-01 10:00:00.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 10:00:00.586 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424465,ok=424465,error=0, records=41
[INFO ] 2026-06-01 10:00:00.921 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21238/300s
[INFO ] 2026-06-01 10:00:05.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:00:07.598 [3849 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:00:15.592 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 10:00:15.593 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424466,ok=424466,error=0, records=41
[INFO ] 2026-06-01 10:00:15.593 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21225/300s
[INFO ] 2026-06-01 10:00:20.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:00:22.603 [3835 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:00:30.598 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 10:00:30.598 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424467,ok=424467,error=0, records=41
[INFO ] 2026-06-01 10:00:35.744 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:00:37.610 [3917 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:00:41.246 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21238/300s
[INFO ] 2026-06-01 10:00:45.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 10:00:45.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424468,ok=424468,error=0, records=41
[INFO ] 2026-06-01 10:00:50.744 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:00:52.615 [3885 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:00:54.672 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865048},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:00:54.822 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:00:54.822 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 10:00:54.822 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:00:54.822 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:00:54.822 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:00:54.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:00:54.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:00:54.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:00:54.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:00:54.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:00:54.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:01:00.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 10:01:00.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424469,ok=424469,error=0, records=41
[INFO ] 2026-06-01 10:01:05.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:01:07.619 [3917 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:01:09.532 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21225/300s
[INFO ] 2026-06-01 10:01:15.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 10:01:15.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424470,ok=424470,error=0, records=41
[INFO ] 2026-06-01 10:01:20.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:01:22.624 [3849 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:01:30.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 10:01:30.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424471,ok=424471,error=0, records=41
[INFO ] 2026-06-01 10:01:34.698 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21234/300s
[INFO ] 2026-06-01 10:01:35.746 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:01:37.631 [3917 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:01:45.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-01 10:01:45.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424472,ok=424472,error=0, records=41
[INFO ] 2026-06-01 10:01:50.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:01:52.637 [3902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:02:00.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 10:02:00.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424473,ok=424473,error=0, records=41
[INFO ] 2026-06-01 10:02:05.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:02:05.747 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21237/300s
[WARN ] 2026-06-01 10:02:07.642 [3835 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:02:15.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 10:02:15.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424474,ok=424474,error=0, records=41
[INFO ] 2026-06-01 10:02:20.748 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:02:22.648 [3835 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:02:30.640 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 10:02:30.640 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424475,ok=424475,error=0, records=41
[INFO ] 2026-06-01 10:02:35.748 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:02:37.654 [3917 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:02:41.101 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21235/300s
[INFO ] 2026-06-01 10:02:43.003 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21235/300s
[INFO ] 2026-06-01 10:02:45.647 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 10:02:45.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424476,ok=424476,error=0, records=41
[INFO ] 2026-06-01 10:02:50.610 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21235/300s
[INFO ] 2026-06-01 10:02:50.749 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:02:52.660 [3885 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:03:00.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-01 10:03:00.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424477,ok=424477,error=0, records=41
[INFO ] 2026-06-01 10:03:05.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:03:07.665 [3902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:03:15.657 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 10:03:15.657 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424478,ok=424478,error=0, records=41
[INFO ] 2026-06-01 10:03:20.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:03:22.670 [3917 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:03:30.664 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 10:03:30.664 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424479,ok=424479,error=0, records=41
[INFO ] 2026-06-01 10:03:35.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 10:03:35.751 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 10:03:37.674 [3902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:03:45.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 10:03:45.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424480,ok=424480,error=0, records=41
[INFO ] 2026-06-01 10:03:50.752 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:03:52.680 [3917 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:03:54.823 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17682/300s
[INFO ] 2026-06-01 10:03:54.824 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864972},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:03:54.990 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:03:54.990 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 10:03:54.990 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:03:54.990 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:03:54.990 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:03:55.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:03:55.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:03:55.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:03:55.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:03:55.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:03:55.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:04:00.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 10:04:00.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424481,ok=424481,error=0, records=41
[INFO ] 2026-06-01 10:04:05.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:04:07.685 [3835 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:04:15.681 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 10:04:15.681 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424482,ok=424482,error=0, records=41
[INFO ] 2026-06-01 10:04:20.754 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:04:22.690 [3885 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:04:30.686 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-01 10:04:30.686 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424483,ok=424483,error=0, records=41
[INFO ] 2026-06-01 10:04:35.754 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:04:37.696 [3902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:04:45.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-01 10:04:45.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424484,ok=424484,error=0, records=41
[INFO ] 2026-06-01 10:04:50.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:04:52.701 [3835 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:05:00.204 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21230/300s
[INFO ] 2026-06-01 10:05:00.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 10:05:00.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424485,ok=424485,error=0, records=41
[INFO ] 2026-06-01 10:05:00.925 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21239/300s
[INFO ] 2026-06-01 10:05:05.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:05:07.706 [3849 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:05:15.703 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 10:05:15.703 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424486,ok=424486,error=0, records=41
[INFO ] 2026-06-01 10:05:15.703 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21226/300s
[INFO ] 2026-06-01 10:05:20.756 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:05:22.712 [3835 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:05:30.708 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-01 10:05:30.708 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424487,ok=424487,error=0, records=41
[INFO ] 2026-06-01 10:05:35.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:05:37.718 [3917 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:05:41.253 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21239/300s
[INFO ] 2026-06-01 10:05:45.713 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 10:05:45.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424488,ok=424488,error=0, records=41
[INFO ] 2026-06-01 10:05:50.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:05:52.723 [3885 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:06:00.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 10:06:00.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424489,ok=424489,error=0, records=41
[INFO ] 2026-06-01 10:06:05.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:06:07.729 [3917 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:06:09.694 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21226/300s
[INFO ] 2026-06-01 10:06:15.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-01 10:06:15.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424490,ok=424490,error=0, records=41
[INFO ] 2026-06-01 10:06:20.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:06:22.735 [3885 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:06:30.775 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 10:06:30.775 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424491,ok=424491,error=0, records=41
[INFO ] 2026-06-01 10:06:34.752 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21235/300s
[INFO ] 2026-06-01 10:06:35.759 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:06:37.740 [3902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:06:45.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 10:06:45.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424492,ok=424492,error=0, records=41
[INFO ] 2026-06-01 10:06:50.760 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:06:52.744 [3835 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:06:54.992 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864876},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:06:55.171 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:06:55.171 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 10:06:55.171 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:06:55.171 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:06:55.171 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:06:55.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:06:55.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:06:55.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:06:55.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:06:55.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:06:55.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:07:00.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 10:07:00.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424493,ok=424493,error=0, records=41
[INFO ] 2026-06-01 10:07:05.760 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:07:05.760 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21238/300s
[WARN ] 2026-06-01 10:07:07.750 [3917 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:07:15.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10378, records=41
[INFO ] 2026-06-01 10:07:15.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424494,ok=424494,error=0, records=41
[INFO ] 2026-06-01 10:07:20.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:07:22.755 [3917 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:07:30.807 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-01 10:07:30.807 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424495,ok=424495,error=0, records=41
[INFO ] 2026-06-01 10:07:35.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:07:37.760 [3835 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:07:41.157 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21236/300s
[INFO ] 2026-06-01 10:07:43.058 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21236/300s
[INFO ] 2026-06-01 10:07:45.811 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 10:07:45.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424496,ok=424496,error=0, records=41
[INFO ] 2026-06-01 10:07:50.664 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21236/300s
[INFO ] 2026-06-01 10:07:50.762 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:07:52.767 [3885 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:08:00.816 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 10:08:00.816 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424497,ok=424497,error=0, records=41
[INFO ] 2026-06-01 10:08:05.762 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:08:07.772 [3849 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:08:15.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 10:08:15.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424498,ok=424498,error=0, records=41
[INFO ] 2026-06-01 10:08:20.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:08:22.778 [3902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:08:30.826 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 10:08:30.826 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424499,ok=424499,error=0, records=41
[INFO ] 2026-06-01 10:08:35.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:08:37.783 [3885 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:08:45.830 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 10:08:45.830 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424500,ok=424500,error=0, records=41
[INFO ] 2026-06-01 10:08:50.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:08:50.764 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 10:08:52.789 [3902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:09:00.836 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 10:09:00.836 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424501,ok=424501,error=0, records=41
[INFO ] 2026-06-01 10:09:05.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:09:07.794 [3885 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:09:15.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 10:09:15.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424502,ok=424502,error=0, records=41
[INFO ] 2026-06-01 10:09:20.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:09:22.799 [3902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:09:30.846 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 10:09:30.846 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424503,ok=424503,error=0, records=41
[INFO ] 2026-06-01 10:09:35.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:09:37.804 [3835 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:09:45.851 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 10:09:45.851 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424504,ok=424504,error=0, records=41
[INFO ] 2026-06-01 10:09:50.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:09:52.810 [3902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:09:55.171 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17683/300s
[INFO ] 2026-06-01 10:09:55.173 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864800},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:09:55.328 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:09:55.328 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 10:09:55.328 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:09:55.328 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:09:55.328 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:09:55.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:09:55.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:09:55.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:09:55.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:09:55.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:09:55.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:10:00.312 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21231/300s
[INFO ] 2026-06-01 10:10:00.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 10:10:00.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424505,ok=424505,error=0, records=41
[INFO ] 2026-06-01 10:10:00.928 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21240/300s
[INFO ] 2026-06-01 10:10:05.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:10:07.817 [3902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:10:15.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 10:10:15.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424506,ok=424506,error=0, records=41
[INFO ] 2026-06-01 10:10:15.861 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21227/300s
[INFO ] 2026-06-01 10:10:20.768 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:10:22.821 [4475 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:10:30.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 10:10:30.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424507,ok=424507,error=0, records=41
[INFO ] 2026-06-01 10:10:35.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:10:37.827 [4475 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:10:41.259 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21240/300s
[INFO ] 2026-06-01 10:10:45.888 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 10:10:45.888 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424508,ok=424508,error=0, records=41
[INFO ] 2026-06-01 10:10:50.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:10:52.832 [4489 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:11:00.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 10:11:00.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424509,ok=424509,error=0, records=41
[INFO ] 2026-06-01 10:11:05.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:11:07.838 [4489 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:11:09.871 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21227/300s
[INFO ] 2026-06-01 10:11:15.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 10:11:15.905 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424510,ok=424510,error=0, records=41
[INFO ] 2026-06-01 10:11:20.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:11:22.843 [4435 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:11:30.910 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 10:11:30.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424511,ok=424511,error=0, records=41
[INFO ] 2026-06-01 10:11:34.802 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21236/300s
[INFO ] 2026-06-01 10:11:35.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:11:37.847 [4475 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:11:45.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 10:11:45.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424512,ok=424512,error=0, records=41
[INFO ] 2026-06-01 10:11:50.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:11:52.852 [4503 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:12:00.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 10:12:00.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424513,ok=424513,error=0, records=41
[INFO ] 2026-06-01 10:12:05.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:12:05.773 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21239/300s
[WARN ] 2026-06-01 10:12:07.857 [4475 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:12:15.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 10:12:15.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424514,ok=424514,error=0, records=41
[INFO ] 2026-06-01 10:12:20.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:12:22.863 [4489 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:12:30.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 10:12:30.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424515,ok=424515,error=0, records=41
[INFO ] 2026-06-01 10:12:35.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:12:37.867 [4530 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:12:41.190 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21237/300s
[INFO ] 2026-06-01 10:12:43.092 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21237/300s
[INFO ] 2026-06-01 10:12:45.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 10:12:45.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424516,ok=424516,error=0, records=41
[INFO ] 2026-06-01 10:12:50.699 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21237/300s
[INFO ] 2026-06-01 10:12:50.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:12:52.872 [4530 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:12:55.330 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864720},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:12:55.493 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:12:55.493 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 10:12:55.493 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:12:55.493 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:12:55.493 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:12:55.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:12:55.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:12:55.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:12:55.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:12:55.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:12:55.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:13:00.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 10:13:00.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424517,ok=424517,error=0, records=41
[INFO ] 2026-06-01 10:13:05.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:13:07.878 [4644 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:13:15.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 10:13:15.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424518,ok=424518,error=0, records=41
[INFO ] 2026-06-01 10:13:20.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:13:22.883 [4649 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:13:30.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 10:13:30.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424519,ok=424519,error=0, records=41
[INFO ] 2026-06-01 10:13:35.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 10:13:35.777 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 10:13:37.888 [4554 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:13:46.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 10:13:46.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424520,ok=424520,error=0, records=41
[INFO ] 2026-06-01 10:13:50.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:13:52.893 [4671 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:14:01.005 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 10:14:01.005 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424521,ok=424521,error=0, records=41
[INFO ] 2026-06-01 10:14:05.778 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:14:07.898 [4703 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:14:16.018 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-01 10:14:16.018 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424522,ok=424522,error=0, records=41
[INFO ] 2026-06-01 10:14:20.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:14:22.904 [4671 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:14:31.027 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 10:14:31.027 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424523,ok=424523,error=0, records=41
[INFO ] 2026-06-01 10:14:35.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:14:37.908 [4741 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:14:46.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 10:14:46.032 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424524,ok=424524,error=0, records=41
[INFO ] 2026-06-01 10:14:50.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:14:52.914 [4746 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:15:00.416 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21232/300s
[INFO ] 2026-06-01 10:15:00.932 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21241/300s
[INFO ] 2026-06-01 10:15:01.039 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 10:15:01.039 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424525,ok=424525,error=0, records=41
[INFO ] 2026-06-01 10:15:05.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:15:07.919 [4763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:15:16.046 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 10:15:16.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424526,ok=424526,error=0, records=41
[INFO ] 2026-06-01 10:15:16.046 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21228/300s
[INFO ] 2026-06-01 10:15:20.781 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:15:22.924 [4785 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:15:31.096 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 10:15:31.096 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424527,ok=424527,error=0, records=41
[INFO ] 2026-06-01 10:15:35.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:15:37.929 [4808 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:15:41.266 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21241/300s
[INFO ] 2026-06-01 10:15:46.102 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 10:15:46.102 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424528,ok=424528,error=0, records=41
[INFO ] 2026-06-01 10:15:50.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:15:52.935 [4818 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:15:55.493 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17684/300s
[INFO ] 2026-06-01 10:15:55.495 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864648},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:15:55.656 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:15:55.656 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 10:15:55.656 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:15:55.656 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:15:55.656 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:15:55.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:15:55.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:15:55.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:15:55.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:15:55.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:15:55.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:16:01.106 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 10:16:01.106 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424529,ok=424529,error=0, records=41
[INFO ] 2026-06-01 10:16:05.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:16:07.940 [4819 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:16:10.053 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21228/300s
[INFO ] 2026-06-01 10:16:16.112 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 10:16:16.112 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424530,ok=424530,error=0, records=41
[INFO ] 2026-06-01 10:16:20.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:16:22.946 [4859 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:16:31.118 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 10:16:31.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424531,ok=424531,error=0, records=41
[INFO ] 2026-06-01 10:16:34.860 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21237/300s
[INFO ] 2026-06-01 10:16:35.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:16:37.950 [4870 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:16:46.132 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 10:16:46.132 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424532,ok=424532,error=0, records=41
[INFO ] 2026-06-01 10:16:50.785 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:16:52.955 [4884 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:17:01.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 10:17:01.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424533,ok=424533,error=0, records=41
[INFO ] 2026-06-01 10:17:05.786 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:17:05.786 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21240/300s
[WARN ] 2026-06-01 10:17:07.960 [4830 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:17:16.142 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 10:17:16.142 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424534,ok=424534,error=0, records=41
[INFO ] 2026-06-01 10:17:20.786 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:17:22.965 [4864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:17:31.149 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 10:17:31.149 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424535,ok=424535,error=0, records=41
[INFO ] 2026-06-01 10:17:35.787 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:17:37.970 [4884 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:17:41.254 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21238/300s
[INFO ] 2026-06-01 10:17:43.156 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21238/300s
[INFO ] 2026-06-01 10:17:46.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 10:17:46.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424536,ok=424536,error=0, records=41
[INFO ] 2026-06-01 10:17:50.763 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21238/300s
[INFO ] 2026-06-01 10:17:50.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:17:52.975 [4864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:18:01.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 10:18:01.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424537,ok=424537,error=0, records=41
[INFO ] 2026-06-01 10:18:05.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:18:07.979 [4864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:18:16.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 10:18:16.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424538,ok=424538,error=0, records=41
[INFO ] 2026-06-01 10:18:20.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:18:22.985 [4864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:18:31.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 10:18:31.170 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424539,ok=424539,error=0, records=41
[INFO ] 2026-06-01 10:18:35.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:18:37.990 [4939 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:18:46.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 10:18:46.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424540,ok=424540,error=0, records=41
[INFO ] 2026-06-01 10:18:50.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:18:52.996 [4939 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:18:55.658 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864584},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:18:55.834 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:18:55.834 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-01 10:18:55.834 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:18:55.834 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:18:55.834 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:18:55.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:18:55.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:18:55.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:18:55.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:18:55.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:18:55.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:19:01.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 10:19:01.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424541,ok=424541,error=0, records=41
[INFO ] 2026-06-01 10:19:05.791 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:19:08.003 [4939 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:19:16.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 10:19:16.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424542,ok=424542,error=0, records=41
[INFO ] 2026-06-01 10:19:20.791 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:19:23.008 [4996 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:19:31.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 10:19:31.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424543,ok=424543,error=0, records=41
[INFO ] 2026-06-01 10:19:35.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:19:38.014 [5025 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:19:46.250 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 10:19:46.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424544,ok=424544,error=0, records=41
[INFO ] 2026-06-01 10:19:50.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:19:53.019 [5052 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:20:00.521 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21233/300s
[INFO ] 2026-06-01 10:20:00.935 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21242/300s
[INFO ] 2026-06-01 10:20:01.257 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 10:20:01.257 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424545,ok=424545,error=0, records=41
[INFO ] 2026-06-01 10:20:05.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:20:08.023 [4982 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:20:16.261 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-01 10:20:16.261 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424546,ok=424546,error=0, records=41
[INFO ] 2026-06-01 10:20:16.261 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21229/300s
[INFO ] 2026-06-01 10:20:20.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:20:23.028 [5072 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:20:31.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 10:20:31.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424547,ok=424547,error=0, records=41
[INFO ] 2026-06-01 10:20:35.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:20:38.033 [5072 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:20:41.272 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21242/300s
[INFO ] 2026-06-01 10:20:46.271 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 10:20:46.271 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424548,ok=424548,error=0, records=41
[INFO ] 2026-06-01 10:20:50.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:20:53.039 [5115 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:21:01.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 10:21:01.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424549,ok=424549,error=0, records=41
[INFO ] 2026-06-01 10:21:05.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:21:08.045 [5116 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:21:10.237 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21229/300s
[INFO ] 2026-06-01 10:21:16.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 10:21:16.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424550,ok=424550,error=0, records=41
[INFO ] 2026-06-01 10:21:20.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:21:23.049 [5150 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:21:31.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-01 10:21:31.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424551,ok=424551,error=0, records=41
[INFO ] 2026-06-01 10:21:34.913 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21238/300s
[INFO ] 2026-06-01 10:21:35.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:21:37.555 [5155 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:21:46.294 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-01 10:21:46.294 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424552,ok=424552,error=0, records=41
[INFO ] 2026-06-01 10:21:50.797 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:21:52.560 [5172 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:21:55.835 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17685/300s
[INFO ] 2026-06-01 10:21:55.836 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864504},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:21:56.009 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:21:56.009 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 10:21:56.009 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:21:56.009 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:21:56.009 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:21:56.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:21:56.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:21:56.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:21:56.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:21:56.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:21:56.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:22:01.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 10:22:01.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424553,ok=424553,error=0, records=41
[INFO ] 2026-06-01 10:22:05.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:22:05.798 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21241/300s
[WARN ] 2026-06-01 10:22:07.565 [5190 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:22:16.305 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 10:22:16.305 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424554,ok=424554,error=0, records=41
[INFO ] 2026-06-01 10:22:20.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:22:22.570 [5186 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:22:31.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 10:22:31.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424555,ok=424555,error=0, records=41
[INFO ] 2026-06-01 10:22:35.799 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:22:37.576 [5237 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:22:41.305 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21239/300s
[INFO ] 2026-06-01 10:22:43.206 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21239/300s
[INFO ] 2026-06-01 10:22:46.320 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 10:22:46.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424556,ok=424556,error=0, records=41
[INFO ] 2026-06-01 10:22:50.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:22:50.813 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21239/300s
[WARN ] 2026-06-01 10:22:52.581 [5262 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:23:01.327 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 10:23:01.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424557,ok=424557,error=0, records=41
[INFO ] 2026-06-01 10:23:05.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:23:07.586 [5241 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:23:16.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 10:23:16.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424558,ok=424558,error=0, records=41
[INFO ] 2026-06-01 10:23:20.801 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:23:22.591 [5293 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:23:31.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 10:23:31.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424559,ok=424559,error=0, records=41
[INFO ] 2026-06-01 10:23:35.801 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 10:23:35.802 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 10:23:37.597 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:23:46.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 10:23:46.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424560,ok=424560,error=0, records=41
[INFO ] 2026-06-01 10:23:50.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:23:50.802 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 10:23:52.604 [5308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:24:01.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 10:24:01.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424561,ok=424561,error=0, records=41
[INFO ] 2026-06-01 10:24:05.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:24:07.609 [5308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:24:16.354 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 10:24:16.354 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424562,ok=424562,error=0, records=41
[INFO ] 2026-06-01 10:24:20.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:24:22.615 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:24:31.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 10:24:31.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424563,ok=424563,error=0, records=41
[INFO ] 2026-06-01 10:24:35.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:24:37.620 [5286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:24:46.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 10:24:46.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424564,ok=424564,error=0, records=41
[INFO ] 2026-06-01 10:24:50.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:24:52.624 [5292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:24:56.011 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864428},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:24:56.166 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:24:56.167 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 10:24:56.167 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:24:56.167 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:24:56.167 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:24:56.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:24:56.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:24:56.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:24:56.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:24:56.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:24:56.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:25:00.627 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21234/300s
[INFO ] 2026-06-01 10:25:00.939 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21243/300s
[INFO ] 2026-06-01 10:25:01.373 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 10:25:01.373 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424565,ok=424565,error=0, records=41
[INFO ] 2026-06-01 10:25:05.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:25:07.630 [5286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:25:16.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 10:25:16.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424566,ok=424566,error=0, records=41
[INFO ] 2026-06-01 10:25:16.380 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21230/300s
[INFO ] 2026-06-01 10:25:20.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:25:22.635 [5292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:25:31.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 10:25:31.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424567,ok=424567,error=0, records=41
[INFO ] 2026-06-01 10:25:35.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:25:37.641 [5286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:25:41.279 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21243/300s
[INFO ] 2026-06-01 10:25:46.392 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-01 10:25:46.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424568,ok=424568,error=0, records=41
[INFO ] 2026-06-01 10:25:50.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:25:52.646 [5292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:26:01.396 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 10:26:01.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424569,ok=424569,error=0, records=41
[INFO ] 2026-06-01 10:26:05.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:26:07.655 [5262 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:26:10.418 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21230/300s
[INFO ] 2026-06-01 10:26:16.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-01 10:26:16.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424570,ok=424570,error=0, records=41
[INFO ] 2026-06-01 10:26:20.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:26:22.660 [5292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:26:31.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 10:26:31.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424571,ok=424571,error=0, records=41
[INFO ] 2026-06-01 10:26:34.971 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21239/300s
[INFO ] 2026-06-01 10:26:35.811 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:26:37.664 [5308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:26:46.413 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 10:26:46.413 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424572,ok=424572,error=0, records=41
[INFO ] 2026-06-01 10:26:50.811 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:26:52.670 [5286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:27:01.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 10:27:01.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424573,ok=424573,error=0, records=41
[INFO ] 2026-06-01 10:27:05.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:27:05.812 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21242/300s
[WARN ] 2026-06-01 10:27:07.674 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:27:16.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 10:27:16.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424574,ok=424574,error=0, records=41
[INFO ] 2026-06-01 10:27:20.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:27:22.679 [5286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:27:31.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-01 10:27:31.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424575,ok=424575,error=0, records=41
[INFO ] 2026-06-01 10:27:35.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:27:37.683 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:27:41.378 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21240/300s
[INFO ] 2026-06-01 10:27:43.280 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21240/300s
[INFO ] 2026-06-01 10:27:46.436 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 10:27:46.436 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424576,ok=424576,error=0, records=41
[INFO ] 2026-06-01 10:27:50.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:27:50.887 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21240/300s
[WARN ] 2026-06-01 10:27:52.688 [5286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:27:56.167 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17686/300s
[INFO ] 2026-06-01 10:27:56.168 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864356},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:27:56.339 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:27:56.339 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 10:28:01.441 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 10:28:01.441 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424577,ok=424577,error=0, records=41
[INFO ] 2026-06-01 10:28:05.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:28:07.694 [5262 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:28:16.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 10:28:16.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424578,ok=424578,error=0, records=41
[INFO ] 2026-06-01 10:28:20.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:28:22.699 [5286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:28:31.454 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 10:28:31.454 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424579,ok=424579,error=0, records=41
[INFO ] 2026-06-01 10:28:35.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:28:37.704 [5262 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:28:46.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 10:28:46.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424580,ok=424580,error=0, records=41
[INFO ] 2026-06-01 10:28:50.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:28:52.710 [5308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:29:01.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-01 10:29:01.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424581,ok=424581,error=0, records=41
[INFO ] 2026-06-01 10:29:05.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:29:07.717 [5292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:29:16.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 10:29:16.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424582,ok=424582,error=0, records=41
[INFO ] 2026-06-01 10:29:20.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:29:22.722 [5262 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:29:31.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 10:29:31.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424583,ok=424583,error=0, records=41
[INFO ] 2026-06-01 10:29:35.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:29:37.746 [5308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:29:46.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 10:29:46.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424584,ok=424584,error=0, records=41
[WARN ] 2026-06-01 10:29:47.750 [5308 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1802/stat), No such file or directory
[WARN ] 2026-06-01 10:29:47.750 [5308 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1805/stat), No such file or directory
[INFO ] 2026-06-01 10:29:50.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:29:52.751 [5262 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:30:00.753 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21235/300s
[INFO ] 2026-06-01 10:30:00.947 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21244/300s
[INFO ] 2026-06-01 10:30:01.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 10:30:01.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424585,ok=424585,error=0, records=41
[INFO ] 2026-06-01 10:30:05.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:30:07.756 [5262 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:30:16.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 10:30:16.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424586,ok=424586,error=0, records=41
[INFO ] 2026-06-01 10:30:16.492 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21231/300s
[INFO ] 2026-06-01 10:30:20.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:30:22.761 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:30:31.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-01 10:30:31.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424587,ok=424587,error=0, records=41
[INFO ] 2026-06-01 10:30:35.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:30:37.766 [5286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:30:41.287 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21244/300s
[INFO ] 2026-06-01 10:30:46.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 10:30:46.504 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424588,ok=424588,error=0, records=41
[INFO ] 2026-06-01 10:30:50.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:30:52.772 [5286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:30:56.341 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864212},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:30:56.501 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:30:56.501 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 10:30:56.501 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:30:56.501 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:30:56.501 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:30:56.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:30:56.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:30:56.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:30:56.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:30:56.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:30:56.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:31:01.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 10:31:01.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424589,ok=424589,error=0, records=41
[INFO ] 2026-06-01 10:31:05.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:31:07.782 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:31:10.628 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21231/300s
[INFO ] 2026-06-01 10:31:16.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 10:31:16.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424590,ok=424590,error=0, records=41
[INFO ] 2026-06-01 10:31:20.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:31:22.788 [5308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:31:31.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 10:31:31.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424591,ok=424591,error=0, records=41
[INFO ] 2026-06-01 10:31:35.084 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21240/300s
[INFO ] 2026-06-01 10:31:35.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:31:37.797 [5262 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:31:46.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 10:31:46.526 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424592,ok=424592,error=0, records=41
[INFO ] 2026-06-01 10:31:50.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:31:52.835 [5262 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:32:01.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 10:32:01.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424593,ok=424593,error=0, records=41
[INFO ] 2026-06-01 10:32:05.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:32:05.823 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21243/300s
[WARN ] 2026-06-01 10:32:07.841 [5262 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:32:16.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 10:32:16.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424594,ok=424594,error=0, records=41
[INFO ] 2026-06-01 10:32:20.824 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:32:22.846 [6061 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:32:31.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 10:32:31.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424595,ok=424595,error=0, records=41
[INFO ] 2026-06-01 10:32:35.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:32:37.852 [6061 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:32:41.467 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21241/300s
[INFO ] 2026-06-01 10:32:43.324 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21241/300s
[INFO ] 2026-06-01 10:32:46.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 10:32:46.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424596,ok=424596,error=0, records=41
[WARN ] 2026-06-01 10:32:47.357 [6047 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5719/stat), No such file or directory
[WARN ] 2026-06-01 10:32:47.358 [6047 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5714/stat), No such file or directory
[INFO ] 2026-06-01 10:32:50.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:32:50.972 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21241/300s
[WARN ] 2026-06-01 10:32:52.857 [6047 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:33:01.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-01 10:33:01.552 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424597,ok=424597,error=0, records=41
[INFO ] 2026-06-01 10:33:05.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:33:07.863 [6104 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:33:16.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 10:33:16.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424598,ok=424598,error=0, records=41
[INFO ] 2026-06-01 10:33:20.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=29.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:33:22.869 [5308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:33:31.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 10:33:31.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424599,ok=424599,error=0, records=41
[INFO ] 2026-06-01 10:33:35.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 10:33:35.827 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 10:33:37.873 [6047 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:33:46.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 10:33:46.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424600,ok=424600,error=0, records=41
[INFO ] 2026-06-01 10:33:50.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:33:52.879 [6037 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:33:56.501 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17687/300s
[INFO ] 2026-06-01 10:33:56.503 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864056},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:33:56.666 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:33:56.666 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 10:33:56.666 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:33:56.667 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:33:56.667 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:33:56.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:33:56.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:33:56.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:33:56.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:33:56.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:33:56.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:34:01.574 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 10:34:01.574 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424601,ok=424601,error=0, records=41
[INFO ] 2026-06-01 10:34:05.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:34:07.885 [6037 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:34:16.582 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 10:34:16.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424602,ok=424602,error=0, records=41
[INFO ] 2026-06-01 10:34:20.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:34:22.891 [6173 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:34:31.588 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 10:34:31.588 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424603,ok=424603,error=0, records=41
[INFO ] 2026-06-01 10:34:35.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:34:37.898 [6173 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:34:46.592 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-01 10:34:46.592 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424604,ok=424604,error=0, records=41
[INFO ] 2026-06-01 10:34:50.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:34:52.903 [6216 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:35:00.906 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21236/300s
[INFO ] 2026-06-01 10:35:00.951 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21245/300s
[INFO ] 2026-06-01 10:35:01.597 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 10:35:01.597 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424605,ok=424605,error=0, records=41
[INFO ] 2026-06-01 10:35:05.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:35:07.910 [6238 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:35:16.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 10:35:16.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424606,ok=424606,error=0, records=41
[INFO ] 2026-06-01 10:35:16.603 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21232/300s
[INFO ] 2026-06-01 10:35:20.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:35:22.915 [6238 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:35:31.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 10:35:31.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424607,ok=424607,error=0, records=41
[INFO ] 2026-06-01 10:35:35.832 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:35:37.920 [6255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:35:41.293 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21245/300s
[INFO ] 2026-06-01 10:35:46.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-01 10:35:46.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424608,ok=424608,error=0, records=41
[INFO ] 2026-06-01 10:35:50.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:35:52.925 [6284 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:36:01.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 10:36:01.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424609,ok=424609,error=0, records=41
[INFO ] 2026-06-01 10:36:05.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:36:07.930 [6308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:36:10.828 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21232/300s
[INFO ] 2026-06-01 10:36:16.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 10:36:16.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424610,ok=424610,error=0, records=41
[INFO ] 2026-06-01 10:36:20.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:36:22.938 [6326 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:36:31.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 10:36:31.671 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424611,ok=424611,error=0, records=41
[INFO ] 2026-06-01 10:36:35.165 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21241/300s
[INFO ] 2026-06-01 10:36:35.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:36:37.943 [6331 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:36:46.677 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 10:36:46.677 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424612,ok=424612,error=0, records=41
[INFO ] 2026-06-01 10:36:50.835 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:36:52.952 [6331 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:36:56.668 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863976},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:36:56.814 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:36:56.814 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 10:36:56.814 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:36:56.814 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:36:56.814 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:36:56.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:36:56.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:36:56.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:36:56.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:36:56.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:36:56.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:37:01.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 10:37:01.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424613,ok=424613,error=0, records=41
[INFO ] 2026-06-01 10:37:05.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:37:05.836 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21244/300s
[WARN ] 2026-06-01 10:37:07.957 [6243 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:37:16.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 10:37:16.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424614,ok=424614,error=0, records=41
[INFO ] 2026-06-01 10:37:20.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:37:22.962 [6387 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:37:31.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 10:37:31.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424615,ok=424615,error=0, records=41
[INFO ] 2026-06-01 10:37:35.837 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:37:37.967 [6331 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:37:41.525 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21242/300s
[INFO ] 2026-06-01 10:37:43.419 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21242/300s
[INFO ] 2026-06-01 10:37:46.703 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 10:37:46.703 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424616,ok=424616,error=0, records=41
[INFO ] 2026-06-01 10:37:50.837 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:37:51.033 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21242/300s
[WARN ] 2026-06-01 10:37:52.971 [6243 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:38:01.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 10:38:01.761 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424617,ok=424617,error=0, records=41
[INFO ] 2026-06-01 10:38:05.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:38:07.977 [6430 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:38:16.766 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 10:38:16.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424618,ok=424618,error=0, records=41
[INFO ] 2026-06-01 10:38:20.839 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:38:22.982 [6401 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:38:31.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 10:38:31.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424619,ok=424619,error=0, records=41
[INFO ] 2026-06-01 10:38:35.839 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:38:37.987 [6444 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:38:46.778 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 10:38:46.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424620,ok=424620,error=0, records=41
[INFO ] 2026-06-01 10:38:50.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:38:50.840 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 10:38:52.993 [6331 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:39:01.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 10:39:01.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424621,ok=424621,error=0, records=41
[INFO ] 2026-06-01 10:39:05.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:39:07.998 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:39:16.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 10:39:16.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424622,ok=424622,error=0, records=41
[INFO ] 2026-06-01 10:39:20.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:39:23.003 [6401 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:39:31.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 10:39:31.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424623,ok=424623,error=0, records=41
[INFO ] 2026-06-01 10:39:35.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:39:38.007 [6401 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:39:46.804 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 10:39:46.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424624,ok=424624,error=0, records=41
[INFO ] 2026-06-01 10:39:50.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:39:53.013 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:39:56.814 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17688/300s
[INFO ] 2026-06-01 10:39:56.815 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863896},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:39:56.968 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:39:56.968 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 10:39:56.968 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:39:56.968 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:39:56.968 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:39:57.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:39:57.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:39:57.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:39:57.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:39:57.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:39:57.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:40:00.954 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21246/300s
[INFO ] 2026-06-01 10:40:01.016 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21237/300s
[INFO ] 2026-06-01 10:40:01.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 10:40:01.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424625,ok=424625,error=0, records=41
[INFO ] 2026-06-01 10:40:05.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:40:08.019 [6531 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:40:16.929 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-01 10:40:16.929 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424626,ok=424626,error=0, records=41
[INFO ] 2026-06-01 10:40:16.929 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21233/300s
[INFO ] 2026-06-01 10:40:20.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:40:23.023 [6459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:40:31.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 10:40:31.936 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424627,ok=424627,error=0, records=41
[INFO ] 2026-06-01 10:40:35.845 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:40:38.028 [6489 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:40:41.300 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21246/300s
[INFO ] 2026-06-01 10:40:46.942 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 10:40:46.942 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424628,ok=424628,error=0, records=41
[INFO ] 2026-06-01 10:40:50.845 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:40:53.042 [6489 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:41:01.947 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-01 10:41:01.947 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424629,ok=424629,error=0, records=41
[INFO ] 2026-06-01 10:41:05.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:41:08.046 [6620 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:41:11.010 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21233/300s
[INFO ] 2026-06-01 10:41:16.952 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 10:41:16.952 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424630,ok=424630,error=0, records=41
[INFO ] 2026-06-01 10:41:20.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:41:23.051 [6625 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:41:31.956 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 10:41:31.957 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424631,ok=424631,error=0, records=41
[INFO ] 2026-06-01 10:41:35.226 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21242/300s
[INFO ] 2026-06-01 10:41:35.847 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:41:37.555 [6655 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:41:46.962 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 10:41:46.962 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424632,ok=424632,error=0, records=41
[INFO ] 2026-06-01 10:41:50.847 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:41:52.559 [6625 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:42:01.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10147, records=41
[INFO ] 2026-06-01 10:42:01.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424633,ok=424633,error=0, records=41
[INFO ] 2026-06-01 10:42:05.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:42:05.848 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21245/300s
[WARN ] 2026-06-01 10:42:07.564 [6693 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:42:16.974 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 10:42:16.974 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424634,ok=424634,error=0, records=41
[INFO ] 2026-06-01 10:42:20.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:42:22.569 [6699 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:42:32.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 10:42:32.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424635,ok=424635,error=0, records=41
[INFO ] 2026-06-01 10:42:35.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:42:37.573 [6690 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:42:41.561 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21243/300s
[INFO ] 2026-06-01 10:42:43.484 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21243/300s
[INFO ] 2026-06-01 10:42:47.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 10:42:47.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424636,ok=424636,error=0, records=41
[INFO ] 2026-06-01 10:42:50.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:42:51.068 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21243/300s
[WARN ] 2026-06-01 10:42:52.578 [6690 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:42:56.970 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863820},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:42:57.140 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:42:57.140 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 10:42:57.140 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:42:57.140 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:42:57.140 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:42:57.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:42:57.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:42:57.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:42:57.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:42:57.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:42:57.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:43:02.020 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 10:43:02.020 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424637,ok=424637,error=0, records=41
[INFO ] 2026-06-01 10:43:05.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:43:07.583 [6766 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:43:17.027 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 10:43:17.027 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424638,ok=424638,error=0, records=41
[INFO ] 2026-06-01 10:43:20.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:43:22.588 [6766 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:43:32.033 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 10:43:32.033 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424639,ok=424639,error=0, records=41
[INFO ] 2026-06-01 10:43:35.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 10:43:35.852 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 10:43:37.594 [6799 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:43:47.038 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-01 10:43:47.038 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424640,ok=424640,error=0, records=41
[INFO ] 2026-06-01 10:43:50.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:43:52.599 [6784 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:44:02.044 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 10:44:02.044 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424641,ok=424641,error=0, records=41
[INFO ] 2026-06-01 10:44:05.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:44:07.604 [6784 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:44:17.052 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-01 10:44:17.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424642,ok=424642,error=0, records=41
[INFO ] 2026-06-01 10:44:20.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:44:22.610 [6767 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:44:32.058 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-01 10:44:32.058 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424643,ok=424643,error=0, records=41
[INFO ] 2026-06-01 10:44:35.854 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:44:37.616 [6767 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:44:47.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 10:44:47.063 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424644,ok=424644,error=0, records=41
[INFO ] 2026-06-01 10:44:50.854 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:44:52.621 [6767 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:45:00.957 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21247/300s
[INFO ] 2026-06-01 10:45:01.123 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21238/300s
[INFO ] 2026-06-01 10:45:02.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 10:45:02.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424645,ok=424645,error=0, records=41
[INFO ] 2026-06-01 10:45:05.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:45:07.626 [6809 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:45:17.123 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 10:45:17.123 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424646,ok=424646,error=0, records=41
[INFO ] 2026-06-01 10:45:17.123 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21234/300s
[INFO ] 2026-06-01 10:45:20.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:45:22.631 [6767 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:45:32.128 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 10:45:32.128 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424647,ok=424647,error=0, records=41
[INFO ] 2026-06-01 10:45:35.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:45:37.637 [6809 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:45:41.306 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21247/300s
[INFO ] 2026-06-01 10:45:47.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 10:45:47.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424648,ok=424648,error=0, records=41
[INFO ] 2026-06-01 10:45:50.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:45:52.642 [6778 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:45:57.140 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17689/300s
[INFO ] 2026-06-01 10:45:57.142 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863732},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:45:57.298 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:45:57.298 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 10:45:57.298 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:45:57.298 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:45:57.298 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:45:57.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:45:57.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:45:57.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:45:57.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:45:57.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:45:57.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:46:02.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 10:46:02.144 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424649,ok=424649,error=0, records=41
[INFO ] 2026-06-01 10:46:05.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:46:07.647 [6778 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:46:11.187 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21234/300s
[INFO ] 2026-06-01 10:46:17.151 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 10:46:17.151 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424650,ok=424650,error=0, records=41
[INFO ] 2026-06-01 10:46:20.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:46:22.651 [6767 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:46:32.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 10:46:32.158 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424651,ok=424651,error=0, records=41
[INFO ] 2026-06-01 10:46:35.281 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21243/300s
[INFO ] 2026-06-01 10:46:35.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:46:37.657 [6767 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:46:47.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 10:46:47.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424652,ok=424652,error=0, records=41
[INFO ] 2026-06-01 10:46:50.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:46:52.662 [6767 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:47:02.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 10:47:02.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424653,ok=424653,error=0, records=41
[INFO ] 2026-06-01 10:47:05.860 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:47:05.860 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21246/300s
[WARN ] 2026-06-01 10:47:07.669 [6778 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:47:17.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 10:47:17.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424654,ok=424654,error=0, records=41
[INFO ] 2026-06-01 10:47:20.860 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:47:22.680 [6759 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:47:32.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 10:47:32.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424655,ok=424655,error=0, records=41
[INFO ] 2026-06-01 10:47:35.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:47:37.685 [6778 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:47:41.621 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21244/300s
[INFO ] 2026-06-01 10:47:43.562 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21244/300s
[INFO ] 2026-06-01 10:47:47.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 10:47:47.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424656,ok=424656,error=0, records=41
[INFO ] 2026-06-01 10:47:50.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:47:51.127 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21244/300s
[WARN ] 2026-06-01 10:47:52.690 [6784 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:48:02.261 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 10:48:02.261 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424657,ok=424657,error=0, records=41
[INFO ] 2026-06-01 10:48:05.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:48:07.696 [6759 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:48:17.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 10:48:17.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424658,ok=424658,error=0, records=41
[INFO ] 2026-06-01 10:48:20.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:48:22.702 [6784 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:48:32.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 10:48:32.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424659,ok=424659,error=0, records=41
[WARN ] 2026-06-01 10:48:32.708 [6759 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5963/stat), No such file or directory
[INFO ] 2026-06-01 10:48:35.863 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:48:37.708 [6767 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:48:47.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 10:48:47.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424660,ok=424660,error=0, records=41
[WARN ] 2026-06-01 10:48:47.713 [6778 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5671/stat), No such file or directory
[WARN ] 2026-06-01 10:48:47.713 [6778 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5963/stat), No such file or directory
[WARN ] 2026-06-01 10:48:47.713 [6778 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5923/stat), No such file or directory
[INFO ] 2026-06-01 10:48:50.863 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:48:52.714 [6759 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:48:57.300 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863592},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:48:57.443 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:48:57.443 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 10:48:57.443 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:48:57.443 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:48:57.443 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:48:57.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:48:57.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:48:57.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:48:57.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:48:57.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:48:57.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:49:02.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 10:49:02.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424661,ok=424661,error=0, records=41
[INFO ] 2026-06-01 10:49:05.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:49:07.719 [6767 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:49:17.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10150, records=41
[INFO ] 2026-06-01 10:49:17.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424662,ok=424662,error=0, records=41
[INFO ] 2026-06-01 10:49:20.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:49:22.727 [6784 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:49:32.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10132, records=41
[INFO ] 2026-06-01 10:49:32.294 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424663,ok=424663,error=0, records=41
[INFO ] 2026-06-01 10:49:35.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:49:37.733 [6809 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:49:47.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10131, records=41
[INFO ] 2026-06-01 10:49:47.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424664,ok=424664,error=0, records=41
[INFO ] 2026-06-01 10:49:50.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:49:52.738 [6759 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:50:00.960 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21248/300s
[INFO ] 2026-06-01 10:50:01.241 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21239/300s
[INFO ] 2026-06-01 10:50:02.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10127, records=41
[INFO ] 2026-06-01 10:50:02.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424665,ok=424665,error=0, records=41
[INFO ] 2026-06-01 10:50:05.866 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:50:07.745 [6778 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:50:17.319 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 10:50:17.319 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424666,ok=424666,error=0, records=41
[INFO ] 2026-06-01 10:50:17.319 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21235/300s
[INFO ] 2026-06-01 10:50:20.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:50:22.750 [6784 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:50:32.324 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-01 10:50:32.324 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424667,ok=424667,error=0, records=41
[INFO ] 2026-06-01 10:50:35.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:50:37.756 [6759 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:50:41.311 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21248/300s
[INFO ] 2026-06-01 10:50:47.329 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 10:50:47.329 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424668,ok=424668,error=0, records=41
[INFO ] 2026-06-01 10:50:50.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:50:52.761 [6767 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:51:02.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 10:51:02.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424669,ok=424669,error=0, records=41
[INFO ] 2026-06-01 10:51:05.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:51:07.766 [6784 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:51:11.361 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21235/300s
[INFO ] 2026-06-01 10:51:17.339 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 10:51:17.340 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424670,ok=424670,error=0, records=41
[INFO ] 2026-06-01 10:51:20.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:51:22.771 [6778 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:51:32.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 10:51:32.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424671,ok=424671,error=0, records=41
[INFO ] 2026-06-01 10:51:35.336 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21244/300s
[INFO ] 2026-06-01 10:51:35.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:51:37.777 [6759 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:51:47.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-01 10:51:47.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424672,ok=424672,error=0, records=41
[INFO ] 2026-06-01 10:51:50.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:51:52.782 [6778 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:51:57.443 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17690/300s
[INFO ] 2026-06-01 10:51:57.445 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863468},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:51:57.618 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:51:57.618 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 10:51:57.618 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:51:57.618 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:51:57.618 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:51:57.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:51:57.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:51:57.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:51:57.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:51:57.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:51:57.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:52:02.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 10:52:02.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424673,ok=424673,error=0, records=41
[INFO ] 2026-06-01 10:52:05.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:52:05.870 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21247/300s
[WARN ] 2026-06-01 10:52:07.788 [6759 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:52:17.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 10:52:17.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424674,ok=424674,error=0, records=41
[INFO ] 2026-06-01 10:52:20.871 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:52:22.794 [6809 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:52:32.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 10:52:32.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424675,ok=424675,error=0, records=41
[INFO ] 2026-06-01 10:52:35.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:52:37.799 [6778 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:52:41.637 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21245/300s
[INFO ] 2026-06-01 10:52:43.586 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21245/300s
[INFO ] 2026-06-01 10:52:47.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 10:52:47.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424676,ok=424676,error=0, records=41
[INFO ] 2026-06-01 10:52:50.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:52:51.146 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21245/300s
[WARN ] 2026-06-01 10:52:52.804 [6809 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:53:02.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 10:53:02.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424677,ok=424677,error=0, records=41
[INFO ] 2026-06-01 10:53:05.873 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:53:07.809 [7422 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:53:17.391 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-06-01 10:53:17.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424678,ok=424678,error=0, records=41
[INFO ] 2026-06-01 10:53:20.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:53:22.814 [7422 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:53:32.397 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-01 10:53:32.397 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424679,ok=424679,error=0, records=41
[INFO ] 2026-06-01 10:53:35.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 10:53:35.875 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 10:53:37.820 [7447 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:53:47.404 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=13500, records=49
[INFO ] 2026-06-01 10:53:47.404 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424680,ok=424680,error=0, records=49
[INFO ] 2026-06-01 10:53:50.875 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:53:50.875 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 10:53:52.825 [7432 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:54:02.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 10:54:02.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424681,ok=424681,error=0, records=41
[INFO ] 2026-06-01 10:54:05.877 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:54:07.830 [7452 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:54:17.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 10:54:17.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424682,ok=424682,error=0, records=41
[INFO ] 2026-06-01 10:54:20.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:54:22.836 [7508 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:54:32.504 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 10:54:32.504 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424683,ok=424683,error=0, records=41
[INFO ] 2026-06-01 10:54:35.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:54:37.841 [7494 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:54:47.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 10:54:47.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424684,ok=424684,error=0, records=41
[INFO ] 2026-06-01 10:54:50.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:54:52.846 [7447 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:54:57.620 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863400},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:54:57.792 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:54:57.792 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 10:54:57.792 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:54:57.792 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:54:57.792 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:54:57.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:54:57.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:54:57.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:54:57.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:54:57.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:54:57.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:55:00.963 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21249/300s
[INFO ] 2026-06-01 10:55:01.348 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21240/300s
[INFO ] 2026-06-01 10:55:02.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 10:55:02.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424685,ok=424685,error=0, records=41
[INFO ] 2026-06-01 10:55:05.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.73%[>=50.00% 0/4], memory=29.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:55:07.851 [7545 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:55:17.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 10:55:17.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424686,ok=424686,error=0, records=41
[INFO ] 2026-06-01 10:55:17.523 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21236/300s
[INFO ] 2026-06-01 10:55:20.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:55:22.856 [7447 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:55:32.528 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 10:55:32.528 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424687,ok=424687,error=0, records=41
[INFO ] 2026-06-01 10:55:35.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:55:37.861 [7447 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:55:41.317 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21249/300s
[INFO ] 2026-06-01 10:55:47.534 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 10:55:47.534 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424688,ok=424688,error=0, records=41
[INFO ] 2026-06-01 10:55:50.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:55:52.866 [7447 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:56:02.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-01 10:56:02.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424689,ok=424689,error=0, records=41
[INFO ] 2026-06-01 10:56:05.882 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:56:07.871 [7545 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:56:11.546 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21236/300s
[INFO ] 2026-06-01 10:56:17.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10147, records=41
[INFO ] 2026-06-01 10:56:17.545 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424690,ok=424690,error=0, records=41
[INFO ] 2026-06-01 10:56:20.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:56:22.876 [7616 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:56:32.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 10:56:32.552 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424691,ok=424691,error=0, records=41
[INFO ] 2026-06-01 10:56:35.392 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21245/300s
[INFO ] 2026-06-01 10:56:35.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:56:37.883 [7627 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:56:47.558 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-01 10:56:47.558 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424692,ok=424692,error=0, records=41
[INFO ] 2026-06-01 10:56:50.884 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:56:52.888 [7637 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:57:02.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-01 10:57:02.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424693,ok=424693,error=0, records=41
[INFO ] 2026-06-01 10:57:05.884 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:57:05.884 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21248/300s
[WARN ] 2026-06-01 10:57:07.895 [7669 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:57:17.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 10:57:17.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424694,ok=424694,error=0, records=41
[INFO ] 2026-06-01 10:57:20.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:57:22.900 [7648 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:57:32.609 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 10:57:32.609 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424695,ok=424695,error=0, records=41
[INFO ] 2026-06-01 10:57:35.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:57:37.906 [7701 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:57:41.709 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21246/300s
[INFO ] 2026-06-01 10:57:43.685 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21246/300s
[INFO ] 2026-06-01 10:57:47.614 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 10:57:47.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424696,ok=424696,error=0, records=41
[INFO ] 2026-06-01 10:57:50.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 10:57:51.217 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21246/300s
[WARN ] 2026-06-01 10:57:52.911 [7718 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:57:57.792 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17691/300s
[INFO ] 2026-06-01 10:57:57.794 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863340},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 10:57:57.957 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 10:57:57.957 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 10:57:57.957 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 10:57:57.957 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 10:57:57.957 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 10:57:57.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:57:57.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 10:57:57.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:57:57.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 10:57:57.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:57:57.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 10:58:02.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 10:58:02.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424697,ok=424697,error=0, records=41
[INFO ] 2026-06-01 10:58:05.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:58:07.916 [7736 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:58:17.624 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 10:58:17.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424698,ok=424698,error=0, records=41
[INFO ] 2026-06-01 10:58:20.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:58:22.921 [7747 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:58:32.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 10:58:32.631 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424699,ok=424699,error=0, records=41
[INFO ] 2026-06-01 10:58:35.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:58:37.928 [7770 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:58:47.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 10:58:47.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424700,ok=424700,error=0, records=41
[INFO ] 2026-06-01 10:58:50.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:58:52.933 [7758 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:59:02.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 10:59:02.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424701,ok=424701,error=0, records=41
[INFO ] 2026-06-01 10:59:05.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:59:07.939 [7780 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:59:17.648 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 10:59:17.648 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424702,ok=424702,error=0, records=41
[INFO ] 2026-06-01 10:59:20.890 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:59:22.945 [7813 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:59:32.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 10:59:32.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424703,ok=424703,error=0, records=41
[INFO ] 2026-06-01 10:59:35.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:59:37.950 [7797 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 10:59:47.658 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 10:59:47.658 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424704,ok=424704,error=0, records=41
[INFO ] 2026-06-01 10:59:50.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 10:59:52.955 [7843 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:00:00.966 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21250/300s
[INFO ] 2026-06-01 11:00:01.458 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21241/300s
[INFO ] 2026-06-01 11:00:02.718 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 11:00:02.718 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424705,ok=424705,error=0, records=41
[INFO ] 2026-06-01 11:00:05.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:00:07.961 [7797 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:00:17.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 11:00:17.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424706,ok=424706,error=0, records=41
[INFO ] 2026-06-01 11:00:17.725 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21237/300s
[INFO ] 2026-06-01 11:00:20.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:00:22.965 [7780 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:00:32.730 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 11:00:32.730 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424707,ok=424707,error=0, records=41
[INFO ] 2026-06-01 11:00:35.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:00:37.969 [7780 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:00:41.324 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21250/300s
[INFO ] 2026-06-01 11:00:47.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 11:00:47.737 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424708,ok=424708,error=0, records=41
[INFO ] 2026-06-01 11:00:50.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:00:52.974 [7876 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:00:57.959 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863268},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:00:58.111 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:00:58.111 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 11:00:58.111 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:00:58.111 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:00:58.111 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:00:58.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:00:58.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:00:58.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:00:58.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:00:58.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:00:58.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:01:02.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 11:01:02.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424709,ok=424709,error=0, records=41
[INFO ] 2026-06-01 11:01:05.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:01:07.980 [7931 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:01:11.729 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21237/300s
[INFO ] 2026-06-01 11:01:17.749 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 11:01:17.749 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424710,ok=424710,error=0, records=41
[INFO ] 2026-06-01 11:01:20.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:01:22.986 [7931 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:01:32.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 11:01:32.754 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424711,ok=424711,error=0, records=41
[INFO ] 2026-06-01 11:01:35.450 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21246/300s
[INFO ] 2026-06-01 11:01:35.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:01:37.990 [7862 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:01:47.761 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 11:01:47.761 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424712,ok=424712,error=0, records=41
[INFO ] 2026-06-01 11:01:50.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:01:52.996 [7862 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:02:02.768 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 11:02:02.768 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424713,ok=424713,error=0, records=41
[INFO ] 2026-06-01 11:02:05.897 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:02:05.897 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21249/300s
[WARN ] 2026-06-01 11:02:08.001 [7843 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:02:17.774 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 11:02:17.774 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424714,ok=424714,error=0, records=41
[INFO ] 2026-06-01 11:02:20.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:02:23.009 [7986 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:02:32.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-01 11:02:32.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424715,ok=424715,error=0, records=41
[INFO ] 2026-06-01 11:02:35.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:02:38.013 [7986 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:02:41.761 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21247/300s
[INFO ] 2026-06-01 11:02:43.763 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21247/300s
[INFO ] 2026-06-01 11:02:47.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 11:02:47.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424716,ok=424716,error=0, records=41
[INFO ] 2026-06-01 11:02:50.899 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:02:51.269 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21247/300s
[WARN ] 2026-06-01 11:02:53.019 [8028 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:03:02.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 11:03:02.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424717,ok=424717,error=0, records=41
[INFO ] 2026-06-01 11:03:05.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:03:08.024 [7843 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:03:17.799 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 11:03:17.799 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424718,ok=424718,error=0, records=41
[INFO ] 2026-06-01 11:03:20.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:03:23.030 [8055 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:03:32.804 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 11:03:32.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424719,ok=424719,error=0, records=41
[INFO ] 2026-06-01 11:03:35.901 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 11:03:35.901 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 11:03:38.036 [8076 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:03:47.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11255, records=44
[INFO ] 2026-06-01 11:03:47.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424720,ok=424720,error=0, records=44
[INFO ] 2026-06-01 11:03:50.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:03:53.042 [7843 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:03:58.111 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17692/300s
[INFO ] 2026-06-01 11:03:58.113 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863204},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:03:58.277 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:03:58.277 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 11:03:58.277 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:03:58.277 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:03:58.277 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:03:58.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:03:58.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:03:58.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:03:58.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:03:58.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:03:58.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:04:02.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10387, records=41
[INFO ] 2026-06-01 11:04:02.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424721,ok=424721,error=0, records=41
[INFO ] 2026-06-01 11:04:05.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:04:08.048 [8102 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:04:17.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-01 11:04:17.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424722,ok=424722,error=0, records=41
[INFO ] 2026-06-01 11:04:20.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:04:23.052 [8125 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:04:32.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-01 11:04:32.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424723,ok=424723,error=0, records=41
[INFO ] 2026-06-01 11:04:35.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:04:37.558 [8138 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:04:47.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 11:04:47.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424724,ok=424724,error=0, records=41
[INFO ] 2026-06-01 11:04:50.904 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:04:52.563 [8138 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:05:00.969 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21251/300s
[INFO ] 2026-06-01 11:05:01.565 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21242/300s
[INFO ] 2026-06-01 11:05:02.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 11:05:02.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424725,ok=424725,error=0, records=41
[INFO ] 2026-06-01 11:05:05.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:05:07.568 [8165 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:05:17.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 11:05:17.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424726,ok=424726,error=0, records=41
[INFO ] 2026-06-01 11:05:17.847 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21238/300s
[INFO ] 2026-06-01 11:05:20.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:05:22.573 [8186 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:05:32.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 11:05:32.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424727,ok=424727,error=0, records=41
[INFO ] 2026-06-01 11:05:35.906 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:05:37.579 [8209 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:05:41.331 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21251/300s
[INFO ] 2026-06-01 11:05:47.859 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 11:05:47.859 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424728,ok=424728,error=0, records=41
[INFO ] 2026-06-01 11:05:50.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:05:52.585 [8194 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:06:02.864 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 11:06:02.864 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424729,ok=424729,error=0, records=41
[INFO ] 2026-06-01 11:06:05.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:06:07.591 [8246 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:06:11.914 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21238/300s
[INFO ] 2026-06-01 11:06:17.869 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 11:06:17.869 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424730,ok=424730,error=0, records=41
[INFO ] 2026-06-01 11:06:20.908 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:06:22.598 [8194 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:06:32.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 11:06:32.878 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424731,ok=424731,error=0, records=41
[INFO ] 2026-06-01 11:06:35.511 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21247/300s
[INFO ] 2026-06-01 11:06:35.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:06:37.602 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:06:47.882 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 11:06:47.882 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424732,ok=424732,error=0, records=41
[INFO ] 2026-06-01 11:06:50.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:06:52.613 [8240 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:06:58.279 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863120},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:06:58.437 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:06:58.437 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 11:06:58.437 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:06:58.437 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:06:58.437 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:06:58.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:06:58.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:06:58.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:06:58.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:06:58.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:06:58.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:07:02.888 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-01 11:07:02.888 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424733,ok=424733,error=0, records=41
[INFO ] 2026-06-01 11:07:05.910 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:07:05.910 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21250/300s
[WARN ] 2026-06-01 11:07:07.618 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 11:07:17.622 [8240 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6989/stat), No such file or directory
[INFO ] 2026-06-01 11:07:17.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 11:07:17.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424734,ok=424734,error=0, records=41
[INFO ] 2026-06-01 11:07:20.910 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:07:22.624 [8194 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 11:07:32.628 [8240 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6989/stat), No such file or directory
[WARN ] 2026-06-01 11:07:32.628 [8240 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7100/stat), No such file or directory
[INFO ] 2026-06-01 11:07:32.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 11:07:32.936 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424735,ok=424735,error=0, records=41
[INFO ] 2026-06-01 11:07:35.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:07:37.629 [8261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:07:41.827 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21248/300s
[INFO ] 2026-06-01 11:07:43.828 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21248/300s
[WARN ] 2026-06-01 11:07:47.633 [8239 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6989/stat), No such file or directory
[WARN ] 2026-06-01 11:07:47.634 [8239 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7100/stat), No such file or directory
[INFO ] 2026-06-01 11:07:47.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 11:07:47.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424736,ok=424736,error=0, records=41
[INFO ] 2026-06-01 11:07:50.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:07:51.333 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21248/300s
[WARN ] 2026-06-01 11:07:52.635 [8240 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:08:02.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 11:08:02.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424737,ok=424737,error=0, records=41
[INFO ] 2026-06-01 11:08:05.912 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:08:07.641 [8194 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:08:17.956 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 11:08:17.956 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424738,ok=424738,error=0, records=41
[INFO ] 2026-06-01 11:08:20.912 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:08:22.646 [8240 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:08:32.962 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-01 11:08:32.962 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424739,ok=424739,error=0, records=41
[INFO ] 2026-06-01 11:08:35.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:08:37.653 [8240 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:08:47.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-01 11:08:47.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424740,ok=424740,error=0, records=41
[INFO ] 2026-06-01 11:08:50.914 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:08:50.914 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 11:08:52.658 [8261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:09:02.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 11:09:02.973 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424741,ok=424741,error=0, records=41
[INFO ] 2026-06-01 11:09:05.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:09:07.665 [8194 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:09:17.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 11:09:17.979 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424742,ok=424742,error=0, records=41
[INFO ] 2026-06-01 11:09:20.916 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:09:22.671 [8240 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:09:32.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 11:09:32.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424743,ok=424743,error=0, records=41
[INFO ] 2026-06-01 11:09:35.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:09:37.677 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:09:48.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10149, records=41
[INFO ] 2026-06-01 11:09:48.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424744,ok=424744,error=0, records=41
[INFO ] 2026-06-01 11:09:50.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:09:52.681 [8261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:09:58.438 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17693/300s
[INFO ] 2026-06-01 11:09:58.439 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863036},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:09:58.603 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:09:58.603 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 11:09:58.603 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:09:58.603 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:09:58.603 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:09:58.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:09:58.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:09:58.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:09:58.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:09:58.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:09:58.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:10:00.972 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21252/300s
[INFO ] 2026-06-01 11:10:01.684 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21243/300s
[INFO ] 2026-06-01 11:10:03.009 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 11:10:03.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424745,ok=424745,error=0, records=41
[INFO ] 2026-06-01 11:10:05.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:10:07.686 [8261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:10:18.014 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 11:10:18.014 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424746,ok=424746,error=0, records=41
[INFO ] 2026-06-01 11:10:18.014 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21239/300s
[INFO ] 2026-06-01 11:10:20.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:10:22.691 [8261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:10:33.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10144, records=41
[INFO ] 2026-06-01 11:10:33.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424747,ok=424747,error=0, records=41
[INFO ] 2026-06-01 11:10:35.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:10:37.696 [8261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:10:41.337 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21252/300s
[INFO ] 2026-06-01 11:10:48.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-01 11:10:48.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424748,ok=424748,error=0, records=41
[INFO ] 2026-06-01 11:10:50.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:10:52.703 [8239 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:11:03.030 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 11:11:03.030 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424749,ok=424749,error=0, records=41
[INFO ] 2026-06-01 11:11:05.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:11:07.708 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:11:12.093 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21239/300s
[INFO ] 2026-06-01 11:11:18.038 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-01 11:11:18.038 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424750,ok=424750,error=0, records=41
[INFO ] 2026-06-01 11:11:20.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:11:22.714 [8194 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:11:33.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 11:11:33.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424751,ok=424751,error=0, records=41
[INFO ] 2026-06-01 11:11:35.563 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21248/300s
[INFO ] 2026-06-01 11:11:35.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:11:37.718 [8261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:11:48.051 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 11:11:48.051 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424752,ok=424752,error=0, records=41
[INFO ] 2026-06-01 11:11:50.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:11:52.724 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:12:03.056 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 11:12:03.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424753,ok=424753,error=0, records=41
[INFO ] 2026-06-01 11:12:05.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:12:05.924 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21251/300s
[WARN ] 2026-06-01 11:12:07.729 [8240 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:12:18.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-01 11:12:18.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424754,ok=424754,error=0, records=41
[INFO ] 2026-06-01 11:12:20.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:12:22.734 [8261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:12:33.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10146, records=41
[INFO ] 2026-06-01 11:12:33.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424755,ok=424755,error=0, records=41
[INFO ] 2026-06-01 11:12:35.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:12:37.740 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:12:41.890 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21249/300s
[INFO ] 2026-06-01 11:12:43.892 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21249/300s
[INFO ] 2026-06-01 11:12:48.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-01 11:12:48.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424756,ok=424756,error=0, records=41
[INFO ] 2026-06-01 11:12:50.926 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:12:51.396 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21249/300s
[WARN ] 2026-06-01 11:12:52.745 [8239 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:12:58.605 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862956},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:12:58.767 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:12:58.768 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 11:13:03.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 11:13:03.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424757,ok=424757,error=0, records=41
[INFO ] 2026-06-01 11:13:05.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:13:07.750 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:13:18.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 11:13:18.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424758,ok=424758,error=0, records=41
[INFO ] 2026-06-01 11:13:20.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:13:22.755 [8261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:13:33.092 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 11:13:33.092 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424759,ok=424759,error=0, records=41
[INFO ] 2026-06-01 11:13:35.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 11:13:35.928 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 11:13:37.760 [8240 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:13:48.097 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 11:13:48.097 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424760,ok=424760,error=0, records=41
[INFO ] 2026-06-01 11:13:50.929 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:13:52.764 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:14:03.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 11:14:03.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424761,ok=424761,error=0, records=41
[INFO ] 2026-06-01 11:14:05.929 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:14:07.768 [8261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:14:18.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 11:14:18.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424762,ok=424762,error=0, records=41
[INFO ] 2026-06-01 11:14:20.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:14:22.773 [8240 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:14:33.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 11:14:33.114 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424763,ok=424763,error=0, records=41
[INFO ] 2026-06-01 11:14:35.931 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:14:37.779 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:14:48.120 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 11:14:48.120 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424764,ok=424764,error=0, records=41
[INFO ] 2026-06-01 11:14:50.931 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:14:52.784 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:15:00.975 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21253/300s
[INFO ] 2026-06-01 11:15:01.787 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21244/300s
[INFO ] 2026-06-01 11:15:03.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 11:15:03.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424765,ok=424765,error=0, records=41
[INFO ] 2026-06-01 11:15:05.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:15:07.790 [8261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:15:18.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 11:15:18.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424766,ok=424766,error=0, records=41
[INFO ] 2026-06-01 11:15:18.130 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21240/300s
[INFO ] 2026-06-01 11:15:20.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:15:22.796 [8239 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:15:33.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 11:15:33.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424767,ok=424767,error=0, records=41
[INFO ] 2026-06-01 11:15:35.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:15:37.801 [8240 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:15:41.343 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21253/300s
[INFO ] 2026-06-01 11:15:48.161 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 11:15:48.161 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424768,ok=424768,error=0, records=41
[INFO ] 2026-06-01 11:15:50.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:15:52.806 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:15:58.768 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17694/300s
[INFO ] 2026-06-01 11:15:58.770 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862876},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:15:58.930 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:15:58.930 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 11:15:58.931 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:15:58.931 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:15:58.931 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:15:58.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:15:58.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:15:58.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:15:58.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:15:58.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:15:58.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:16:03.166 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 11:16:03.167 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424769,ok=424769,error=0, records=41
[INFO ] 2026-06-01 11:16:05.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:16:07.812 [8255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:16:12.278 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21240/300s
[INFO ] 2026-06-01 11:16:18.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 11:16:18.172 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424770,ok=424770,error=0, records=41
[INFO ] 2026-06-01 11:16:20.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:16:22.818 [8841 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:16:33.178 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 11:16:33.178 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424771,ok=424771,error=0, records=41
[INFO ] 2026-06-01 11:16:35.620 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21249/300s
[INFO ] 2026-06-01 11:16:35.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:16:37.823 [8855 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:16:48.183 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 11:16:48.183 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424772,ok=424772,error=0, records=41
[INFO ] 2026-06-01 11:16:50.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:16:52.828 [8841 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:17:03.188 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 11:17:03.188 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424773,ok=424773,error=0, records=41
[INFO ] 2026-06-01 11:17:05.937 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:17:05.937 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21252/300s
[WARN ] 2026-06-01 11:17:07.833 [8869 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:17:18.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 11:17:18.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424774,ok=424774,error=0, records=41
[INFO ] 2026-06-01 11:17:20.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:17:22.837 [8910 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:17:33.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 11:17:33.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424775,ok=424775,error=0, records=41
[INFO ] 2026-06-01 11:17:35.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:17:37.841 [8841 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:17:41.952 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21250/300s
[INFO ] 2026-06-01 11:17:43.954 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21250/300s
[INFO ] 2026-06-01 11:17:48.209 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 11:17:48.209 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424776,ok=424776,error=0, records=41
[INFO ] 2026-06-01 11:17:50.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:17:51.461 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21250/300s
[WARN ] 2026-06-01 11:17:52.846 [8883 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:18:03.216 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 11:18:03.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424777,ok=424777,error=0, records=41
[INFO ] 2026-06-01 11:18:05.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:18:07.852 [8947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:18:18.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 11:18:18.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424778,ok=424778,error=0, records=41
[INFO ] 2026-06-01 11:18:20.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:18:22.858 [8869 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:18:33.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 11:18:33.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424779,ok=424779,error=0, records=41
[INFO ] 2026-06-01 11:18:35.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:18:37.863 [8240 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:18:48.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 11:18:48.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424780,ok=424780,error=0, records=41
[INFO ] 2026-06-01 11:18:50.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:18:52.867 [8240 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:18:58.932 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862800},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:18:59.086 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:18:59.086 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 11:18:59.086 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:18:59.086 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:18:59.086 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:18:59.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:18:59.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:18:59.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:18:59.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:18:59.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:18:59.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:19:03.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-01 11:19:03.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424781,ok=424781,error=0, records=41
[INFO ] 2026-06-01 11:19:05.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:19:07.872 [8988 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:19:18.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 11:19:18.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424782,ok=424782,error=0, records=41
[INFO ] 2026-06-01 11:19:20.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:19:22.878 [9003 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:19:33.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-01 11:19:33.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424783,ok=424783,error=0, records=41
[INFO ] 2026-06-01 11:19:35.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:19:37.883 [9036 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:19:48.257 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-01 11:19:48.257 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424784,ok=424784,error=0, records=41
[INFO ] 2026-06-01 11:19:50.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:19:52.889 [9025 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:20:00.979 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21254/300s
[INFO ] 2026-06-01 11:20:01.892 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21245/300s
[INFO ] 2026-06-01 11:20:03.313 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 11:20:03.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424785,ok=424785,error=0, records=41
[INFO ] 2026-06-01 11:20:05.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:20:07.896 [9063 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:20:18.320 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10143, records=41
[INFO ] 2026-06-01 11:20:18.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424786,ok=424786,error=0, records=41
[INFO ] 2026-06-01 11:20:18.320 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21241/300s
[INFO ] 2026-06-01 11:20:20.945 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:20:22.900 [8961 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:20:33.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 11:20:33.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424787,ok=424787,error=0, records=41
[INFO ] 2026-06-01 11:20:35.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:20:37.905 [9095 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:20:41.350 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21254/300s
[INFO ] 2026-06-01 11:20:48.331 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 11:20:48.331 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424788,ok=424788,error=0, records=41
[INFO ] 2026-06-01 11:20:50.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:20:52.911 [9105 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:21:03.339 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-01 11:21:03.339 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424789,ok=424789,error=0, records=41
[INFO ] 2026-06-01 11:21:05.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:21:07.916 [9116 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:21:12.458 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21241/300s
[INFO ] 2026-06-01 11:21:18.344 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 11:21:18.344 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424790,ok=424790,error=0, records=41
[INFO ] 2026-06-01 11:21:20.948 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:21:22.922 [9161 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:21:33.351 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 11:21:33.351 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424791,ok=424791,error=0, records=41
[INFO ] 2026-06-01 11:21:35.677 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21250/300s
[INFO ] 2026-06-01 11:21:35.948 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:21:37.928 [9170 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:21:48.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-01 11:21:48.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424792,ok=424792,error=0, records=41
[INFO ] 2026-06-01 11:21:50.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:21:52.933 [9193 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:21:59.086 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17695/300s
[INFO ] 2026-06-01 11:21:59.088 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862720},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:21:59.270 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:21:59.270 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 11:21:59.270 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:21:59.270 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:21:59.270 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:21:59.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:21:59.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:21:59.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:21:59.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:21:59.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:21:59.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:22:03.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 11:22:03.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424793,ok=424793,error=0, records=41
[INFO ] 2026-06-01 11:22:05.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:22:05.949 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21253/300s
[WARN ] 2026-06-01 11:22:07.939 [9181 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:22:18.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 11:22:18.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424794,ok=424794,error=0, records=41
[INFO ] 2026-06-01 11:22:20.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:22:22.944 [9210 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:22:33.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 11:22:33.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424795,ok=424795,error=0, records=41
[INFO ] 2026-06-01 11:22:35.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:22:37.949 [9237 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:22:42.016 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21251/300s
[INFO ] 2026-06-01 11:22:44.018 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21251/300s
[INFO ] 2026-06-01 11:22:48.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 11:22:48.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424796,ok=424796,error=0, records=41
[INFO ] 2026-06-01 11:22:50.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:22:51.525 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21251/300s
[WARN ] 2026-06-01 11:22:52.955 [9252 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:23:03.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 11:23:03.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424797,ok=424797,error=0, records=41
[INFO ] 2026-06-01 11:23:05.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:23:07.959 [9187 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:23:18.392 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 11:23:18.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424798,ok=424798,error=0, records=41
[INFO ] 2026-06-01 11:23:20.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:23:22.964 [9237 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:23:33.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 11:23:33.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424799,ok=424799,error=0, records=41
[INFO ] 2026-06-01 11:23:35.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 11:23:35.953 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 11:23:37.969 [9294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:23:48.405 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 11:23:48.405 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424800,ok=424800,error=0, records=41
[INFO ] 2026-06-01 11:23:50.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:23:50.954 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 11:23:52.975 [9294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:24:03.410 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 11:24:03.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424801,ok=424801,error=0, records=41
[INFO ] 2026-06-01 11:24:05.955 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:24:07.981 [9266 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:24:18.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 11:24:18.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424802,ok=424802,error=0, records=41
[INFO ] 2026-06-01 11:24:20.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:24:22.987 [9294 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:24:33.422 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 11:24:33.422 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424803,ok=424803,error=0, records=41
[INFO ] 2026-06-01 11:24:35.957 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:24:37.992 [9252 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:24:48.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 11:24:48.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424804,ok=424804,error=0, records=41
[INFO ] 2026-06-01 11:24:50.957 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:24:52.998 [9308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:24:59.271 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862640},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:24:59.444 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:24:59.444 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 11:24:59.444 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:24:59.444 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:24:59.444 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:24:59.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:24:59.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:24:59.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:24:59.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:24:59.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:24:59.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:25:00.982 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21255/300s
[INFO ] 2026-06-01 11:25:02.000 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21246/300s
[INFO ] 2026-06-01 11:25:03.435 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 11:25:03.435 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424805,ok=424805,error=0, records=41
[INFO ] 2026-06-01 11:25:05.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=28.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:25:08.003 [9308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:25:18.441 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 11:25:18.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424806,ok=424806,error=0, records=41
[INFO ] 2026-06-01 11:25:18.442 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21242/300s
[INFO ] 2026-06-01 11:25:20.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:25:23.008 [9391 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:25:33.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 11:25:33.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424807,ok=424807,error=0, records=41
[INFO ] 2026-06-01 11:25:35.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:25:38.013 [9391 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:25:41.357 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21255/300s
[INFO ] 2026-06-01 11:25:48.457 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 11:25:48.457 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424808,ok=424808,error=0, records=41
[INFO ] 2026-06-01 11:25:50.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:25:53.018 [9266 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:26:03.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 11:26:03.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424809,ok=424809,error=0, records=41
[INFO ] 2026-06-01 11:26:05.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:26:08.023 [9433 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:26:12.642 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21242/300s
[INFO ] 2026-06-01 11:26:18.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 11:26:18.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424810,ok=424810,error=0, records=41
[INFO ] 2026-06-01 11:26:20.961 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:26:23.028 [9335 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:26:33.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 11:26:33.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424811,ok=424811,error=0, records=41
[INFO ] 2026-06-01 11:26:35.738 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21251/300s
[INFO ] 2026-06-01 11:26:35.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:26:38.033 [9391 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:26:48.482 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 11:26:48.483 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424812,ok=424812,error=0, records=41
[INFO ] 2026-06-01 11:26:50.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:26:53.037 [9482 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:27:03.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-01 11:27:03.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424813,ok=424813,error=0, records=41
[INFO ] 2026-06-01 11:27:05.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:27:05.963 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21254/300s
[WARN ] 2026-06-01 11:27:08.043 [9487 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:27:18.493 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-01 11:27:18.493 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424814,ok=424814,error=0, records=41
[INFO ] 2026-06-01 11:27:20.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:27:23.049 [9499 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:27:33.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 11:27:33.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424815,ok=424815,error=0, records=41
[INFO ] 2026-06-01 11:27:35.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:27:38.053 [9533 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:27:42.085 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21252/300s
[INFO ] 2026-06-01 11:27:44.086 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21252/300s
[INFO ] 2026-06-01 11:27:48.504 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-01 11:27:48.504 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424816,ok=424816,error=0, records=41
[INFO ] 2026-06-01 11:27:50.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:27:51.593 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21252/300s
[WARN ] 2026-06-01 11:27:52.559 [9550 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:27:59.444 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17696/300s
[INFO ] 2026-06-01 11:27:59.446 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862560},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:27:59.586 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:27:59.586 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 11:27:59.586 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:27:59.586 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:27:59.586 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:27:59.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:27:59.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:27:59.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:27:59.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:27:59.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:27:59.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:28:03.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 11:28:03.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424817,ok=424817,error=0, records=41
[INFO ] 2026-06-01 11:28:05.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:28:07.564 [9569 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:28:18.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 11:28:18.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424818,ok=424818,error=0, records=41
[INFO ] 2026-06-01 11:28:20.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:28:22.569 [9538 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:28:33.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 11:28:33.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424819,ok=424819,error=0, records=41
[INFO ] 2026-06-01 11:28:35.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:28:37.575 [9596 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:28:48.622 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 11:28:48.622 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424820,ok=424820,error=0, records=41
[INFO ] 2026-06-01 11:28:50.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:28:52.582 [9620 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:29:03.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 11:29:03.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424821,ok=424821,error=0, records=41
[INFO ] 2026-06-01 11:29:05.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:29:07.588 [9579 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:29:18.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 11:29:18.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424822,ok=424822,error=0, records=41
[INFO ] 2026-06-01 11:29:20.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:29:22.592 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:29:33.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 11:29:33.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424823,ok=424823,error=0, records=41
[INFO ] 2026-06-01 11:29:35.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:29:37.597 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:29:48.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 11:29:48.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424824,ok=424824,error=0, records=41
[INFO ] 2026-06-01 11:29:50.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:29:52.602 [9642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:30:00.985 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21256/300s
[INFO ] 2026-06-01 11:30:02.105 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21247/300s
[INFO ] 2026-06-01 11:30:03.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 11:30:03.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424825,ok=424825,error=0, records=41
[INFO ] 2026-06-01 11:30:05.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:30:07.608 [9620 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:30:18.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 11:30:18.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424826,ok=424826,error=0, records=41
[INFO ] 2026-06-01 11:30:18.714 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21243/300s
[INFO ] 2026-06-01 11:30:20.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:30:22.612 [9642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:30:33.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 11:30:33.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424827,ok=424827,error=0, records=41
[INFO ] 2026-06-01 11:30:35.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:30:37.617 [9579 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:30:41.364 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21256/300s
[INFO ] 2026-06-01 11:30:48.724 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 11:30:48.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424828,ok=424828,error=0, records=41
[INFO ] 2026-06-01 11:30:50.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:30:52.623 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:30:59.588 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862480},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:30:59.756 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:30:59.756 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 11:30:59.756 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:30:59.756 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:30:59.756 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:30:59.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:30:59.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:30:59.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:30:59.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:30:59.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:30:59.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:31:03.730 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 11:31:03.730 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424829,ok=424829,error=0, records=41
[INFO ] 2026-06-01 11:31:05.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:31:07.629 [9620 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:31:12.827 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21243/300s
[INFO ] 2026-06-01 11:31:18.736 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 11:31:18.736 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424830,ok=424830,error=0, records=41
[INFO ] 2026-06-01 11:31:20.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:31:22.634 [9642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:31:33.741 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 11:31:33.741 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424831,ok=424831,error=0, records=41
[INFO ] 2026-06-01 11:31:35.793 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21252/300s
[INFO ] 2026-06-01 11:31:35.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:31:37.639 [9620 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:31:48.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 11:31:48.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424832,ok=424832,error=0, records=41
[INFO ] 2026-06-01 11:31:50.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:31:52.644 [9579 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:32:03.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 11:32:03.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424833,ok=424833,error=0, records=41
[INFO ] 2026-06-01 11:32:05.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:32:05.975 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21255/300s
[WARN ] 2026-06-01 11:32:07.650 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:32:18.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 11:32:18.765 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424834,ok=424834,error=0, records=41
[INFO ] 2026-06-01 11:32:20.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:32:22.655 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:32:33.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 11:32:33.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424835,ok=424835,error=0, records=41
[INFO ] 2026-06-01 11:32:35.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:32:37.660 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:32:42.150 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21253/300s
[INFO ] 2026-06-01 11:32:44.152 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21253/300s
[INFO ] 2026-06-01 11:32:48.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 11:32:48.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424836,ok=424836,error=0, records=41
[INFO ] 2026-06-01 11:32:50.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:32:51.659 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21253/300s
[WARN ] 2026-06-01 11:32:52.665 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:33:03.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 11:33:03.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424837,ok=424837,error=0, records=41
[INFO ] 2026-06-01 11:33:05.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:33:07.670 [9642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:33:18.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 11:33:18.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424838,ok=424838,error=0, records=41
[INFO ] 2026-06-01 11:33:20.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:33:22.675 [9620 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:33:34.848 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 11:33:34.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424839,ok=424839,error=0, records=41
[INFO ] 2026-06-01 11:33:35.979 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 11:33:35.979 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 11:33:37.679 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:33:49.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10720, records=44
[INFO ] 2026-06-01 11:33:49.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424840,ok=424840,error=0, records=44
[INFO ] 2026-06-01 11:33:50.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:33:52.685 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:33:59.756 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17697/300s
[INFO ] 2026-06-01 11:33:59.758 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862404},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:33:59.927 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:33:59.927 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-01 11:33:59.928 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:33:59.928 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:33:59.928 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:33:59.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:33:59.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:33:59.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:33:59.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:33:59.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:33:59.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:34:04.863 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10365, records=41
[INFO ] 2026-06-01 11:34:04.863 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424841,ok=424841,error=0, records=41
[INFO ] 2026-06-01 11:34:05.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:34:07.689 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:34:19.869 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 11:34:19.870 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424842,ok=424842,error=0, records=41
[INFO ] 2026-06-01 11:34:20.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:34:22.695 [9642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:34:34.875 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-01 11:34:34.875 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424843,ok=424843,error=0, records=41
[INFO ] 2026-06-01 11:34:35.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:34:37.701 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:34:49.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 11:34:49.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424844,ok=424844,error=0, records=41
[INFO ] 2026-06-01 11:34:50.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:34:52.706 [9579 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:35:00.989 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21257/300s
[INFO ] 2026-06-01 11:35:02.210 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21248/300s
[INFO ] 2026-06-01 11:35:04.885 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 11:35:04.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424845,ok=424845,error=0, records=41
[INFO ] 2026-06-01 11:35:05.983 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:35:07.712 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:35:19.891 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 11:35:19.891 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424846,ok=424846,error=0, records=41
[INFO ] 2026-06-01 11:35:19.891 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21244/300s
[INFO ] 2026-06-01 11:35:20.983 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:35:22.717 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:35:34.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 11:35:34.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424847,ok=424847,error=0, records=41
[INFO ] 2026-06-01 11:35:35.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:35:37.722 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:35:41.370 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21257/300s
[INFO ] 2026-06-01 11:35:50.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 11:35:50.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424848,ok=424848,error=0, records=41
[INFO ] 2026-06-01 11:35:50.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:35:52.728 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:36:05.009 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 11:36:05.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424849,ok=424849,error=0, records=41
[INFO ] 2026-06-01 11:36:05.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:36:07.732 [9579 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:36:13.013 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21244/300s
[INFO ] 2026-06-01 11:36:20.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 11:36:20.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424850,ok=424850,error=0, records=41
[INFO ] 2026-06-01 11:36:20.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:36:22.737 [9642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:36:35.020 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 11:36:35.020 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424851,ok=424851,error=0, records=41
[INFO ] 2026-06-01 11:36:35.852 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21253/300s
[INFO ] 2026-06-01 11:36:35.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:36:37.743 [9620 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:36:50.026 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 11:36:50.026 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424852,ok=424852,error=0, records=41
[INFO ] 2026-06-01 11:36:50.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:36:52.748 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:36:59.929 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862324},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:37:00.107 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:37:00.107 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 11:37:00.107 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:37:00.107 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:37:00.107 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:37:00.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:37:00.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:37:00.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:37:00.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:37:00.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:37:00.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:37:05.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 11:37:05.032 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424853,ok=424853,error=0, records=41
[INFO ] 2026-06-01 11:37:05.988 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:37:05.988 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21256/300s
[WARN ] 2026-06-01 11:37:07.753 [9579 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:37:20.039 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 11:37:20.039 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424854,ok=424854,error=0, records=41
[INFO ] 2026-06-01 11:37:20.988 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:37:22.759 [9579 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:37:35.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 11:37:35.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424855,ok=424855,error=0, records=41
[INFO ] 2026-06-01 11:37:35.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:37:37.764 [9579 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:37:42.215 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21254/300s
[INFO ] 2026-06-01 11:37:44.217 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21254/300s
[INFO ] 2026-06-01 11:37:50.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 11:37:50.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424856,ok=424856,error=0, records=41
[INFO ] 2026-06-01 11:37:50.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:37:51.724 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21254/300s
[WARN ] 2026-06-01 11:37:52.768 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:38:05.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-01 11:38:05.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424857,ok=424857,error=0, records=41
[INFO ] 2026-06-01 11:38:05.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:38:07.774 [9642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:38:20.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-01 11:38:20.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424858,ok=424858,error=0, records=41
[INFO ] 2026-06-01 11:38:20.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:38:22.780 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:38:35.138 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 11:38:35.138 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424859,ok=424859,error=0, records=41
[INFO ] 2026-06-01 11:38:35.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:38:37.785 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:38:50.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 11:38:50.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424860,ok=424860,error=0, records=41
[INFO ] 2026-06-01 11:38:50.992 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:38:50.992 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 11:38:52.790 [9620 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:39:05.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 11:39:05.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424861,ok=424861,error=0, records=41
[INFO ] 2026-06-01 11:39:05.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:39:07.794 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:39:20.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 11:39:20.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424862,ok=424862,error=0, records=41
[INFO ] 2026-06-01 11:39:20.994 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:39:22.800 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:39:35.203 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11088, records=46
[INFO ] 2026-06-01 11:39:35.203 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424863,ok=424863,error=0, records=46
[INFO ] 2026-06-01 11:39:35.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:39:37.806 [10198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:39:50.209 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 11:39:50.209 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424864,ok=424864,error=0, records=41
[INFO ] 2026-06-01 11:39:50.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:39:52.811 [10213] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:40:00.108 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17698/300s
[INFO ] 2026-06-01 11:40:00.109 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862248},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:40:00.263 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:40:00.263 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 11:40:00.263 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:40:00.263 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:40:00.263 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:40:00.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:40:00.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:40:00.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:40:00.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:40:00.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:40:00.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:40:00.992 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21258/300s
[INFO ] 2026-06-01 11:40:02.314 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21249/300s
[INFO ] 2026-06-01 11:40:05.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 11:40:05.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424865,ok=424865,error=0, records=41
[INFO ] 2026-06-01 11:40:05.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=29.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:40:07.818 [10207] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:40:20.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 11:40:20.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424866,ok=424866,error=0, records=41
[INFO ] 2026-06-01 11:40:20.222 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21245/300s
[INFO ] 2026-06-01 11:40:20.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:40:22.824 [10207] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:40:35.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 11:40:35.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424867,ok=424867,error=0, records=41
[INFO ] 2026-06-01 11:40:35.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:40:37.830 [10207] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:40:41.377 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21258/300s
[INFO ] 2026-06-01 11:40:50.236 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 11:40:50.236 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424868,ok=424868,error=0, records=41
[INFO ] 2026-06-01 11:40:50.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:40:52.834 [10207] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:41:05.241 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 11:41:05.242 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424869,ok=424869,error=0, records=41
[INFO ] 2026-06-01 11:41:05.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:41:07.840 [10262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:41:13.193 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21245/300s
[INFO ] 2026-06-01 11:41:20.247 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 11:41:20.247 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424870,ok=424870,error=0, records=41
[INFO ] 2026-06-01 11:41:20.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:41:22.845 [10234] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:41:35.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 11:41:35.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424871,ok=424871,error=0, records=41
[INFO ] 2026-06-01 11:41:35.905 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21254/300s
[INFO ] 2026-06-01 11:41:35.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:41:37.850 [10248] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:41:50.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 11:41:50.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424872,ok=424872,error=0, records=41
[INFO ] 2026-06-01 11:41:51.000 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:41:52.855 [10326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:42:05.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 11:42:05.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424873,ok=424873,error=0, records=41
[INFO ] 2026-06-01 11:42:06.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:42:06.001 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21257/300s
[WARN ] 2026-06-01 11:42:07.860 [10234] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:42:20.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 11:42:20.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424874,ok=424874,error=0, records=41
[INFO ] 2026-06-01 11:42:21.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:42:22.864 [10354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:42:35.291 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 11:42:35.291 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424875,ok=424875,error=0, records=41
[INFO ] 2026-06-01 11:42:36.002 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:42:37.869 [10326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:42:42.280 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21255/300s
[INFO ] 2026-06-01 11:42:44.281 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21255/300s
[INFO ] 2026-06-01 11:42:50.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 11:42:50.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424876,ok=424876,error=0, records=41
[INFO ] 2026-06-01 11:42:51.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:42:51.788 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21255/300s
[WARN ] 2026-06-01 11:42:52.874 [10234] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:43:00.265 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862172},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:43:00.421 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:43:00.421 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 11:43:00.421 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:43:00.421 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:43:00.421 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:43:00.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:43:00.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:43:00.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:43:00.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:43:00.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:43:00.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:43:05.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 11:43:05.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424877,ok=424877,error=0, records=41
[INFO ] 2026-06-01 11:43:06.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:43:07.880 [10400] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:43:20.312 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 11:43:20.312 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424878,ok=424878,error=0, records=41
[INFO ] 2026-06-01 11:43:21.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:43:22.885 [10400] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:43:35.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 11:43:35.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424879,ok=424879,error=0, records=41
[INFO ] 2026-06-01 11:43:36.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 11:43:36.004 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 11:43:37.891 [10423] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:43:50.323 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-01 11:43:50.323 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424880,ok=424880,error=0, records=41
[INFO ] 2026-06-01 11:43:51.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:43:52.897 [10417] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:44:05.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-01 11:44:05.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424881,ok=424881,error=0, records=41
[INFO ] 2026-06-01 11:44:06.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:44:07.902 [10460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:44:20.334 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-01 11:44:20.334 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424882,ok=424882,error=0, records=41
[INFO ] 2026-06-01 11:44:21.006 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:44:22.907 [10432] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:44:35.340 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 11:44:35.340 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424883,ok=424883,error=0, records=41
[INFO ] 2026-06-01 11:44:36.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:44:37.912 [10460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:44:50.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 11:44:50.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424884,ok=424884,error=0, records=41
[INFO ] 2026-06-01 11:44:51.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:44:52.917 [10516] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:45:00.995 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21259/300s
[INFO ] 2026-06-01 11:45:02.420 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21250/300s
[INFO ] 2026-06-01 11:45:05.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 11:45:05.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424885,ok=424885,error=0, records=41
[INFO ] 2026-06-01 11:45:06.008 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:45:07.922 [10533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:45:20.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 11:45:20.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424886,ok=424886,error=0, records=41
[INFO ] 2026-06-01 11:45:20.363 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21246/300s
[INFO ] 2026-06-01 11:45:21.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:45:22.928 [10533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:45:35.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 11:45:35.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424887,ok=424887,error=0, records=41
[INFO ] 2026-06-01 11:45:36.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:45:37.934 [10573] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:45:41.383 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21259/300s
[INFO ] 2026-06-01 11:45:50.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 11:45:50.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424888,ok=424888,error=0, records=41
[INFO ] 2026-06-01 11:45:51.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:45:52.938 [10583] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:46:00.421 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17699/300s
[INFO ] 2026-06-01 11:46:00.423 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862088},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:46:00.591 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:46:00.591 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 11:46:00.591 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:46:00.591 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:46:00.591 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:46:00.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:46:00.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:46:00.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:46:00.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:46:00.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:46:00.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:46:05.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 11:46:05.384 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424889,ok=424889,error=0, records=41
[INFO ] 2026-06-01 11:46:06.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:46:07.946 [10606] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:46:13.376 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21246/300s
[INFO ] 2026-06-01 11:46:20.392 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=12160, records=49
[INFO ] 2026-06-01 11:46:20.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424890,ok=424890,error=0, records=49
[INFO ] 2026-06-01 11:46:21.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:46:22.952 [10578] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:46:35.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 11:46:35.462 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424891,ok=424891,error=0, records=41
[INFO ] 2026-06-01 11:46:35.958 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21255/300s
[INFO ] 2026-06-01 11:46:36.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:46:37.957 [10630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:46:50.467 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 11:46:50.467 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424892,ok=424892,error=0, records=41
[INFO ] 2026-06-01 11:46:51.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:46:52.964 [10584] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:47:05.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 11:47:05.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424893,ok=424893,error=0, records=41
[INFO ] 2026-06-01 11:47:06.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:47:06.013 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21258/300s
[WARN ] 2026-06-01 11:47:07.968 [10630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:47:20.478 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 11:47:20.478 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424894,ok=424894,error=0, records=41
[INFO ] 2026-06-01 11:47:21.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:47:22.973 [10630] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:47:35.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 11:47:35.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424895,ok=424895,error=0, records=41
[INFO ] 2026-06-01 11:47:36.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:47:37.978 [10584] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:47:42.336 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21256/300s
[INFO ] 2026-06-01 11:47:44.338 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21256/300s
[INFO ] 2026-06-01 11:47:50.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10147, records=41
[INFO ] 2026-06-01 11:47:50.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424896,ok=424896,error=0, records=41
[INFO ] 2026-06-01 11:47:51.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:47:51.845 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21256/300s
[WARN ] 2026-06-01 11:47:52.983 [10658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:48:05.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 11:48:05.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424897,ok=424897,error=0, records=41
[INFO ] 2026-06-01 11:48:06.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:48:07.988 [10584] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:48:20.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 11:48:20.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424898,ok=424898,error=0, records=41
[INFO ] 2026-06-01 11:48:21.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:48:22.993 [10658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:48:35.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 11:48:35.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424899,ok=424899,error=0, records=41
[INFO ] 2026-06-01 11:48:36.017 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:48:37.997 [10741] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:48:50.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 11:48:50.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424900,ok=424900,error=0, records=41
[INFO ] 2026-06-01 11:48:51.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:48:53.002 [10713] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:49:00.593 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862012},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:49:00.768 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:49:00.768 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 11:49:00.768 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:49:00.768 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:49:00.768 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:49:00.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:49:00.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:49:00.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:49:00.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:49:00.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:49:00.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:49:05.519 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 11:49:05.519 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424901,ok=424901,error=0, records=41
[INFO ] 2026-06-01 11:49:06.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:49:08.007 [10658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:49:20.524 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 11:49:20.524 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424902,ok=424902,error=0, records=41
[INFO ] 2026-06-01 11:49:21.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:49:23.011 [10658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:49:35.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 11:49:35.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424903,ok=424903,error=0, records=41
[INFO ] 2026-06-01 11:49:36.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:49:38.016 [10755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:49:50.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 11:49:50.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424904,ok=424904,error=0, records=41
[INFO ] 2026-06-01 11:49:51.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:49:53.021 [10755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:50:00.999 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21260/300s
[INFO ] 2026-06-01 11:50:02.525 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21251/300s
[INFO ] 2026-06-01 11:50:05.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 11:50:05.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424905,ok=424905,error=0, records=41
[INFO ] 2026-06-01 11:50:06.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:50:08.027 [10811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:50:20.546 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 11:50:20.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424906,ok=424906,error=0, records=41
[INFO ] 2026-06-01 11:50:20.546 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21247/300s
[INFO ] 2026-06-01 11:50:21.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:50:23.032 [10727] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:50:35.551 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 11:50:35.551 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424907,ok=424907,error=0, records=41
[INFO ] 2026-06-01 11:50:36.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:50:38.037 [10658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:50:41.390 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21260/300s
[INFO ] 2026-06-01 11:50:50.558 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 11:50:50.558 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424908,ok=424908,error=0, records=41
[INFO ] 2026-06-01 11:50:51.023 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:50:53.043 [10873] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:51:05.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-01 11:51:05.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424909,ok=424909,error=0, records=41
[INFO ] 2026-06-01 11:51:06.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:51:08.049 [10891] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:51:13.558 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21247/300s
[INFO ] 2026-06-01 11:51:20.573 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-01 11:51:20.573 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424910,ok=424910,error=0, records=41
[INFO ] 2026-06-01 11:51:21.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:51:22.555 [10906] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:51:35.579 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 11:51:35.579 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424911,ok=424911,error=0, records=41
[INFO ] 2026-06-01 11:51:36.015 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21256/300s
[INFO ] 2026-06-01 11:51:36.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:51:37.560 [10890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:51:50.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 11:51:50.585 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424912,ok=424912,error=0, records=41
[INFO ] 2026-06-01 11:51:51.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:51:52.565 [10942] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:52:00.768 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17700/300s
[INFO ] 2026-06-01 11:52:00.770 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861932},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:52:00.956 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:52:00.956 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 11:52:00.956 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:52:00.956 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:52:00.956 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:52:00.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:52:00.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:52:00.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:52:00.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:52:00.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:52:00.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:52:05.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 11:52:05.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424913,ok=424913,error=0, records=41
[INFO ] 2026-06-01 11:52:06.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:52:06.026 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21259/300s
[WARN ] 2026-06-01 11:52:07.570 [10958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:52:20.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 11:52:20.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424914,ok=424914,error=0, records=41
[INFO ] 2026-06-01 11:52:21.027 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:52:22.576 [10942] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:52:35.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 11:52:35.601 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424915,ok=424915,error=0, records=41
[INFO ] 2026-06-01 11:52:36.027 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:52:37.581 [10890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:52:42.412 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21257/300s
[INFO ] 2026-06-01 11:52:44.414 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21257/300s
[INFO ] 2026-06-01 11:52:50.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 11:52:50.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424916,ok=424916,error=0, records=41
[INFO ] 2026-06-01 11:52:51.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:52:51.891 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21257/300s
[WARN ] 2026-06-01 11:52:52.586 [11005] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:53:05.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 11:53:05.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424917,ok=424917,error=0, records=41
[INFO ] 2026-06-01 11:53:06.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:53:07.592 [11000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:53:20.624 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 11:53:20.624 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424918,ok=424918,error=0, records=41
[INFO ] 2026-06-01 11:53:21.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:53:22.598 [11005] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:53:35.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 11:53:35.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424919,ok=424919,error=0, records=41
[INFO ] 2026-06-01 11:53:36.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 11:53:36.030 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 11:53:37.604 [11005] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:53:50.636 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 11:53:50.636 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424920,ok=424920,error=0, records=41
[INFO ] 2026-06-01 11:53:51.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:53:51.031 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 11:53:52.611 [11032] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:54:05.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10387, records=41
[INFO ] 2026-06-01 11:54:05.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424921,ok=424921,error=0, records=41
[INFO ] 2026-06-01 11:54:06.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:54:07.616 [11000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:54:20.646 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10379, records=41
[INFO ] 2026-06-01 11:54:20.646 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424922,ok=424922,error=0, records=41
[INFO ] 2026-06-01 11:54:21.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:54:22.621 [10999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:54:35.652 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10401, records=41
[INFO ] 2026-06-01 11:54:35.652 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424923,ok=424923,error=0, records=41
[INFO ] 2026-06-01 11:54:36.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:54:37.626 [11000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 11:54:47.630 [10999] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7091/stat), No such file or directory
[INFO ] 2026-06-01 11:54:50.658 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-01 11:54:50.658 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424924,ok=424924,error=0, records=41
[INFO ] 2026-06-01 11:54:51.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:54:52.631 [10999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:55:00.958 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861844},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:55:01.003 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21261/300s
[INFO ] 2026-06-01 11:55:01.130 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:55:01.130 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 11:55:01.130 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:55:01.130 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:55:01.130 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:55:01.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:55:01.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:55:01.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:55:01.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:55:01.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:55:01.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:55:02.634 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21252/300s
[INFO ] 2026-06-01 11:55:05.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 11:55:05.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424925,ok=424925,error=0, records=41
[INFO ] 2026-06-01 11:55:06.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:55:07.637 [11005] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:55:20.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 11:55:20.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424926,ok=424926,error=0, records=41
[INFO ] 2026-06-01 11:55:20.680 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21248/300s
[INFO ] 2026-06-01 11:55:21.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:55:22.642 [11022] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:55:35.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 11:55:35.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424927,ok=424927,error=0, records=41
[INFO ] 2026-06-01 11:55:36.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:55:37.647 [11022] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:55:41.396 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21261/300s
[INFO ] 2026-06-01 11:55:50.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 11:55:50.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424928,ok=424928,error=0, records=41
[INFO ] 2026-06-01 11:55:51.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:55:52.650 [11022] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:56:05.759 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 11:56:05.759 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424929,ok=424929,error=0, records=41
[INFO ] 2026-06-01 11:56:06.037 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:56:07.656 [11022] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:56:13.734 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21248/300s
[INFO ] 2026-06-01 11:56:20.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 11:56:20.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424930,ok=424930,error=0, records=41
[INFO ] 2026-06-01 11:56:21.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:56:22.661 [10999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:56:35.859 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 11:56:35.859 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424931,ok=424931,error=0, records=41
[INFO ] 2026-06-01 11:56:36.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:56:36.073 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21257/300s
[WARN ] 2026-06-01 11:56:37.665 [11005] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:56:50.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 11:56:50.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424932,ok=424932,error=0, records=41
[INFO ] 2026-06-01 11:56:51.039 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:56:52.671 [11000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:57:05.962 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 11:57:05.962 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424933,ok=424933,error=0, records=41
[INFO ] 2026-06-01 11:57:06.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:57:06.040 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21260/300s
[WARN ] 2026-06-01 11:57:07.676 [10999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:57:20.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 11:57:20.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424934,ok=424934,error=0, records=41
[INFO ] 2026-06-01 11:57:21.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:57:22.681 [11000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:57:35.987 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 11:57:35.987 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424935,ok=424935,error=0, records=41
[INFO ] 2026-06-01 11:57:36.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:57:37.685 [11032] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:57:42.489 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21258/300s
[INFO ] 2026-06-01 11:57:44.491 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21258/300s
[INFO ] 2026-06-01 11:57:50.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 11:57:50.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424936,ok=424936,error=0, records=41
[INFO ] 2026-06-01 11:57:51.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:57:51.953 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21258/300s
[WARN ] 2026-06-01 11:57:52.691 [11032] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:58:01.130 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17701/300s
[INFO ] 2026-06-01 11:58:01.132 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861772},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 11:58:01.424 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 11:58:01.424 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 11:58:01.424 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 11:58:01.424 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 11:58:01.424 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 11:58:01.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:58:01.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 11:58:01.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:58:01.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 11:58:01.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:58:01.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 11:58:06.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 11:58:06.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424937,ok=424937,error=0, records=41
[INFO ] 2026-06-01 11:58:06.042 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:58:07.696 [11005] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:58:21.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 11:58:21.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424938,ok=424938,error=0, records=41
[INFO ] 2026-06-01 11:58:21.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:58:22.702 [11000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:58:36.016 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 11:58:36.016 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424939,ok=424939,error=0, records=41
[INFO ] 2026-06-01 11:58:36.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:58:37.706 [11000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:58:51.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 11:58:51.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424940,ok=424940,error=0, records=41
[INFO ] 2026-06-01 11:58:51.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:58:52.711 [11032] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:59:06.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 11:59:06.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424941,ok=424941,error=0, records=41
[INFO ] 2026-06-01 11:59:06.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:59:07.716 [11005] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:59:21.040 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10145, records=41
[INFO ] 2026-06-01 11:59:21.040 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424942,ok=424942,error=0, records=41
[INFO ] 2026-06-01 11:59:21.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 11:59:22.721 [11022] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:59:36.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-01 11:59:36.046 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 11:59:36.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424943,ok=424943,error=0, records=41
[WARN ] 2026-06-01 11:59:37.726 [11032] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 11:59:51.046 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 11:59:51.051 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 11:59:51.051 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424944,ok=424944,error=0, records=41
[WARN ] 2026-06-01 11:59:52.732 [11032] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:00:01.007 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21262/300s
[INFO ] 2026-06-01 12:00:02.736 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21253/300s
[INFO ] 2026-06-01 12:00:06.047 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:00:06.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 12:00:06.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424945,ok=424945,error=0, records=41
[WARN ] 2026-06-01 12:00:07.738 [11022] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:00:21.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:00:21.123 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 12:00:21.123 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424946,ok=424946,error=0, records=41
[INFO ] 2026-06-01 12:00:21.123 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21249/300s
[WARN ] 2026-06-01 12:00:22.742 [11005] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:00:36.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:00:36.128 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 12:00:36.128 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424947,ok=424947,error=0, records=41
[WARN ] 2026-06-01 12:00:37.748 [10999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:00:41.403 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21262/300s
[INFO ] 2026-06-01 12:00:51.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:00:51.134 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 12:00:51.134 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424948,ok=424948,error=0, records=41
[WARN ] 2026-06-01 12:00:52.754 [10999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:01:01.426 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861696},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:01:01.601 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:01:01.601 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 12:01:01.601 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:01:01.601 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:01:01.601 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:01:01.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:01:01.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:01:01.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:01:01.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:01:01.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:01:01.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:01:06.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:01:06.140 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 12:01:06.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424949,ok=424949,error=0, records=41
[WARN ] 2026-06-01 12:01:07.760 [10999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:01:13.917 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21249/300s
[INFO ] 2026-06-01 12:01:21.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:01:21.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 12:01:21.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424950,ok=424950,error=0, records=41
[WARN ] 2026-06-01 12:01:22.765 [11022] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:01:36.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:01:36.129 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21258/300s
[INFO ] 2026-06-01 12:01:36.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 12:01:36.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424951,ok=424951,error=0, records=41
[WARN ] 2026-06-01 12:01:37.770 [11032] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:01:51.051 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:01:51.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 12:01:51.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424952,ok=424952,error=0, records=41
[WARN ] 2026-06-01 12:01:52.776 [10999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:02:06.052 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:02:06.052 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21261/300s
[INFO ] 2026-06-01 12:02:06.166 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 12:02:06.166 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424953,ok=424953,error=0, records=41
[WARN ] 2026-06-01 12:02:07.782 [11000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:02:21.052 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:02:21.173 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 12:02:21.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424954,ok=424954,error=0, records=41
[WARN ] 2026-06-01 12:02:22.787 [10999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:02:36.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:02:36.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 12:02:36.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424955,ok=424955,error=0, records=41
[WARN ] 2026-06-01 12:02:37.792 [10999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:02:42.559 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21259/300s
[INFO ] 2026-06-01 12:02:44.561 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21259/300s
[INFO ] 2026-06-01 12:02:51.054 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:02:51.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 12:02:51.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424956,ok=424956,error=0, records=41
[INFO ] 2026-06-01 12:02:52.002 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21259/300s
[WARN ] 2026-06-01 12:02:52.797 [11032] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:03:06.054 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:03:06.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 12:03:06.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424957,ok=424957,error=0, records=41
[WARN ] 2026-06-01 12:03:07.803 [11005] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:03:21.055 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:03:21.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 12:03:21.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424958,ok=424958,error=0, records=41
[WARN ] 2026-06-01 12:03:22.809 [10999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:03:36.056 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 12:03:36.056 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 12:03:36.221 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 12:03:36.221 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424959,ok=424959,error=0, records=41
[WARN ] 2026-06-01 12:03:37.813 [11601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:03:51.056 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:03:51.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 12:03:51.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424960,ok=424960,error=0, records=41
[WARN ] 2026-06-01 12:03:52.819 [11587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:04:01.601 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17702/300s
[INFO ] 2026-06-01 12:04:01.603 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861612},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:04:01.760 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:04:01.760 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 12:04:01.761 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:04:01.761 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[INFO ] 2026-06-01 12:04:01.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[WARN ] 2026-06-01 12:04:01.761 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:04:01.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:04:01.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:04:01.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:04:01.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:04:01.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:04:06.057 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=28.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:04:06.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 12:04:06.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424961,ok=424961,error=0, records=41
[WARN ] 2026-06-01 12:04:07.826 [11587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:04:21.058 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:04:21.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 12:04:21.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424962,ok=424962,error=0, records=41
[WARN ] 2026-06-01 12:04:22.831 [11645] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:04:36.058 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:04:36.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 12:04:36.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424963,ok=424963,error=0, records=41
[WARN ] 2026-06-01 12:04:37.836 [11000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:04:51.059 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:04:51.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 12:04:51.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424964,ok=424964,error=0, records=41
[WARN ] 2026-06-01 12:04:52.841 [11610] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:05:01.010 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21263/300s
[INFO ] 2026-06-01 12:05:02.844 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21254/300s
[INFO ] 2026-06-01 12:05:06.059 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:05:06.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 12:05:06.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424965,ok=424965,error=0, records=41
[WARN ] 2026-06-01 12:05:07.846 [11696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:05:21.060 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:05:21.261 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 12:05:21.261 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424966,ok=424966,error=0, records=41
[INFO ] 2026-06-01 12:05:21.261 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21250/300s
[WARN ] 2026-06-01 12:05:22.852 [11696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:05:36.061 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:05:36.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 12:05:36.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424967,ok=424967,error=0, records=41
[WARN ] 2026-06-01 12:05:37.857 [11696] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:05:41.410 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21263/300s
[INFO ] 2026-06-01 12:05:51.061 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:05:51.271 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 12:05:51.271 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424968,ok=424968,error=0, records=41
[WARN ] 2026-06-01 12:05:52.863 [11610] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:06:06.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:06:06.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 12:06:06.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424969,ok=424969,error=0, records=41
[WARN ] 2026-06-01 12:06:07.867 [11724] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:06:14.101 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21250/300s
[INFO ] 2026-06-01 12:06:21.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:06:21.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 12:06:21.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424970,ok=424970,error=0, records=41
[WARN ] 2026-06-01 12:06:22.872 [11767] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:06:36.063 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:06:36.191 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21259/300s
[INFO ] 2026-06-01 12:06:36.284 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 12:06:36.284 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424971,ok=424971,error=0, records=41
[WARN ] 2026-06-01 12:06:37.878 [11791] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:06:51.063 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:06:51.289 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 12:06:51.289 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424972,ok=424972,error=0, records=41
[WARN ] 2026-06-01 12:06:52.884 [11724] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:07:01.762 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861524},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:07:01.925 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:07:01.925 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 12:07:01.926 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:07:01.926 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:07:01.926 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:07:01.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:07:01.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:07:01.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:07:01.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:07:01.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:07:01.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:07:06.064 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:07:06.064 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21262/300s
[INFO ] 2026-06-01 12:07:06.294 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 12:07:06.294 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424973,ok=424973,error=0, records=41
[WARN ] 2026-06-01 12:07:07.890 [11812] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:07:21.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:07:21.299 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 12:07:21.299 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424974,ok=424974,error=0, records=41
[WARN ] 2026-06-01 12:07:22.895 [11840] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:07:36.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:07:36.304 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 12:07:36.305 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424975,ok=424975,error=0, records=41
[WARN ] 2026-06-01 12:07:37.900 [11845] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:07:42.637 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21260/300s
[INFO ] 2026-06-01 12:07:44.636 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21260/300s
[INFO ] 2026-06-01 12:07:51.066 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:07:51.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-01 12:07:51.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424976,ok=424976,error=0, records=41
[INFO ] 2026-06-01 12:07:52.054 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21260/300s
[WARN ] 2026-06-01 12:07:52.907 [11840] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:08:06.066 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:08:06.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 12:08:06.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424977,ok=424977,error=0, records=41
[WARN ] 2026-06-01 12:08:07.913 [11866] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:08:21.067 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:08:21.323 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 12:08:21.323 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424978,ok=424978,error=0, records=41
[WARN ] 2026-06-01 12:08:22.918 [11900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:08:36.068 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:08:36.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 12:08:36.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424979,ok=424979,error=0, records=41
[WARN ] 2026-06-01 12:08:37.923 [11911] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:08:51.068 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:08:51.068 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 12:08:51.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 12:08:51.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424980,ok=424980,error=0, records=41
[WARN ] 2026-06-01 12:08:52.928 [11921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:09:06.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:09:06.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-01 12:09:06.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424981,ok=424981,error=0, records=41
[WARN ] 2026-06-01 12:09:07.935 [11899] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:09:21.071 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:09:21.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 12:09:21.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424982,ok=424982,error=0, records=41
[WARN ] 2026-06-01 12:09:22.941 [11971] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:09:36.071 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:09:36.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 12:09:36.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424983,ok=424983,error=0, records=41
[WARN ] 2026-06-01 12:09:37.947 [11976] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:09:51.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:09:51.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-01 12:09:51.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424984,ok=424984,error=0, records=41
[WARN ] 2026-06-01 12:09:52.952 [11996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:10:01.013 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21264/300s
[INFO ] 2026-06-01 12:10:01.926 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17703/300s
[INFO ] 2026-06-01 12:10:01.928 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861448},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:10:02.109 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:10:02.109 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 12:10:02.109 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:10:02.109 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:10:02.109 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:10:02.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:10:02.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:10:02.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:10:02.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:10:02.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:10:02.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:10:02.956 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21255/300s
[INFO ] 2026-06-01 12:10:06.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:10:06.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 12:10:06.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424985,ok=424985,error=0, records=41
[WARN ] 2026-06-01 12:10:07.958 [11996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:10:21.073 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:10:21.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 12:10:21.373 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424986,ok=424986,error=0, records=41
[INFO ] 2026-06-01 12:10:21.373 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21251/300s
[WARN ] 2026-06-01 12:10:22.963 [12030] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:10:36.074 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:10:36.389 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 12:10:36.389 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424987,ok=424987,error=0, records=41
[WARN ] 2026-06-01 12:10:37.967 [11976] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:10:41.416 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21264/300s
[INFO ] 2026-06-01 12:10:51.074 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:10:51.395 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 12:10:51.395 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424988,ok=424988,error=0, records=41
[WARN ] 2026-06-01 12:10:52.971 [12016] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:11:06.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:11:06.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 12:11:06.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424989,ok=424989,error=0, records=41
[WARN ] 2026-06-01 12:11:07.977 [11976] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:11:14.285 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21251/300s
[INFO ] 2026-06-01 12:11:21.076 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:11:21.405 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 12:11:21.405 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424990,ok=424990,error=0, records=41
[WARN ] 2026-06-01 12:11:22.982 [12058] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:11:36.076 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:11:36.247 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21260/300s
[INFO ] 2026-06-01 12:11:36.410 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-01 12:11:36.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424991,ok=424991,error=0, records=41
[WARN ] 2026-06-01 12:11:37.987 [12058] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:11:51.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:11:51.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 12:11:51.417 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424992,ok=424992,error=0, records=41
[WARN ] 2026-06-01 12:11:52.992 [12086] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:12:06.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:12:06.077 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21263/300s
[INFO ] 2026-06-01 12:12:06.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 12:12:06.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424993,ok=424993,error=0, records=41
[WARN ] 2026-06-01 12:12:07.997 [11976] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:12:21.078 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:12:21.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 12:12:21.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424994,ok=424994,error=0, records=41
[WARN ] 2026-06-01 12:12:23.002 [12058] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:12:36.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:12:36.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 12:12:36.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424995,ok=424995,error=0, records=41
[WARN ] 2026-06-01 12:12:38.006 [12044] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:12:42.706 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21261/300s
[INFO ] 2026-06-01 12:12:44.708 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21261/300s
[INFO ] 2026-06-01 12:12:51.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:12:51.441 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 12:12:51.441 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424996,ok=424996,error=0, records=41
[INFO ] 2026-06-01 12:12:52.115 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21261/300s
[WARN ] 2026-06-01 12:12:53.011 [12128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:13:02.111 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861368},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:13:02.269 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:13:02.269 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 12:13:02.269 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:13:02.269 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:13:02.269 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:13:02.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:13:02.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:13:02.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:13:02.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:13:02.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:13:02.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:13:06.080 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:13:06.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 12:13:06.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424997,ok=424997,error=0, records=41
[WARN ] 2026-06-01 12:13:08.015 [12170] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:13:21.081 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:13:21.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 12:13:21.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424998,ok=424998,error=0, records=41
[WARN ] 2026-06-01 12:13:23.021 [12185] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:13:36.081 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 12:13:36.081 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 12:13:36.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 12:13:36.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=424999,ok=424999,error=0, records=41
[WARN ] 2026-06-01 12:13:38.026 [12213] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:13:51.082 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:13:51.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 12:13:51.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425000,ok=425000,error=0, records=41
[WARN ] 2026-06-01 12:13:53.032 [12058] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:14:06.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:14:06.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 12:14:06.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425001,ok=425001,error=0, records=41
[WARN ] 2026-06-01 12:14:08.037 [12058] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:14:21.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:14:21.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 12:14:21.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425002,ok=425002,error=0, records=41
[WARN ] 2026-06-01 12:14:23.044 [12252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:14:36.084 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:14:36.480 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 12:14:36.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425003,ok=425003,error=0, records=41
[WARN ] 2026-06-01 12:14:38.049 [12273] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:14:51.084 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:14:51.485 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 12:14:51.485 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425004,ok=425004,error=0, records=41
[WARN ] 2026-06-01 12:14:52.555 [12273] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:15:01.017 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21265/300s
[INFO ] 2026-06-01 12:15:03.058 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21256/300s
[INFO ] 2026-06-01 12:15:06.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:15:06.491 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 12:15:06.491 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425005,ok=425005,error=0, records=41
[WARN ] 2026-06-01 12:15:07.559 [12298] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 12:15:17.563 [12293] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/8551/stat), No such file or directory
[WARN ] 2026-06-01 12:15:17.563 [12293] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7086/stat), No such file or directory
[INFO ] 2026-06-01 12:15:21.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:15:21.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 12:15:21.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425006,ok=425006,error=0, records=41
[INFO ] 2026-06-01 12:15:21.498 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21252/300s
[WARN ] 2026-06-01 12:15:22.564 [12319] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 12:15:32.567 [12298] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/8306/stat), No such file or directory
[WARN ] 2026-06-01 12:15:32.567 [12298] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/8551/stat), No such file or directory
[WARN ] 2026-06-01 12:15:32.567 [12298] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7086/stat), No such file or directory
[INFO ] 2026-06-01 12:15:36.086 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:15:36.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 12:15:36.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425007,ok=425007,error=0, records=41
[WARN ] 2026-06-01 12:15:37.568 [12374] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:15:41.423 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21265/300s
[WARN ] 2026-06-01 12:15:47.571 [12374] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/8306/stat), No such file or directory
[WARN ] 2026-06-01 12:15:47.571 [12374] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/8551/stat), No such file or directory
[WARN ] 2026-06-01 12:15:47.571 [12374] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7086/stat), No such file or directory
[INFO ] 2026-06-01 12:15:51.086 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:15:51.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 12:15:51.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425008,ok=425008,error=0, records=41
[WARN ] 2026-06-01 12:15:52.572 [12400] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:16:02.270 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17704/300s
[INFO ] 2026-06-01 12:16:02.271 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861268},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:16:02.461 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:16:02.461 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 12:16:02.461 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:16:02.461 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:16:02.461 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:16:02.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:16:02.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:16:02.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:16:02.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:16:02.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:16:02.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:16:06.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:16:06.513 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-06-01 12:16:06.513 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425009,ok=425009,error=0, records=41
[WARN ] 2026-06-01 12:16:07.576 [12374] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:16:14.467 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21252/300s
[INFO ] 2026-06-01 12:16:21.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:16:21.517 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 12:16:21.517 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425010,ok=425010,error=0, records=41
[WARN ] 2026-06-01 12:16:22.581 [12426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:16:36.088 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:16:36.298 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21261/300s
[INFO ] 2026-06-01 12:16:36.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 12:16:36.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425011,ok=425011,error=0, records=41
[WARN ] 2026-06-01 12:16:37.585 [12461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:16:51.089 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:16:51.528 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-01 12:16:51.528 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425012,ok=425012,error=0, records=41
[WARN ] 2026-06-01 12:16:52.590 [12471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:17:06.095 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:17:06.095 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21264/300s
[INFO ] 2026-06-01 12:17:06.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 12:17:06.533 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425013,ok=425013,error=0, records=41
[WARN ] 2026-06-01 12:17:07.596 [12487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:17:21.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:17:21.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11471, records=45
[INFO ] 2026-06-01 12:17:21.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425014,ok=425014,error=0, records=45
[WARN ] 2026-06-01 12:17:22.606 [12471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:17:36.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:17:36.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 12:17:36.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425015,ok=425015,error=0, records=41
[WARN ] 2026-06-01 12:17:37.612 [12461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:17:42.781 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21262/300s
[INFO ] 2026-06-01 12:17:44.796 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21262/300s
[INFO ] 2026-06-01 12:17:51.097 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:17:51.550 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 12:17:51.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425016,ok=425016,error=0, records=41
[INFO ] 2026-06-01 12:17:52.179 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21262/300s
[WARN ] 2026-06-01 12:17:52.618 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:18:06.097 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:18:06.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 12:18:06.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425017,ok=425017,error=0, records=41
[WARN ] 2026-06-01 12:18:07.623 [12471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:18:21.098 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:18:21.560 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-01 12:18:21.560 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425018,ok=425018,error=0, records=41
[WARN ] 2026-06-01 12:18:22.628 [12471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:18:36.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:18:36.565 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-01 12:18:36.565 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425019,ok=425019,error=0, records=41
[WARN ] 2026-06-01 12:18:37.632 [12487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:18:51.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:18:51.571 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 12:18:51.571 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425020,ok=425020,error=0, records=41
[WARN ] 2026-06-01 12:18:52.638 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:19:02.463 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861188},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:19:02.628 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:19:02.628 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 12:19:02.629 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:19:02.629 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:19:02.629 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:19:02.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:19:02.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:19:02.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:19:02.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:19:02.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:19:02.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:19:06.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:19:06.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 12:19:06.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425021,ok=425021,error=0, records=41
[WARN ] 2026-06-01 12:19:07.644 [12466] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:19:21.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:19:21.582 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 12:19:21.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425022,ok=425022,error=0, records=41
[WARN ] 2026-06-01 12:19:22.650 [12466] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:19:36.101 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:19:36.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 12:19:36.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425023,ok=425023,error=0, records=41
[WARN ] 2026-06-01 12:19:37.655 [12471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:19:51.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:19:51.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 12:19:51.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425024,ok=425024,error=0, records=41
[WARN ] 2026-06-01 12:19:52.660 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:20:01.020 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21266/300s
[INFO ] 2026-06-01 12:20:03.163 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21257/300s
[INFO ] 2026-06-01 12:20:06.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:20:06.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 12:20:06.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425025,ok=425025,error=0, records=41
[WARN ] 2026-06-01 12:20:07.665 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:20:21.103 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:20:21.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 12:20:21.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425026,ok=425026,error=0, records=41
[INFO ] 2026-06-01 12:20:21.605 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21253/300s
[WARN ] 2026-06-01 12:20:22.670 [12461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:20:36.103 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:20:36.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 12:20:36.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425027,ok=425027,error=0, records=41
[WARN ] 2026-06-01 12:20:37.674 [12487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:20:41.429 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21266/300s
[INFO ] 2026-06-01 12:20:51.104 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:20:51.616 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 12:20:51.616 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425028,ok=425028,error=0, records=41
[WARN ] 2026-06-01 12:20:52.680 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:21:06.105 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:21:06.624 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 12:21:06.624 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425029,ok=425029,error=0, records=41
[WARN ] 2026-06-01 12:21:07.686 [12466] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:21:14.654 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21253/300s
[INFO ] 2026-06-01 12:21:21.105 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:21:21.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 12:21:21.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425030,ok=425030,error=0, records=41
[WARN ] 2026-06-01 12:21:22.692 [12487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:21:36.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:21:36.352 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21262/300s
[INFO ] 2026-06-01 12:21:36.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 12:21:36.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425031,ok=425031,error=0, records=41
[WARN ] 2026-06-01 12:21:37.697 [12461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:21:51.107 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:21:51.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 12:21:51.691 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425032,ok=425032,error=0, records=41
[WARN ] 2026-06-01 12:21:52.703 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:22:02.629 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17705/300s
[INFO ] 2026-06-01 12:22:02.630 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861108},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:22:02.801 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:22:02.801 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 12:22:02.802 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:22:02.802 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:22:02.802 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:22:02.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:22:02.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:22:02.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:22:02.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:22:02.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:22:02.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:22:06.107 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:22:06.107 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21265/300s
[INFO ] 2026-06-01 12:22:06.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-01 12:22:06.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425033,ok=425033,error=0, records=41
[WARN ] 2026-06-01 12:22:07.707 [12471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:22:21.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:22:21.701 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-06-01 12:22:21.701 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425034,ok=425034,error=0, records=41
[WARN ] 2026-06-01 12:22:22.713 [12487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:22:36.109 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:22:36.707 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-01 12:22:36.707 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425035,ok=425035,error=0, records=41
[WARN ] 2026-06-01 12:22:37.718 [12471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:22:42.839 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21263/300s
[INFO ] 2026-06-01 12:22:44.862 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21263/300s
[INFO ] 2026-06-01 12:22:51.109 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:22:51.712 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-01 12:22:51.712 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425036,ok=425036,error=0, records=41
[INFO ] 2026-06-01 12:22:52.239 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21263/300s
[WARN ] 2026-06-01 12:22:52.724 [12487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:23:06.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:23:06.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 12:23:06.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425037,ok=425037,error=0, records=41
[WARN ] 2026-06-01 12:23:07.729 [12487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:23:21.111 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:23:21.792 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 12:23:21.792 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425038,ok=425038,error=0, records=41
[WARN ] 2026-06-01 12:23:22.736 [12466] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:23:36.112 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 12:23:36.112 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 12:23:36.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-01 12:23:36.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425039,ok=425039,error=0, records=41
[WARN ] 2026-06-01 12:23:37.742 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:23:51.113 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:23:51.113 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 12:23:51.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 12:23:51.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425040,ok=425040,error=0, records=41
[WARN ] 2026-06-01 12:23:52.748 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:24:06.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:24:06.811 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 12:24:06.811 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425041,ok=425041,error=0, records=41
[WARN ] 2026-06-01 12:24:07.754 [12487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:24:21.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:24:21.816 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 12:24:21.816 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425042,ok=425042,error=0, records=41
[WARN ] 2026-06-01 12:24:22.759 [12461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:24:36.115 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:24:36.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 12:24:36.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425043,ok=425043,error=0, records=41
[WARN ] 2026-06-01 12:24:37.764 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:24:51.116 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:24:51.826 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 12:24:51.826 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425044,ok=425044,error=0, records=41
[WARN ] 2026-06-01 12:24:52.768 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:25:01.023 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21267/300s
[INFO ] 2026-06-01 12:25:02.804 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861020},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:25:02.976 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:25:02.976 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 12:25:02.976 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:25:02.976 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:25:02.976 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:25:03.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:25:03.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:25:03.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:25:03.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:25:03.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:25:03.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:25:03.271 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21258/300s
[INFO ] 2026-06-01 12:25:06.116 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:25:06.832 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 12:25:06.832 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425045,ok=425045,error=0, records=41
[WARN ] 2026-06-01 12:25:07.773 [12487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:25:21.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:25:21.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 12:25:21.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425046,ok=425046,error=0, records=41
[INFO ] 2026-06-01 12:25:21.840 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21254/300s
[WARN ] 2026-06-01 12:25:22.779 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:25:36.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:25:36.846 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 12:25:36.846 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425047,ok=425047,error=0, records=41
[WARN ] 2026-06-01 12:25:37.784 [12487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:25:41.436 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21267/300s
[INFO ] 2026-06-01 12:25:51.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:25:51.858 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 12:25:51.858 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425048,ok=425048,error=0, records=41
[WARN ] 2026-06-01 12:25:52.790 [12471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:26:06.119 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:26:06.894 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 12:26:06.894 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425049,ok=425049,error=0, records=41
[WARN ] 2026-06-01 12:26:07.795 [12466] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:26:14.842 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21254/300s
[INFO ] 2026-06-01 12:26:21.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:26:21.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 12:26:21.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425050,ok=425050,error=0, records=41
[WARN ] 2026-06-01 12:26:22.799 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:26:36.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:26:36.413 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21263/300s
[INFO ] 2026-06-01 12:26:36.906 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 12:26:36.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425051,ok=425051,error=0, records=41
[WARN ] 2026-06-01 12:26:37.804 [12472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:26:51.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:26:51.917 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 12:26:51.917 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425052,ok=425052,error=0, records=41
[WARN ] 2026-06-01 12:26:52.809 [13040] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:27:06.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:27:06.122 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21266/300s
[INFO ] 2026-06-01 12:27:06.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 12:27:06.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425053,ok=425053,error=0, records=41
[WARN ] 2026-06-01 12:27:07.816 [13025] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:27:21.122 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:27:21.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 12:27:21.933 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425054,ok=425054,error=0, records=41
[WARN ] 2026-06-01 12:27:22.821 [13025] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:27:36.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:27:36.945 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 12:27:36.945 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425055,ok=425055,error=0, records=41
[WARN ] 2026-06-01 12:27:37.826 [12471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:27:42.904 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21264/300s
[INFO ] 2026-06-01 12:27:44.928 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21264/300s
[INFO ] 2026-06-01 12:27:51.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:27:51.961 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 12:27:51.961 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425056,ok=425056,error=0, records=41
[INFO ] 2026-06-01 12:27:52.299 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21264/300s
[WARN ] 2026-06-01 12:27:52.831 [13025] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:28:02.976 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17706/300s
[INFO ] 2026-06-01 12:28:02.978 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860944},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:28:03.141 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:28:03.141 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 12:28:03.141 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:28:03.141 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:28:03.142 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:28:03.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:28:03.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:28:03.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:28:03.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:28:03.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:28:03.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:28:06.124 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:28:06.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 12:28:06.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425057,ok=425057,error=0, records=41
[WARN ] 2026-06-01 12:28:07.838 [13112] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:28:21.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:28:21.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 12:28:21.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425058,ok=425058,error=0, records=41
[WARN ] 2026-06-01 12:28:22.842 [13121] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:28:36.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:28:36.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 12:28:36.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425059,ok=425059,error=0, records=41
[WARN ] 2026-06-01 12:28:37.849 [13040] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:28:51.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:28:51.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 12:28:51.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425060,ok=425060,error=0, records=41
[WARN ] 2026-06-01 12:28:52.853 [13148] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:29:06.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:29:06.990 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-01 12:29:06.990 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425061,ok=425061,error=0, records=41
[WARN ] 2026-06-01 12:29:07.858 [13055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:29:21.127 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:29:21.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-01 12:29:21.996 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425062,ok=425062,error=0, records=41
[WARN ] 2026-06-01 12:29:22.864 [13083] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:29:36.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:29:37.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 12:29:37.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425063,ok=425063,error=0, records=41
[WARN ] 2026-06-01 12:29:37.868 [13190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:29:51.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:29:52.008 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 12:29:52.008 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425064,ok=425064,error=0, records=41
[WARN ] 2026-06-01 12:29:52.873 [13204] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:30:01.027 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21268/300s
[INFO ] 2026-06-01 12:30:03.377 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21259/300s
[INFO ] 2026-06-01 12:30:06.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:30:07.014 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 12:30:07.014 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425065,ok=425065,error=0, records=41
[WARN ] 2026-06-01 12:30:07.878 [13040] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:30:21.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:30:22.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 12:30:22.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425066,ok=425066,error=0, records=41
[INFO ] 2026-06-01 12:30:22.019 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21255/300s
[WARN ] 2026-06-01 12:30:22.884 [13243] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:30:36.130 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:30:37.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 12:30:37.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425067,ok=425067,error=0, records=41
[WARN ] 2026-06-01 12:30:37.890 [13249] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:30:41.442 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21268/300s
[INFO ] 2026-06-01 12:30:51.131 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:30:52.033 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 12:30:52.033 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425068,ok=425068,error=0, records=41
[WARN ] 2026-06-01 12:30:52.896 [13266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:31:03.143 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860868},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:31:03.283 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:31:03.283 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 12:31:03.284 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:31:03.284 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:31:03.284 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:31:03.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:31:03.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:31:03.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:31:03.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:31:03.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:31:03.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:31:06.131 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:31:07.039 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 12:31:07.039 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425069,ok=425069,error=0, records=41
[WARN ] 2026-06-01 12:31:07.901 [13276] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:31:15.024 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21255/300s
[INFO ] 2026-06-01 12:31:21.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:31:22.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 12:31:22.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425070,ok=425070,error=0, records=41
[WARN ] 2026-06-01 12:31:22.907 [13308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:31:36.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:31:36.465 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21264/300s
[INFO ] 2026-06-01 12:31:37.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 12:31:37.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425071,ok=425071,error=0, records=41
[WARN ] 2026-06-01 12:31:37.913 [13332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:31:51.133 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:31:52.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-01 12:31:52.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425072,ok=425072,error=0, records=41
[WARN ] 2026-06-01 12:31:52.918 [13291] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:32:06.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:32:06.134 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21267/300s
[INFO ] 2026-06-01 12:32:07.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 12:32:07.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425073,ok=425073,error=0, records=41
[WARN ] 2026-06-01 12:32:07.922 [13359] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:32:21.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:32:22.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 12:32:22.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425074,ok=425074,error=0, records=41
[WARN ] 2026-06-01 12:32:22.927 [13326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:32:36.135 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:32:37.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 12:32:37.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425075,ok=425075,error=0, records=41
[WARN ] 2026-06-01 12:32:37.934 [13398] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:32:42.963 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21265/300s
[INFO ] 2026-06-01 12:32:44.990 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21265/300s
[INFO ] 2026-06-01 12:32:51.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:32:52.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 12:32:52.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425076,ok=425076,error=0, records=41
[INFO ] 2026-06-01 12:32:52.353 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21265/300s
[WARN ] 2026-06-01 12:32:52.939 [13408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:33:06.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:33:07.091 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10393, records=41
[INFO ] 2026-06-01 12:33:07.091 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425077,ok=425077,error=0, records=41
[WARN ] 2026-06-01 12:33:07.943 [13376] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:33:21.137 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:33:22.097 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 12:33:22.097 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425078,ok=425078,error=0, records=41
[WARN ] 2026-06-01 12:33:22.949 [13431] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:33:36.138 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 12:33:36.138 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 12:33:37.102 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 12:33:37.102 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425079,ok=425079,error=0, records=41
[WARN ] 2026-06-01 12:33:37.954 [13436] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:33:51.138 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:33:52.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-01 12:33:52.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425080,ok=425080,error=0, records=41
[WARN ] 2026-06-01 12:33:52.958 [13442] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:34:03.284 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17707/300s
[INFO ] 2026-06-01 12:34:03.285 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860788},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:34:03.448 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:34:03.448 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 12:34:03.448 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:34:03.448 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:34:03.448 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:34:03.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:34:03.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:34:03.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:34:03.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:34:03.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:34:03.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:34:06.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:34:07.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 12:34:07.114 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425081,ok=425081,error=0, records=41
[WARN ] 2026-06-01 12:34:07.964 [13470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:34:21.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:34:22.121 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 12:34:22.121 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425082,ok=425082,error=0, records=41
[WARN ] 2026-06-01 12:34:22.969 [13484] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:34:36.140 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:34:37.127 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 12:34:37.127 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425083,ok=425083,error=0, records=41
[WARN ] 2026-06-01 12:34:37.975 [13470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:34:51.141 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:34:52.133 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 12:34:52.133 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425084,ok=425084,error=0, records=41
[WARN ] 2026-06-01 12:34:52.979 [13436] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:35:01.030 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21269/300s
[INFO ] 2026-06-01 12:35:03.482 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21260/300s
[INFO ] 2026-06-01 12:35:06.141 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:35:07.139 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 12:35:07.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425085,ok=425085,error=0, records=41
[WARN ] 2026-06-01 12:35:07.984 [13526] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:35:21.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:35:22.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 12:35:22.145 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425086,ok=425086,error=0, records=41
[INFO ] 2026-06-01 12:35:22.145 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21256/300s
[WARN ] 2026-06-01 12:35:22.989 [13526] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:35:36.143 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:35:37.150 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 12:35:37.150 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425087,ok=425087,error=0, records=41
[WARN ] 2026-06-01 12:35:37.993 [13484] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:35:41.449 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21269/300s
[INFO ] 2026-06-01 12:35:51.143 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:35:52.156 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 12:35:52.156 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425088,ok=425088,error=0, records=41
[WARN ] 2026-06-01 12:35:52.998 [13484] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:36:06.144 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:36:07.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 12:36:07.189 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425089,ok=425089,error=0, records=41
[WARN ] 2026-06-01 12:36:08.004 [13582] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:36:15.206 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21256/300s
[INFO ] 2026-06-01 12:36:21.145 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:36:22.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 12:36:22.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425090,ok=425090,error=0, records=41
[WARN ] 2026-06-01 12:36:23.009 [13484] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:36:36.145 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:36:36.519 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21265/300s
[INFO ] 2026-06-01 12:36:37.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 12:36:37.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425091,ok=425091,error=0, records=41
[WARN ] 2026-06-01 12:36:38.014 [13568] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:36:51.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:36:52.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 12:36:52.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425092,ok=425092,error=0, records=41
[WARN ] 2026-06-01 12:36:53.019 [13568] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:37:03.450 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860708},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:37:03.615 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:37:03.616 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 12:37:03.616 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:37:03.616 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:37:03.616 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:37:03.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:37:03.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:37:03.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:37:03.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:37:03.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:37:03.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:37:06.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:37:06.146 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21268/300s
[INFO ] 2026-06-01 12:37:07.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 12:37:07.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425093,ok=425093,error=0, records=41
[WARN ] 2026-06-01 12:37:08.024 [13609] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:37:21.147 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:37:22.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 12:37:22.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425094,ok=425094,error=0, records=41
[WARN ] 2026-06-01 12:37:23.029 [13666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:37:36.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:37:37.221 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 12:37:37.221 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425095,ok=425095,error=0, records=41
[WARN ] 2026-06-01 12:37:38.035 [13651] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:37:43.029 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21266/300s
[INFO ] 2026-06-01 12:37:45.063 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21266/300s
[INFO ] 2026-06-01 12:37:51.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:37:52.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 12:37:52.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425096,ok=425096,error=0, records=41
[INFO ] 2026-06-01 12:37:52.400 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21266/300s
[WARN ] 2026-06-01 12:37:53.040 [13651] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:38:06.149 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:38:07.312 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 12:38:07.312 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425097,ok=425097,error=0, records=41
[WARN ] 2026-06-01 12:38:08.045 [13609] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:38:21.149 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:38:22.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 12:38:22.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425098,ok=425098,error=0, records=41
[WARN ] 2026-06-01 12:38:23.050 [13609] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:38:36.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:38:37.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 12:38:37.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425099,ok=425099,error=0, records=41
[WARN ] 2026-06-01 12:38:37.555 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:38:51.151 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:38:51.151 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 12:38:52.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 12:38:52.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425100,ok=425100,error=0, records=41
[WARN ] 2026-06-01 12:38:52.560 [13749] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:39:06.152 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:39:07.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 12:39:07.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425101,ok=425101,error=0, records=41
[WARN ] 2026-06-01 12:39:07.564 [13782] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:39:21.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:39:22.344 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 12:39:22.344 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425102,ok=425102,error=0, records=41
[WARN ] 2026-06-01 12:39:22.569 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:39:36.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:39:37.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 12:39:37.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425103,ok=425103,error=0, records=41
[WARN ] 2026-06-01 12:39:37.574 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:39:51.154 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:39:52.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 12:39:52.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425104,ok=425104,error=0, records=41
[WARN ] 2026-06-01 12:39:52.581 [13838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:40:01.033 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21270/300s
[INFO ] 2026-06-01 12:40:03.585 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21261/300s
[INFO ] 2026-06-01 12:40:03.616 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17708/300s
[INFO ] 2026-06-01 12:40:03.618 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860624},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:40:03.787 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:40:03.787 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 12:40:03.787 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:40:03.787 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:40:03.787 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:40:03.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:40:03.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:40:03.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:40:03.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:40:03.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:40:03.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:40:06.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:40:07.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 12:40:07.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425105,ok=425105,error=0, records=41
[WARN ] 2026-06-01 12:40:07.587 [13826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:40:21.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:40:22.435 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-01 12:40:22.435 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425106,ok=425106,error=0, records=41
[INFO ] 2026-06-01 12:40:22.435 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21257/300s
[WARN ] 2026-06-01 12:40:22.592 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:40:36.156 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:40:37.443 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-01 12:40:37.443 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425107,ok=425107,error=0, records=41
[WARN ] 2026-06-01 12:40:37.598 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:40:41.456 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21270/300s
[INFO ] 2026-06-01 12:40:51.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:40:52.451 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 12:40:52.451 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425108,ok=425108,error=0, records=41
[WARN ] 2026-06-01 12:40:52.603 [13885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:41:06.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:41:07.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-01 12:41:07.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425109,ok=425109,error=0, records=41
[WARN ] 2026-06-01 12:41:07.609 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:41:15.385 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21257/300s
[INFO ] 2026-06-01 12:41:21.158 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:41:22.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 12:41:22.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425110,ok=425110,error=0, records=41
[WARN ] 2026-06-01 12:41:22.615 [13885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:41:36.159 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:41:36.572 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21266/300s
[INFO ] 2026-06-01 12:41:37.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-01 12:41:37.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425111,ok=425111,error=0, records=41
[WARN ] 2026-06-01 12:41:37.620 [13826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:41:51.159 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:41:52.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-01 12:41:52.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425112,ok=425112,error=0, records=41
[WARN ] 2026-06-01 12:41:52.625 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:42:06.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:42:06.160 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21269/300s
[INFO ] 2026-06-01 12:42:07.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 12:42:07.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425113,ok=425113,error=0, records=41
[WARN ] 2026-06-01 12:42:07.630 [13826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:42:21.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:42:22.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 12:42:22.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425114,ok=425114,error=0, records=41
[WARN ] 2026-06-01 12:42:22.635 [13885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:42:36.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:42:37.493 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 12:42:37.493 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425115,ok=425115,error=0, records=41
[WARN ] 2026-06-01 12:42:37.640 [13826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:42:43.100 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21267/300s
[INFO ] 2026-06-01 12:42:45.138 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21267/300s
[INFO ] 2026-06-01 12:42:51.162 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:42:52.447 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21267/300s
[INFO ] 2026-06-01 12:42:52.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 12:42:52.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425116,ok=425116,error=0, records=41
[WARN ] 2026-06-01 12:42:52.645 [13885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:43:03.789 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860548},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:43:03.959 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:43:03.959 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 12:43:03.959 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:43:03.959 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:43:03.959 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:43:03.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:43:03.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:43:03.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:43:03.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:43:03.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:43:03.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:43:06.162 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:43:07.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 12:43:07.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425117,ok=425117,error=0, records=41
[WARN ] 2026-06-01 12:43:07.650 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:43:21.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:43:22.518 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 12:43:22.518 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425118,ok=425118,error=0, records=41
[WARN ] 2026-06-01 12:43:22.656 [13876] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:43:36.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 12:43:36.164 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 12:43:37.525 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 12:43:37.525 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425119,ok=425119,error=0, records=41
[WARN ] 2026-06-01 12:43:37.661 [13826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:43:51.164 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:43:52.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 12:43:52.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425120,ok=425120,error=0, records=41
[WARN ] 2026-06-01 12:43:52.667 [13826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:44:06.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:44:07.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 12:44:07.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425121,ok=425121,error=0, records=41
[WARN ] 2026-06-01 12:44:07.672 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:44:21.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:44:22.541 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 12:44:22.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425122,ok=425122,error=0, records=41
[WARN ] 2026-06-01 12:44:22.677 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:44:36.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:44:37.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 12:44:37.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425123,ok=425123,error=0, records=41
[WARN ] 2026-06-01 12:44:37.682 [13876] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:44:51.167 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:44:52.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 12:44:52.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425124,ok=425124,error=0, records=41
[WARN ] 2026-06-01 12:44:52.687 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:45:01.037 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21271/300s
[INFO ] 2026-06-01 12:45:03.691 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21262/300s
[INFO ] 2026-06-01 12:45:06.167 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:45:07.560 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 12:45:07.560 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425125,ok=425125,error=0, records=41
[WARN ] 2026-06-01 12:45:07.692 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:45:21.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:45:22.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 12:45:22.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425126,ok=425126,error=0, records=41
[INFO ] 2026-06-01 12:45:22.569 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21258/300s
[WARN ] 2026-06-01 12:45:22.698 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:45:36.169 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:45:37.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 12:45:37.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425127,ok=425127,error=0, records=41
[WARN ] 2026-06-01 12:45:37.703 [13885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:45:41.462 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21271/300s
[INFO ] 2026-06-01 12:45:51.169 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:45:52.583 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 12:45:52.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425128,ok=425128,error=0, records=41
[WARN ] 2026-06-01 12:45:52.707 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:46:03.959 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17709/300s
[INFO ] 2026-06-01 12:46:03.961 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860472},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:46:04.121 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:46:04.121 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 12:46:04.121 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:46:04.121 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:46:04.121 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:46:04.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:46:04.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:46:04.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:46:04.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:46:04.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:46:04.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:46:06.170 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:46:07.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 12:46:07.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425129,ok=425129,error=0, records=41
[WARN ] 2026-06-01 12:46:07.712 [13826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:46:15.567 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21258/300s
[INFO ] 2026-06-01 12:46:21.170 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:46:22.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 12:46:22.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425130,ok=425130,error=0, records=41
[WARN ] 2026-06-01 12:46:22.716 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:46:36.171 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:46:36.628 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21267/300s
[INFO ] 2026-06-01 12:46:37.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 12:46:37.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425131,ok=425131,error=0, records=41
[WARN ] 2026-06-01 12:46:37.721 [13876] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:46:51.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:46:52.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 12:46:52.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425132,ok=425132,error=0, records=41
[WARN ] 2026-06-01 12:46:52.726 [13826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:47:06.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:47:06.172 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21270/300s
[INFO ] 2026-06-01 12:47:07.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 12:47:07.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425133,ok=425133,error=0, records=41
[WARN ] 2026-06-01 12:47:07.730 [13826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:47:21.173 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:47:22.617 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 12:47:22.617 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425134,ok=425134,error=0, records=41
[WARN ] 2026-06-01 12:47:22.736 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:47:36.174 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:47:37.626 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 12:47:37.626 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425135,ok=425135,error=0, records=41
[WARN ] 2026-06-01 12:47:37.742 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:47:43.176 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21268/300s
[INFO ] 2026-06-01 12:47:45.190 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21268/300s
[INFO ] 2026-06-01 12:47:51.174 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:47:52.497 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21268/300s
[INFO ] 2026-06-01 12:47:52.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 12:47:52.631 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425136,ok=425136,error=0, records=41
[WARN ] 2026-06-01 12:47:52.746 [13826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:48:06.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:48:07.703 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 12:48:07.703 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425137,ok=425137,error=0, records=41
[WARN ] 2026-06-01 12:48:07.751 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:48:21.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:48:22.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-06-01 12:48:22.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425138,ok=425138,error=0, records=41
[WARN ] 2026-06-01 12:48:22.756 [13885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:48:36.176 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:48:37.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-01 12:48:37.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425139,ok=425139,error=0, records=41
[WARN ] 2026-06-01 12:48:37.761 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:48:51.177 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:48:52.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 12:48:52.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425140,ok=425140,error=0, records=41
[WARN ] 2026-06-01 12:48:52.766 [13826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:49:04.123 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860396},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:49:04.297 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:49:04.297 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 12:49:04.297 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:49:04.297 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:49:04.297 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:49:04.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:49:04.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:49:04.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:49:04.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:49:04.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:49:04.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:49:06.178 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:49:07.728 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 12:49:07.728 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425141,ok=425141,error=0, records=41
[WARN ] 2026-06-01 12:49:07.770 [13876] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:49:21.178 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:49:22.734 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 12:49:22.734 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425142,ok=425142,error=0, records=41
[WARN ] 2026-06-01 12:49:22.776 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:49:36.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:49:37.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 12:49:37.748 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425143,ok=425143,error=0, records=41
[WARN ] 2026-06-01 12:49:37.781 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:49:51.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:49:52.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 12:49:52.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425144,ok=425144,error=0, records=41
[WARN ] 2026-06-01 12:49:52.786 [13885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:50:01.040 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21272/300s
[INFO ] 2026-06-01 12:50:03.790 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21263/300s
[INFO ] 2026-06-01 12:50:06.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:50:07.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 12:50:07.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425145,ok=425145,error=0, records=41
[WARN ] 2026-06-01 12:50:07.791 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:50:21.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:50:22.763 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-01 12:50:22.763 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425146,ok=425146,error=0, records=41
[INFO ] 2026-06-01 12:50:22.763 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21259/300s
[WARN ] 2026-06-01 12:50:22.796 [13876] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:50:36.181 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:50:37.769 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 12:50:37.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425147,ok=425147,error=0, records=41
[WARN ] 2026-06-01 12:50:37.801 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:50:41.468 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21272/300s
[INFO ] 2026-06-01 12:50:51.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:50:52.775 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-01 12:50:52.775 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425148,ok=425148,error=0, records=41
[WARN ] 2026-06-01 12:50:52.806 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:51:06.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:51:07.782 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 12:51:07.782 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425149,ok=425149,error=0, records=41
[WARN ] 2026-06-01 12:51:07.811 [14444] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:51:15.752 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21259/300s
[INFO ] 2026-06-01 12:51:21.183 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:51:22.787 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 12:51:22.787 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425150,ok=425150,error=0, records=41
[WARN ] 2026-06-01 12:51:22.817 [14465] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:51:36.183 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:51:36.684 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21268/300s
[INFO ] 2026-06-01 12:51:37.793 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 12:51:37.793 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425151,ok=425151,error=0, records=41
[WARN ] 2026-06-01 12:51:37.822 [14450] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:51:51.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:51:52.800 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 12:51:52.800 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425152,ok=425152,error=0, records=41
[WARN ] 2026-06-01 12:51:52.827 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:52:04.297 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17710/300s
[INFO ] 2026-06-01 12:52:04.299 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860324},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:52:04.489 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:52:04.489 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 12:52:04.489 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:52:04.489 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:52:04.489 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:52:04.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:52:04.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:52:04.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:52:04.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:52:04.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:52:04.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:52:06.185 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:52:06.185 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21271/300s
[INFO ] 2026-06-01 12:52:07.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 12:52:07.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425153,ok=425153,error=0, records=41
[WARN ] 2026-06-01 12:52:07.834 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:52:21.185 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:52:22.811 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 12:52:22.811 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425154,ok=425154,error=0, records=41
[WARN ] 2026-06-01 12:52:22.839 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:52:36.186 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:52:37.816 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 12:52:37.816 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425155,ok=425155,error=0, records=41
[WARN ] 2026-06-01 12:52:37.844 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:52:43.241 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21269/300s
[INFO ] 2026-06-01 12:52:45.243 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21269/300s
[INFO ] 2026-06-01 12:52:51.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:52:52.550 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21269/300s
[INFO ] 2026-06-01 12:52:52.822 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 12:52:52.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425156,ok=425156,error=0, records=41
[WARN ] 2026-06-01 12:52:52.849 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:53:06.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:53:07.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 12:53:07.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425157,ok=425157,error=0, records=41
[WARN ] 2026-06-01 12:53:07.856 [13756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:53:21.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:53:22.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 12:53:22.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425158,ok=425158,error=0, records=41
[WARN ] 2026-06-01 12:53:22.861 [14568] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:53:36.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 12:53:36.189 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 12:53:37.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 12:53:37.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425159,ok=425159,error=0, records=41
[WARN ] 2026-06-01 12:53:37.865 [14554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:53:51.189 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:53:51.189 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 12:53:52.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 12:53:52.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425160,ok=425160,error=0, records=41
[WARN ] 2026-06-01 12:53:52.870 [14554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:54:06.190 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=24.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:54:07.851 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 12:54:07.851 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425161,ok=425161,error=0, records=41
[WARN ] 2026-06-01 12:54:07.876 [14616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:54:21.191 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:54:22.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 12:54:22.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425162,ok=425162,error=0, records=41
[WARN ] 2026-06-01 12:54:22.882 [14554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:54:36.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:54:37.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 12:54:37.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425163,ok=425163,error=0, records=41
[WARN ] 2026-06-01 12:54:37.887 [14627] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:54:51.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:54:52.869 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 12:54:52.869 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425164,ok=425164,error=0, records=41
[WARN ] 2026-06-01 12:54:52.893 [14647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:55:01.044 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21273/300s
[INFO ] 2026-06-01 12:55:03.896 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21264/300s
[INFO ] 2026-06-01 12:55:04.491 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860244},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:55:04.632 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:55:04.632 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 12:55:04.633 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:55:04.633 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:55:04.633 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:55:04.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:55:04.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:55:04.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:55:04.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:55:04.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:55:04.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:55:06.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:55:07.875 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-01 12:55:07.875 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425165,ok=425165,error=0, records=41
[WARN ] 2026-06-01 12:55:07.898 [14627] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:55:21.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:55:22.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 12:55:22.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425166,ok=425166,error=0, records=41
[INFO ] 2026-06-01 12:55:22.880 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21260/300s
[WARN ] 2026-06-01 12:55:22.904 [14761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 12:55:32.409 [14659] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/12341/stat), No such file or directory
[INFO ] 2026-06-01 12:55:36.194 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:55:37.884 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 12:55:37.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425167,ok=425167,error=0, records=41
[WARN ] 2026-06-01 12:55:37.910 [14699] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:55:41.475 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21273/300s
[WARN ] 2026-06-01 12:55:47.415 [14761] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/12341/stat), No such file or directory
[WARN ] 2026-06-01 12:55:47.415 [14761] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/12338/stat), No such file or directory
[INFO ] 2026-06-01 12:55:51.194 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:55:52.890 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 12:55:52.890 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425168,ok=425168,error=0, records=41
[WARN ] 2026-06-01 12:55:52.916 [14780] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:56:06.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:56:07.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-01 12:56:07.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425169,ok=425169,error=0, records=41
[WARN ] 2026-06-01 12:56:07.922 [14808] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:56:15.930 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21260/300s
[WARN ] 2026-06-01 12:56:17.427 [14761] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/12304/stat), No such file or directory
[INFO ] 2026-06-01 12:56:21.196 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:56:22.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 12:56:22.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425170,ok=425170,error=0, records=41
[WARN ] 2026-06-01 12:56:22.927 [14820] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 12:56:32.433 [14820] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/12304/stat), No such file or directory
[INFO ] 2026-06-01 12:56:36.196 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:56:36.737 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21269/300s
[INFO ] 2026-06-01 12:56:37.906 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 12:56:37.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425171,ok=425171,error=0, records=41
[WARN ] 2026-06-01 12:56:37.933 [14825] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 12:56:47.440 [14858] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/12304/stat), No such file or directory
[INFO ] 2026-06-01 12:56:51.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:56:52.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 12:56:52.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425172,ok=425172,error=0, records=41
[WARN ] 2026-06-01 12:56:52.939 [14826] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:57:06.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:57:06.197 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21272/300s
[INFO ] 2026-06-01 12:57:07.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 12:57:07.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425173,ok=425173,error=0, records=41
[WARN ] 2026-06-01 12:57:07.944 [14870] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:57:21.198 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:57:22.933 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 12:57:22.933 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425174,ok=425174,error=0, records=41
[WARN ] 2026-06-01 12:57:22.949 [14893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:57:36.199 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:57:37.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 12:57:37.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425175,ok=425175,error=0, records=41
[WARN ] 2026-06-01 12:57:37.954 [14892] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:57:43.292 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21270/300s
[INFO ] 2026-06-01 12:57:45.293 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21270/300s
[INFO ] 2026-06-01 12:57:51.199 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:57:52.600 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21270/300s
[INFO ] 2026-06-01 12:57:52.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 12:57:52.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425176,ok=425176,error=0, records=41
[WARN ] 2026-06-01 12:57:52.959 [14843] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:58:04.633 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17711/300s
[INFO ] 2026-06-01 12:58:04.634 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860160},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 12:58:04.794 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 12:58:04.794 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 12:58:04.794 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 12:58:04.794 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 12:58:04.794 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 12:58:04.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:58:04.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 12:58:04.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:58:04.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 12:58:04.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:58:04.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 12:58:06.200 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:58:07.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10388, records=41
[INFO ] 2026-06-01 12:58:07.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425177,ok=425177,error=0, records=41
[WARN ] 2026-06-01 12:58:07.964 [14921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:58:21.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:58:22.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-01 12:58:22.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425178,ok=425178,error=0, records=41
[WARN ] 2026-06-01 12:58:22.969 [14935] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:58:36.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:58:37.959 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-01 12:58:37.959 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425179,ok=425179,error=0, records=41
[WARN ] 2026-06-01 12:58:37.974 [14950] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:58:51.202 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:58:52.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-01 12:58:52.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425180,ok=425180,error=0, records=41
[WARN ] 2026-06-01 12:58:52.979 [14950] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:59:06.203 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:59:07.970 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 12:59:07.971 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425181,ok=425181,error=0, records=41
[WARN ] 2026-06-01 12:59:07.984 [14892] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:59:21.203 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:59:22.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 12:59:22.980 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425182,ok=425182,error=0, records=41
[WARN ] 2026-06-01 12:59:22.990 [14950] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:59:36.204 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:59:37.987 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 12:59:37.987 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425183,ok=425183,error=0, records=41
[WARN ] 2026-06-01 12:59:37.994 [14978] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 12:59:51.204 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 12:59:52.992 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 12:59:52.992 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425184,ok=425184,error=0, records=41
[WARN ] 2026-06-01 12:59:52.999 [14992] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:00:01.047 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21274/300s
[INFO ] 2026-06-01 13:00:04.003 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21265/300s
[INFO ] 2026-06-01 13:00:06.205 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:00:07.997 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 13:00:07.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425185,ok=425185,error=0, records=41
[WARN ] 2026-06-01 13:00:08.005 [15033] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:00:21.206 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:00:23.003 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 13:00:23.003 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425186,ok=425186,error=0, records=41
[INFO ] 2026-06-01 13:00:23.003 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21261/300s
[WARN ] 2026-06-01 13:00:23.009 [14950] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:00:36.206 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:00:38.009 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 13:00:38.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425187,ok=425187,error=0, records=41
[WARN ] 2026-06-01 13:00:38.014 [15052] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:00:41.481 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21274/300s
[INFO ] 2026-06-01 13:00:51.207 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:00:53.016 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 13:00:53.016 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425188,ok=425188,error=0, records=41
[WARN ] 2026-06-01 13:00:53.020 [15033] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:01:04.796 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860084},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:01:04.959 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:01:04.959 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-01 13:01:04.959 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:01:04.959 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:01:04.959 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:01:04.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:01:04.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:01:04.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:01:04.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:01:04.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:01:04.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:01:06.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:01:08.021 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 13:01:08.021 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425189,ok=425189,error=0, records=41
[WARN ] 2026-06-01 13:01:08.024 [15033] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:01:16.110 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21261/300s
[INFO ] 2026-06-01 13:01:21.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:01:23.026 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 13:01:23.026 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425190,ok=425190,error=0, records=41
[WARN ] 2026-06-01 13:01:23.029 [14950] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:01:36.209 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:01:36.791 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21270/300s
[INFO ] 2026-06-01 13:01:38.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-01 13:01:38.032 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425191,ok=425191,error=0, records=41
[WARN ] 2026-06-01 13:01:38.035 [14950] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:01:51.209 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:01:53.037 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 13:01:53.037 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425192,ok=425192,error=0, records=41
[WARN ] 2026-06-01 13:01:53.039 [14950] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:02:06.210 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:02:06.210 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21273/300s
[INFO ] 2026-06-01 13:02:08.043 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 13:02:08.043 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425193,ok=425193,error=0, records=41
[WARN ] 2026-06-01 13:02:08.045 [15285] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:02:21.210 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:02:23.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 13:02:23.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425194,ok=425194,error=0, records=41
[WARN ] 2026-06-01 13:02:23.050 [15306] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:02:36.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:02:37.554 [15306] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:02:38.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 13:02:38.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425195,ok=425195,error=0, records=41
[INFO ] 2026-06-01 13:02:43.338 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21271/300s
[INFO ] 2026-06-01 13:02:45.342 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21271/300s
[INFO ] 2026-06-01 13:02:51.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:02:52.558 [15285] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:02:52.645 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21271/300s
[INFO ] 2026-06-01 13:02:53.058 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 13:02:53.058 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425196,ok=425196,error=0, records=41
[INFO ] 2026-06-01 13:03:06.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:03:07.564 [15341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:03:08.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10400, records=41
[INFO ] 2026-06-01 13:03:08.063 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425197,ok=425197,error=0, records=41
[INFO ] 2026-06-01 13:03:21.213 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:03:22.569 [15354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:03:23.070 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10384, records=41
[INFO ] 2026-06-01 13:03:23.070 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425198,ok=425198,error=0, records=41
[INFO ] 2026-06-01 13:03:36.213 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 13:03:36.213 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 13:03:37.575 [15386] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:03:38.075 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10384, records=41
[INFO ] 2026-06-01 13:03:38.076 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425199,ok=425199,error=0, records=41
[INFO ] 2026-06-01 13:03:51.214 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:03:52.581 [15402] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:03:53.084 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 13:03:53.084 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425200,ok=425200,error=0, records=41
[INFO ] 2026-06-01 13:04:04.960 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17712/300s
[INFO ] 2026-06-01 13:04:04.961 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859984},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:04:05.121 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:04:05.121 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 13:04:05.121 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:04:05.121 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:04:05.121 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:04:05.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:04:05.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:04:05.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:04:05.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:04:05.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:04:05.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:04:06.215 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.73%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:04:07.589 [15403] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:04:08.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 13:04:08.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425201,ok=425201,error=0, records=41
[INFO ] 2026-06-01 13:04:21.215 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:04:22.593 [15440] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:04:23.112 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11945, records=51
[INFO ] 2026-06-01 13:04:23.112 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425202,ok=425202,error=0, records=51
[INFO ] 2026-06-01 13:04:36.216 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:04:37.599 [15440] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:04:38.118 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 13:04:38.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425203,ok=425203,error=0, records=41
[INFO ] 2026-06-01 13:04:51.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:04:52.604 [15403] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:04:53.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 13:04:53.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425204,ok=425204,error=0, records=41
[INFO ] 2026-06-01 13:05:01.050 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21275/300s
[INFO ] 2026-06-01 13:05:04.107 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21266/300s
[INFO ] 2026-06-01 13:05:06.218 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:05:07.609 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:05:08.132 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 13:05:08.132 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425205,ok=425205,error=0, records=41
[INFO ] 2026-06-01 13:05:21.218 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:05:22.615 [15440] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:05:23.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 13:05:23.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425206,ok=425206,error=0, records=41
[INFO ] 2026-06-01 13:05:23.137 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21262/300s
[INFO ] 2026-06-01 13:05:36.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:05:37.620 [15426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:05:38.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 13:05:38.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425207,ok=425207,error=0, records=41
[INFO ] 2026-06-01 13:05:41.487 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21275/300s
[INFO ] 2026-06-01 13:05:51.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:05:52.625 [15440] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:05:53.163 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 13:05:53.163 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425208,ok=425208,error=0, records=41
[INFO ] 2026-06-01 13:06:06.220 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:06:07.630 [15403] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:06:08.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 13:06:08.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425209,ok=425209,error=0, records=41
[INFO ] 2026-06-01 13:06:16.291 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21262/300s
[INFO ] 2026-06-01 13:06:21.221 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:06:22.636 [15406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:06:23.230 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 13:06:23.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425210,ok=425210,error=0, records=41
[INFO ] 2026-06-01 13:06:36.222 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:06:36.848 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21271/300s
[WARN ] 2026-06-01 13:06:37.641 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:06:38.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 13:06:38.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425211,ok=425211,error=0, records=41
[INFO ] 2026-06-01 13:06:51.222 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:06:52.646 [15440] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:06:53.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 13:06:53.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425212,ok=425212,error=0, records=41
[INFO ] 2026-06-01 13:07:05.127 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859916},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:07:05.305 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:07:05.305 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 13:07:05.305 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:07:05.305 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:07:05.305 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:07:05.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:07:05.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:07:05.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:07:05.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:07:05.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:07:05.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:07:06.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.80%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:07:06.223 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21274/300s
[WARN ] 2026-06-01 13:07:07.652 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:07:08.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-01 13:07:08.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425213,ok=425213,error=0, records=41
[INFO ] 2026-06-01 13:07:21.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:07:22.657 [15403] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:07:23.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 13:07:23.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425214,ok=425214,error=0, records=41
[INFO ] 2026-06-01 13:07:36.224 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:07:37.663 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:07:38.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 13:07:38.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425215,ok=425215,error=0, records=41
[INFO ] 2026-06-01 13:07:43.422 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21272/300s
[INFO ] 2026-06-01 13:07:45.424 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21272/300s
[INFO ] 2026-06-01 13:07:51.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:07:52.666 [15406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:07:52.730 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21272/300s
[INFO ] 2026-06-01 13:07:53.267 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 13:07:53.267 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425216,ok=425216,error=0, records=41
[INFO ] 2026-06-01 13:08:06.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:08:07.673 [15403] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:08:08.272 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 13:08:08.272 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425217,ok=425217,error=0, records=41
[INFO ] 2026-06-01 13:08:21.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:08:22.677 [15426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:08:23.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 13:08:23.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425218,ok=425218,error=0, records=41
[INFO ] 2026-06-01 13:08:36.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:08:37.683 [15426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:08:38.284 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 13:08:38.284 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425219,ok=425219,error=0, records=41
[INFO ] 2026-06-01 13:08:51.227 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:08:51.227 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 13:08:52.688 [15426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:08:53.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 13:08:53.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425220,ok=425220,error=0, records=41
[INFO ] 2026-06-01 13:09:06.228 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:09:07.694 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:09:08.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 13:09:08.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425221,ok=425221,error=0, records=41
[INFO ] 2026-06-01 13:09:21.229 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:09:22.699 [15406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:09:23.304 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-01 13:09:23.304 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425222,ok=425222,error=0, records=41
[INFO ] 2026-06-01 13:09:36.229 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:09:37.705 [15406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:09:38.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-01 13:09:38.309 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425223,ok=425223,error=0, records=41
[INFO ] 2026-06-01 13:09:51.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:09:52.709 [15403] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:09:53.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-01 13:09:53.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425224,ok=425224,error=0, records=41
[INFO ] 2026-06-01 13:10:01.054 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21276/300s
[INFO ] 2026-06-01 13:10:04.213 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21267/300s
[INFO ] 2026-06-01 13:10:05.305 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17713/300s
[INFO ] 2026-06-01 13:10:05.307 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859852},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:10:05.463 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:10:05.463 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 13:10:05.463 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:10:05.463 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:10:05.463 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:10:05.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:10:05.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:10:05.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:10:05.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:10:05.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:10:05.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:10:06.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:10:07.716 [15403] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:10:08.320 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 13:10:08.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425225,ok=425225,error=0, records=41
[INFO ] 2026-06-01 13:10:21.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:10:22.721 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:10:23.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 13:10:23.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425226,ok=425226,error=0, records=41
[INFO ] 2026-06-01 13:10:23.328 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21263/300s
[INFO ] 2026-06-01 13:10:36.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:10:37.726 [15406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:10:38.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 13:10:38.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425227,ok=425227,error=0, records=41
[INFO ] 2026-06-01 13:10:41.494 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21276/300s
[INFO ] 2026-06-01 13:10:51.232 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=11[>=300 0/4]
[WARN ] 2026-06-01 13:10:52.732 [15426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:10:53.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 13:10:53.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425228,ok=425228,error=0, records=41
[INFO ] 2026-06-01 13:11:06.233 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:11:07.737 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:11:08.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 13:11:08.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425229,ok=425229,error=0, records=41
[INFO ] 2026-06-01 13:11:16.472 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21263/300s
[INFO ] 2026-06-01 13:11:21.233 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:11:22.742 [15406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:11:23.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 13:11:23.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425230,ok=425230,error=0, records=41
[INFO ] 2026-06-01 13:11:36.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:11:36.902 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21272/300s
[WARN ] 2026-06-01 13:11:37.749 [15426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:11:38.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 13:11:38.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425231,ok=425231,error=0, records=41
[INFO ] 2026-06-01 13:11:51.235 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:11:52.754 [15406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:11:53.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 13:11:53.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425232,ok=425232,error=0, records=41
[INFO ] 2026-06-01 13:12:06.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:12:06.236 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21275/300s
[WARN ] 2026-06-01 13:12:07.759 [15426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:12:08.377 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 13:12:08.377 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425233,ok=425233,error=0, records=41
[INFO ] 2026-06-01 13:12:21.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:12:22.763 [15426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:12:23.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 13:12:23.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425234,ok=425234,error=0, records=41
[INFO ] 2026-06-01 13:12:36.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:12:37.770 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:12:38.388 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 13:12:38.388 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425235,ok=425235,error=0, records=41
[INFO ] 2026-06-01 13:12:43.474 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21273/300s
[INFO ] 2026-06-01 13:12:45.476 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21273/300s
[INFO ] 2026-06-01 13:12:51.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:12:52.775 [15426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:12:52.782 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21273/300s
[INFO ] 2026-06-01 13:12:53.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 13:12:53.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425236,ok=425236,error=0, records=41
[INFO ] 2026-06-01 13:13:05.465 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859784},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:13:05.620 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:13:05.620 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 13:13:05.620 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:13:05.620 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:13:05.620 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:13:05.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:13:05.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:13:05.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:13:05.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:13:05.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:13:05.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:13:06.238 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:13:07.780 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:13:08.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 13:13:08.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425237,ok=425237,error=0, records=41
[INFO ] 2026-06-01 13:13:21.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:13:22.786 [15440] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:13:23.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 13:13:23.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425238,ok=425238,error=0, records=41
[INFO ] 2026-06-01 13:13:36.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 13:13:36.239 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 13:13:37.792 [15440] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:13:38.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 13:13:38.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425239,ok=425239,error=0, records=41
[INFO ] 2026-06-01 13:13:51.240 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:13:52.797 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:13:53.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 13:13:53.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425240,ok=425240,error=0, records=41
[INFO ] 2026-06-01 13:14:06.241 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:14:07.801 [15406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:14:08.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 13:14:08.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425241,ok=425241,error=0, records=41
[INFO ] 2026-06-01 13:14:21.241 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:14:22.806 [15426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:14:23.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 13:14:23.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425242,ok=425242,error=0, records=41
[INFO ] 2026-06-01 13:14:36.242 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:14:37.812 [15985] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:14:38.431 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 13:14:38.431 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425243,ok=425243,error=0, records=41
[INFO ] 2026-06-01 13:14:51.243 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:14:52.817 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:14:53.436 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 13:14:53.436 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425244,ok=425244,error=0, records=41
[INFO ] 2026-06-01 13:15:01.057 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21277/300s
[INFO ] 2026-06-01 13:15:04.320 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21268/300s
[INFO ] 2026-06-01 13:15:06.243 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:15:07.821 [15426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:15:08.441 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 13:15:08.441 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425245,ok=425245,error=0, records=41
[INFO ] 2026-06-01 13:15:21.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:15:22.828 [15990] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:15:23.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 13:15:23.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425246,ok=425246,error=0, records=41
[INFO ] 2026-06-01 13:15:23.447 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21264/300s
[INFO ] 2026-06-01 13:15:36.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:15:37.833 [15406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:15:38.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 13:15:38.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425247,ok=425247,error=0, records=41
[INFO ] 2026-06-01 13:15:41.500 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21277/300s
[INFO ] 2026-06-01 13:15:51.245 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:15:52.838 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:15:53.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 13:15:53.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425248,ok=425248,error=0, records=41
[INFO ] 2026-06-01 13:16:05.620 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17714/300s
[INFO ] 2026-06-01 13:16:05.622 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859712},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:16:05.805 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:16:05.805 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 13:16:05.805 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:16:05.805 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:16:05.805 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:16:05.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:16:05.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:16:05.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:16:05.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:16:05.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:16:05.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:16:06.246 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=28.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:16:07.843 [16059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:16:08.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 13:16:08.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425249,ok=425249,error=0, records=41
[INFO ] 2026-06-01 13:16:16.657 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21264/300s
[INFO ] 2026-06-01 13:16:21.246 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:16:22.849 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:16:23.482 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 13:16:23.482 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425250,ok=425250,error=0, records=41
[INFO ] 2026-06-01 13:16:36.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:16:36.962 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21273/300s
[WARN ] 2026-06-01 13:16:37.854 [16099] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:16:38.491 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 13:16:38.491 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425251,ok=425251,error=0, records=41
[INFO ] 2026-06-01 13:16:51.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:16:52.859 [15418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:16:53.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 13:16:53.497 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425252,ok=425252,error=0, records=41
[INFO ] 2026-06-01 13:17:06.248 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:17:06.248 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21276/300s
[WARN ] 2026-06-01 13:17:07.865 [16099] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:17:08.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 13:17:08.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425253,ok=425253,error=0, records=41
[INFO ] 2026-06-01 13:17:21.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:17:22.870 [16141] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:17:23.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 13:17:23.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425254,ok=425254,error=0, records=41
[INFO ] 2026-06-01 13:17:36.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:17:37.876 [16141] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:17:38.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 13:17:38.545 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425255,ok=425255,error=0, records=41
[INFO ] 2026-06-01 13:17:43.546 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21274/300s
[INFO ] 2026-06-01 13:17:45.548 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21274/300s
[INFO ] 2026-06-01 13:17:51.250 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:17:52.855 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21274/300s
[WARN ] 2026-06-01 13:17:52.881 [16177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:17:53.550 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 13:17:53.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425256,ok=425256,error=0, records=41
[INFO ] 2026-06-01 13:18:06.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:18:07.886 [16171] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:18:08.555 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 13:18:08.555 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425257,ok=425257,error=0, records=41
[INFO ] 2026-06-01 13:18:21.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:18:22.891 [16210] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:18:23.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 13:18:23.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425258,ok=425258,error=0, records=41
[INFO ] 2026-06-01 13:18:36.252 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:18:37.897 [16171] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:18:38.566 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-01 13:18:38.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425259,ok=425259,error=0, records=41
[INFO ] 2026-06-01 13:18:51.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:18:52.902 [16237] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:18:53.572 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 13:18:53.572 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425260,ok=425260,error=0, records=41
[INFO ] 2026-06-01 13:19:05.807 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859632},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:19:05.985 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:19:05.985 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 13:19:05.985 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:19:05.985 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:19:05.985 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:19:06.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:19:06.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:19:06.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:19:06.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:19:06.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:19:06.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:19:06.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:19:07.907 [16242] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:19:08.579 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 13:19:08.579 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425261,ok=425261,error=0, records=41
[INFO ] 2026-06-01 13:19:21.254 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:19:22.913 [16274] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:19:23.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 13:19:23.584 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425262,ok=425262,error=0, records=41
[INFO ] 2026-06-01 13:19:36.254 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:19:37.919 [16274] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:19:38.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 13:19:38.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425263,ok=425263,error=0, records=41
[INFO ] 2026-06-01 13:19:51.255 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:19:52.925 [16205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:19:53.596 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 13:19:53.596 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425264,ok=425264,error=0, records=41
[INFO ] 2026-06-01 13:20:01.061 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21278/300s
[INFO ] 2026-06-01 13:20:04.429 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21269/300s
[INFO ] 2026-06-01 13:20:06.255 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:20:07.932 [16312] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:20:08.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 13:20:08.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425265,ok=425265,error=0, records=41
[INFO ] 2026-06-01 13:20:21.256 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:20:22.938 [16312] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:20:23.663 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 13:20:23.663 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425266,ok=425266,error=0, records=41
[INFO ] 2026-06-01 13:20:23.663 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21265/300s
[INFO ] 2026-06-01 13:20:36.257 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:20:37.943 [16364] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:20:38.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 13:20:38.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425267,ok=425267,error=0, records=41
[INFO ] 2026-06-01 13:20:41.507 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21278/300s
[INFO ] 2026-06-01 13:20:51.257 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:20:52.948 [16381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:20:53.674 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 13:20:53.674 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425268,ok=425268,error=0, records=41
[INFO ] 2026-06-01 13:21:06.258 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:21:07.954 [16391] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:21:08.678 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 13:21:08.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425269,ok=425269,error=0, records=41
[INFO ] 2026-06-01 13:21:16.839 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21265/300s
[INFO ] 2026-06-01 13:21:21.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:21:22.959 [16296] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:21:23.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 13:21:23.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425270,ok=425270,error=0, records=41
[INFO ] 2026-06-01 13:21:36.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:21:37.018 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21274/300s
[WARN ] 2026-06-01 13:21:37.964 [16296] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:21:38.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 13:21:38.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425271,ok=425271,error=0, records=41
[INFO ] 2026-06-01 13:21:51.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:21:52.968 [16391] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:21:53.706 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 13:21:53.706 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425272,ok=425272,error=0, records=41
[INFO ] 2026-06-01 13:22:05.986 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17715/300s
[INFO ] 2026-06-01 13:22:05.987 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859544},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:22:06.176 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:22:06.177 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 13:22:06.177 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:22:06.177 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:22:06.177 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:22:06.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:22:06.261 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21277/300s
[INFO ] 2026-06-01 13:22:06.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:22:06.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:22:06.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:22:06.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:22:06.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:22:06.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 13:22:07.973 [16296] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:22:08.746 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 13:22:08.746 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425273,ok=425273,error=0, records=41
[INFO ] 2026-06-01 13:22:21.261 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:22:22.978 [16359] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:22:23.752 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 13:22:23.752 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425274,ok=425274,error=0, records=41
[INFO ] 2026-06-01 13:22:36.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:22:37.982 [16478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:22:38.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 13:22:38.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425275,ok=425275,error=0, records=41
[INFO ] 2026-06-01 13:22:43.593 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21275/300s
[INFO ] 2026-06-01 13:22:45.595 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21275/300s
[INFO ] 2026-06-01 13:22:51.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:22:52.901 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21275/300s
[WARN ] 2026-06-01 13:22:52.988 [16464] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:22:53.809 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 13:22:53.809 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425276,ok=425276,error=0, records=41
[INFO ] 2026-06-01 13:23:06.263 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:23:07.993 [16506] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:23:08.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 13:23:08.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425277,ok=425277,error=0, records=41
[INFO ] 2026-06-01 13:23:21.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:23:22.999 [16296] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:23:23.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-01 13:23:23.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425278,ok=425278,error=0, records=41
[INFO ] 2026-06-01 13:23:36.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 13:23:36.264 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 13:23:38.005 [16506] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:23:38.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 13:23:38.830 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425279,ok=425279,error=0, records=41
[INFO ] 2026-06-01 13:23:51.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:23:51.265 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 13:23:53.009 [16478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:23:53.837 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 13:23:53.837 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425280,ok=425280,error=0, records=41
[INFO ] 2026-06-01 13:24:06.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:24:08.014 [16560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:24:08.842 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 13:24:08.842 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425281,ok=425281,error=0, records=41
[INFO ] 2026-06-01 13:24:21.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:24:23.021 [16546] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:24:23.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 13:24:23.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425282,ok=425282,error=0, records=41
[INFO ] 2026-06-01 13:24:36.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:24:38.025 [16546] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:24:38.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 13:24:38.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425283,ok=425283,error=0, records=41
[INFO ] 2026-06-01 13:24:51.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:24:53.030 [16546] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:24:53.863 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 13:24:53.863 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425284,ok=425284,error=0, records=41
[INFO ] 2026-06-01 13:25:01.064 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21279/300s
[INFO ] 2026-06-01 13:25:04.534 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21270/300s
[INFO ] 2026-06-01 13:25:06.179 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859464},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:25:06.269 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.78MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-01 13:25:06.332 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:25:06.332 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 13:25:06.332 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:25:06.332 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:25:06.332 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:25:06.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:25:06.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:25:06.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:25:06.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:25:06.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:25:06.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 13:25:08.035 [16623] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:25:08.868 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 13:25:08.868 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425285,ok=425285,error=0, records=41
[INFO ] 2026-06-01 13:25:21.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:25:23.040 [16635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:25:23.878 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 13:25:23.878 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425286,ok=425286,error=0, records=41
[INFO ] 2026-06-01 13:25:23.878 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21266/300s
[INFO ] 2026-06-01 13:25:36.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:25:38.044 [16635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:25:38.883 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 13:25:38.883 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425287,ok=425287,error=0, records=41
[INFO ] 2026-06-01 13:25:41.514 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21279/300s
[INFO ] 2026-06-01 13:25:51.271 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:25:53.050 [16672] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:25:53.888 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 13:25:53.888 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425288,ok=425288,error=0, records=41
[INFO ] 2026-06-01 13:26:06.272 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:26:07.555 [16684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:26:08.893 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 13:26:08.894 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425289,ok=425289,error=0, records=41
[INFO ] 2026-06-01 13:26:17.022 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21266/300s
[INFO ] 2026-06-01 13:26:21.272 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:26:22.560 [16702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:26:23.899 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 13:26:23.899 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425290,ok=425290,error=0, records=41
[INFO ] 2026-06-01 13:26:36.273 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:26:37.073 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21275/300s
[WARN ] 2026-06-01 13:26:37.565 [16701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:26:38.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 13:26:38.921 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425291,ok=425291,error=0, records=41
[INFO ] 2026-06-01 13:26:51.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:26:52.570 [16740] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:26:53.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 13:26:53.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425292,ok=425292,error=0, records=41
[INFO ] 2026-06-01 13:27:06.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:27:06.274 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21278/300s
[WARN ] 2026-06-01 13:27:07.576 [16761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:27:08.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 13:27:08.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425293,ok=425293,error=0, records=41
[INFO ] 2026-06-01 13:27:21.275 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:27:22.582 [16780] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:27:23.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 13:27:23.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425294,ok=425294,error=0, records=41
[INFO ] 2026-06-01 13:27:36.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:27:37.587 [16787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:27:38.944 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 13:27:38.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425295,ok=425295,error=0, records=41
[INFO ] 2026-06-01 13:27:43.663 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21276/300s
[INFO ] 2026-06-01 13:27:45.665 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21276/300s
[INFO ] 2026-06-01 13:27:51.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:27:52.592 [16811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:27:52.970 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21276/300s
[INFO ] 2026-06-01 13:27:53.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 13:27:53.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425296,ok=425296,error=0, records=41
[INFO ] 2026-06-01 13:28:06.277 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:28:06.332 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17716/300s
[INFO ] 2026-06-01 13:28:06.334 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859388},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:28:06.491 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:28:06.491 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 13:28:06.491 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:28:06.492 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:28:06.492 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:28:06.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:28:06.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:28:06.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:28:06.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:28:06.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:28:06.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 13:28:07.597 [16799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:28:08.983 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 13:28:08.983 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425297,ok=425297,error=0, records=41
[INFO ] 2026-06-01 13:28:21.277 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:28:22.602 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:28:23.989 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 13:28:23.989 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425298,ok=425298,error=0, records=41
[INFO ] 2026-06-01 13:28:36.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:28:37.606 [16781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:28:38.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 13:28:38.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425299,ok=425299,error=0, records=41
[INFO ] 2026-06-01 13:28:51.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:28:52.610 [16781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:28:54.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 13:28:54.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425300,ok=425300,error=0, records=41
[INFO ] 2026-06-01 13:29:06.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:29:07.615 [16799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:29:09.007 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 13:29:09.008 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425301,ok=425301,error=0, records=41
[INFO ] 2026-06-01 13:29:21.280 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:29:22.620 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:29:24.013 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 13:29:24.013 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425302,ok=425302,error=0, records=41
[INFO ] 2026-06-01 13:29:36.281 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:29:37.625 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:29:39.018 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 13:29:39.018 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425303,ok=425303,error=0, records=41
[INFO ] 2026-06-01 13:29:51.281 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:29:52.631 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:29:54.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 13:29:54.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425304,ok=425304,error=0, records=41
[INFO ] 2026-06-01 13:30:01.067 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21280/300s
[INFO ] 2026-06-01 13:30:04.635 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21271/300s
[INFO ] 2026-06-01 13:30:06.282 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:30:07.637 [16799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:30:09.030 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 13:30:09.030 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425305,ok=425305,error=0, records=41
[INFO ] 2026-06-01 13:30:21.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:30:22.643 [16811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:30:24.041 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 13:30:24.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425306,ok=425306,error=0, records=41
[INFO ] 2026-06-01 13:30:24.041 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21267/300s
[INFO ] 2026-06-01 13:30:36.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:30:37.648 [16781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:30:39.048 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 13:30:39.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425307,ok=425307,error=0, records=41
[INFO ] 2026-06-01 13:30:41.521 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21280/300s
[INFO ] 2026-06-01 13:30:51.284 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:30:52.654 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:30:54.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 13:30:54.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425308,ok=425308,error=0, records=41
[INFO ] 2026-06-01 13:31:06.285 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:31:06.493 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859304},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:31:06.631 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:31:06.631 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 13:31:06.631 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:31:06.631 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:31:06.631 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:31:06.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:31:06.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:31:06.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:31:06.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:31:06.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:31:06.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 13:31:07.660 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:31:09.061 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 13:31:09.061 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425309,ok=425309,error=0, records=41
[INFO ] 2026-06-01 13:31:17.198 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21267/300s
[INFO ] 2026-06-01 13:31:21.285 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:31:22.664 [16811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:31:24.069 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 13:31:24.069 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425310,ok=425310,error=0, records=41
[INFO ] 2026-06-01 13:31:36.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:31:37.135 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21276/300s
[WARN ] 2026-06-01 13:31:37.669 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:31:39.075 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 13:31:39.075 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425311,ok=425311,error=0, records=41
[INFO ] 2026-06-01 13:31:51.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:31:52.673 [16811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:31:54.081 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 13:31:54.081 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425312,ok=425312,error=0, records=41
[INFO ] 2026-06-01 13:32:06.287 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:32:06.287 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21279/300s
[WARN ] 2026-06-01 13:32:07.678 [16781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:32:09.091 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 13:32:09.091 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425313,ok=425313,error=0, records=41
[INFO ] 2026-06-01 13:32:21.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:32:22.685 [16781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:32:24.098 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 13:32:24.098 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425314,ok=425314,error=0, records=41
[INFO ] 2026-06-01 13:32:36.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:32:37.689 [16811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:32:39.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 13:32:39.105 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425315,ok=425315,error=0, records=41
[INFO ] 2026-06-01 13:32:43.735 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21277/300s
[INFO ] 2026-06-01 13:32:45.737 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21277/300s
[INFO ] 2026-06-01 13:32:51.289 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:32:52.694 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:32:53.044 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21277/300s
[INFO ] 2026-06-01 13:32:54.113 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 13:32:54.113 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425316,ok=425316,error=0, records=41
[INFO ] 2026-06-01 13:33:06.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:33:07.701 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:33:09.118 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 13:33:09.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425317,ok=425317,error=0, records=41
[INFO ] 2026-06-01 13:33:21.291 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:33:22.705 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:33:24.124 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 13:33:24.124 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425318,ok=425318,error=0, records=41
[INFO ] 2026-06-01 13:33:36.292 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 13:33:36.292 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 13:33:37.710 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:33:39.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 13:33:39.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425319,ok=425319,error=0, records=41
[INFO ] 2026-06-01 13:33:51.293 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:33:52.715 [16811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:33:54.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 13:33:54.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425320,ok=425320,error=0, records=41
[INFO ] 2026-06-01 13:34:06.293 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:34:06.631 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17717/300s
[INFO ] 2026-06-01 13:34:06.633 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859224},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:34:06.788 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:34:06.789 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 13:34:06.789 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:34:06.789 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:34:06.789 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:34:06.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:34:06.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:34:06.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:34:06.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:34:06.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:34:06.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 13:34:07.719 [16811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:34:09.141 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 13:34:09.141 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425321,ok=425321,error=0, records=41
[INFO ] 2026-06-01 13:34:21.294 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:34:22.724 [16811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:34:24.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 13:34:24.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425322,ok=425322,error=0, records=41
[INFO ] 2026-06-01 13:34:36.295 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:34:37.729 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:34:39.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 13:34:39.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425323,ok=425323,error=0, records=41
[INFO ] 2026-06-01 13:34:51.295 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:34:52.734 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:34:54.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 13:34:54.158 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425324,ok=425324,error=0, records=41
[INFO ] 2026-06-01 13:35:01.070 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21281/300s
[INFO ] 2026-06-01 13:35:04.739 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21272/300s
[INFO ] 2026-06-01 13:35:06.296 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:35:07.741 [16781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:35:09.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 13:35:09.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425325,ok=425325,error=0, records=41
[INFO ] 2026-06-01 13:35:21.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:35:22.746 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:35:24.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 13:35:24.172 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425326,ok=425326,error=0, records=41
[INFO ] 2026-06-01 13:35:24.172 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21268/300s
[INFO ] 2026-06-01 13:35:36.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:35:37.750 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:35:39.179 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 13:35:39.179 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425327,ok=425327,error=0, records=41
[INFO ] 2026-06-01 13:35:41.528 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21281/300s
[INFO ] 2026-06-01 13:35:51.298 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:35:52.755 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:35:54.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 13:35:54.186 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425328,ok=425328,error=0, records=41
[INFO ] 2026-06-01 13:36:06.298 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:36:07.761 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:36:09.192 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 13:36:09.192 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425329,ok=425329,error=0, records=41
[INFO ] 2026-06-01 13:36:17.382 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21268/300s
[INFO ] 2026-06-01 13:36:21.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:36:22.766 [16799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:36:24.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 13:36:24.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425330,ok=425330,error=0, records=41
[INFO ] 2026-06-01 13:36:36.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:36:37.192 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21277/300s
[WARN ] 2026-06-01 13:36:37.770 [16781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:36:39.203 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-01 13:36:39.203 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425331,ok=425331,error=0, records=41
[INFO ] 2026-06-01 13:36:51.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:36:52.775 [16811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:36:54.208 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-01 13:36:54.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425332,ok=425332,error=0, records=41
[INFO ] 2026-06-01 13:37:06.301 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:37:06.301 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21280/300s
[INFO ] 2026-06-01 13:37:06.790 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859148},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:37:06.969 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:37:06.969 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 13:37:06.969 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:37:06.969 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:37:06.969 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:37:07.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:37:07.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:37:07.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:37:07.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:37:07.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:37:07.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 13:37:07.780 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:37:09.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-01 13:37:09.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425333,ok=425333,error=0, records=41
[INFO ] 2026-06-01 13:37:21.302 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:37:22.786 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:37:24.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-01 13:37:24.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425334,ok=425334,error=0, records=41
[INFO ] 2026-06-01 13:37:36.302 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:37:37.791 [16811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:37:39.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-01 13:37:39.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425335,ok=425335,error=0, records=41
[INFO ] 2026-06-01 13:37:43.796 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21278/300s
[INFO ] 2026-06-01 13:37:45.798 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21278/300s
[INFO ] 2026-06-01 13:37:51.303 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:37:52.795 [16799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:37:53.105 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21278/300s
[INFO ] 2026-06-01 13:37:54.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 13:37:54.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425336,ok=425336,error=0, records=41
[INFO ] 2026-06-01 13:38:06.303 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:38:07.801 [16781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:38:09.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 13:38:09.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425337,ok=425337,error=0, records=41
[INFO ] 2026-06-01 13:38:21.304 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:38:22.807 [17371] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:38:24.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 13:38:24.241 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425338,ok=425338,error=0, records=41
[INFO ] 2026-06-01 13:38:36.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:38:37.812 [16781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:38:39.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 13:38:39.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425339,ok=425339,error=0, records=41
[INFO ] 2026-06-01 13:38:51.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:38:51.305 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 13:38:52.817 [17376] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:38:54.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 13:38:54.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425340,ok=425340,error=0, records=41
[INFO ] 2026-06-01 13:39:06.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:39:07.822 [17371] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:39:09.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 13:39:09.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425341,ok=425341,error=0, records=41
[INFO ] 2026-06-01 13:39:21.308 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:39:22.829 [17391] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:39:24.345 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 13:39:24.345 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425342,ok=425342,error=0, records=41
[INFO ] 2026-06-01 13:39:36.308 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:39:37.834 [17420] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:39:39.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 13:39:39.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425343,ok=425343,error=0, records=41
[INFO ] 2026-06-01 13:39:51.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:39:52.840 [17420] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:39:54.356 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 13:39:54.356 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425344,ok=425344,error=0, records=41
[INFO ] 2026-06-01 13:40:01.074 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21282/300s
[INFO ] 2026-06-01 13:40:04.844 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21273/300s
[INFO ] 2026-06-01 13:40:06.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:40:06.969 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17718/300s
[INFO ] 2026-06-01 13:40:06.971 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859068},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:40:07.134 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:40:07.134 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 13:40:07.134 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:40:07.134 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:40:07.135 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:40:07.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:40:07.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:40:07.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:40:07.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:40:07.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:40:07.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 13:40:07.845 [16793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:40:09.362 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 13:40:09.362 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425345,ok=425345,error=0, records=41
[INFO ] 2026-06-01 13:40:21.310 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:40:22.851 [17475] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:40:24.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 13:40:24.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425346,ok=425346,error=0, records=41
[INFO ] 2026-06-01 13:40:24.369 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21269/300s
[INFO ] 2026-06-01 13:40:36.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:40:37.856 [17420] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:40:39.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 13:40:39.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425347,ok=425347,error=0, records=41
[INFO ] 2026-06-01 13:40:41.535 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21282/300s
[INFO ] 2026-06-01 13:40:51.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:40:52.861 [17475] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:40:54.382 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 13:40:54.382 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425348,ok=425348,error=0, records=41
[INFO ] 2026-06-01 13:41:06.312 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:41:07.866 [17475] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:41:09.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 13:41:09.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425349,ok=425349,error=0, records=41
[INFO ] 2026-06-01 13:41:17.571 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21269/300s
[INFO ] 2026-06-01 13:41:21.313 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:41:22.870 [17531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:41:24.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 13:41:24.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425350,ok=425350,error=0, records=41
[INFO ] 2026-06-01 13:41:36.313 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:41:37.251 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21278/300s
[WARN ] 2026-06-01 13:41:37.875 [17564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:41:39.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 13:41:39.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425351,ok=425351,error=0, records=41
[INFO ] 2026-06-01 13:41:51.314 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:41:52.880 [17559] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:41:54.477 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 13:41:54.477 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425352,ok=425352,error=0, records=41
[INFO ] 2026-06-01 13:42:06.315 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:42:06.315 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21281/300s
[WARN ] 2026-06-01 13:42:07.885 [17475] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:42:09.483 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 13:42:09.483 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425353,ok=425353,error=0, records=41
[INFO ] 2026-06-01 13:42:21.315 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:42:22.891 [17597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:42:24.489 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 13:42:24.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425354,ok=425354,error=0, records=41
[INFO ] 2026-06-01 13:42:36.316 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:42:37.895 [17631] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:42:39.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 13:42:39.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425355,ok=425355,error=0, records=41
[INFO ] 2026-06-01 13:42:43.867 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21279/300s
[INFO ] 2026-06-01 13:42:45.869 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21279/300s
[INFO ] 2026-06-01 13:42:51.317 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:42:52.900 [17643] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:42:53.174 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21279/300s
[INFO ] 2026-06-01 13:42:54.501 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 13:42:54.501 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425356,ok=425356,error=0, records=41
[INFO ] 2026-06-01 13:43:06.317 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:43:07.136 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858988},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:43:07.302 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:43:07.303 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 13:43:07.303 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:43:07.303 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:43:07.303 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:43:07.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:43:07.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:43:07.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:43:07.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:43:07.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:43:07.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 13:43:07.905 [17669] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:43:09.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 13:43:09.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425357,ok=425357,error=0, records=41
[INFO ] 2026-06-01 13:43:21.318 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:43:22.911 [17657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:43:24.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 13:43:24.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425358,ok=425358,error=0, records=41
[INFO ] 2026-06-01 13:43:36.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 13:43:36.319 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 13:43:37.917 [17697] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:43:39.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 13:43:39.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425359,ok=425359,error=0, records=41
[INFO ] 2026-06-01 13:43:51.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:43:52.923 [17715] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:43:54.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-01 13:43:54.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425360,ok=425360,error=0, records=41
[INFO ] 2026-06-01 13:44:06.320 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:44:07.929 [17736] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:44:09.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 13:44:09.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425361,ok=425361,error=0, records=41
[INFO ] 2026-06-01 13:44:21.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:44:22.935 [17736] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:44:24.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 13:44:24.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425362,ok=425362,error=0, records=41
[INFO ] 2026-06-01 13:44:36.322 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:44:37.940 [17764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:44:39.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 13:44:39.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425363,ok=425363,error=0, records=41
[INFO ] 2026-06-01 13:44:51.322 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:44:52.946 [17781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:44:54.560 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 13:44:54.560 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425364,ok=425364,error=0, records=41
[INFO ] 2026-06-01 13:45:01.077 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21283/300s
[INFO ] 2026-06-01 13:45:04.950 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21274/300s
[INFO ] 2026-06-01 13:45:06.324 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:45:07.951 [17796] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:45:09.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 13:45:09.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425365,ok=425365,error=0, records=41
[INFO ] 2026-06-01 13:45:21.325 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:45:22.956 [17810] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:45:24.573 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 13:45:24.573 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425366,ok=425366,error=0, records=41
[INFO ] 2026-06-01 13:45:24.573 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21270/300s
[INFO ] 2026-06-01 13:45:36.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:45:37.961 [17791] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:45:39.580 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-01 13:45:39.580 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425367,ok=425367,error=0, records=41
[INFO ] 2026-06-01 13:45:41.542 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21283/300s
[INFO ] 2026-06-01 13:45:51.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:45:52.966 [17781] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:45:54.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 13:45:54.586 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425368,ok=425368,error=0, records=41
[INFO ] 2026-06-01 13:46:06.327 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:46:07.303 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17719/300s
[INFO ] 2026-06-01 13:46:07.304 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858908},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:46:07.487 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:46:07.487 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 13:46:07.487 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:46:07.487 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:46:07.487 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:46:07.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:46:07.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:46:07.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:46:07.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:46:07.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:46:07.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 13:46:07.972 [17764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:46:09.593 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 13:46:09.593 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425369,ok=425369,error=0, records=41
[INFO ] 2026-06-01 13:46:17.757 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21270/300s
[INFO ] 2026-06-01 13:46:21.328 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:46:22.977 [17791] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:46:24.598 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 13:46:24.598 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425370,ok=425370,error=0, records=41
[INFO ] 2026-06-01 13:46:36.328 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:46:37.310 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21279/300s
[WARN ] 2026-06-01 13:46:37.982 [17867] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:46:39.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 13:46:39.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425371,ok=425371,error=0, records=41
[INFO ] 2026-06-01 13:46:51.329 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:46:52.988 [17895] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:46:54.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 13:46:54.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425372,ok=425372,error=0, records=41
[INFO ] 2026-06-01 13:47:06.329 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:47:06.330 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21282/300s
[WARN ] 2026-06-01 13:47:07.993 [17867] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:47:09.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 13:47:09.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425373,ok=425373,error=0, records=41
[INFO ] 2026-06-01 13:47:21.330 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:47:22.999 [17764] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:47:24.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 13:47:24.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425374,ok=425374,error=0, records=41
[INFO ] 2026-06-01 13:47:36.331 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:47:38.004 [17923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:47:39.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 13:47:39.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425375,ok=425375,error=0, records=41
[INFO ] 2026-06-01 13:47:43.938 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21280/300s
[INFO ] 2026-06-01 13:47:45.940 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21280/300s
[INFO ] 2026-06-01 13:47:51.331 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:47:53.009 [17909] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:47:53.247 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21280/300s
[INFO ] 2026-06-01 13:47:54.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 13:47:54.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425376,ok=425376,error=0, records=41
[INFO ] 2026-06-01 13:48:06.332 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:48:08.015 [17923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:48:09.677 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 13:48:09.677 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425377,ok=425377,error=0, records=41
[INFO ] 2026-06-01 13:48:21.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:48:23.021 [17909] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:48:24.684 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 13:48:24.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425378,ok=425378,error=0, records=41
[INFO ] 2026-06-01 13:48:36.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:48:38.026 [17867] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:48:39.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 13:48:39.691 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425379,ok=425379,error=0, records=41
[INFO ] 2026-06-01 13:48:51.334 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:48:53.032 [17993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:48:54.819 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 13:48:54.819 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425380,ok=425380,error=0, records=41
[INFO ] 2026-06-01 13:49:06.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:49:07.489 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858828},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:49:07.656 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:49:07.656 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 13:49:07.656 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:49:07.656 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:49:07.656 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:49:07.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:49:07.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:49:07.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:49:07.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:49:07.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:49:07.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 13:49:08.036 [17909] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:49:09.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 13:49:09.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425381,ok=425381,error=0, records=41
[INFO ] 2026-06-01 13:49:21.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:49:23.042 [18028] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:49:24.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 13:49:24.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425382,ok=425382,error=0, records=41
[INFO ] 2026-06-01 13:49:36.336 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:49:38.047 [18022] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:49:39.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 13:49:39.839 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425383,ok=425383,error=0, records=41
[INFO ] 2026-06-01 13:49:51.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:49:53.053 [18022] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:49:54.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 13:49:54.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425384,ok=425384,error=0, records=41
[INFO ] 2026-06-01 13:50:01.081 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21284/300s
[INFO ] 2026-06-01 13:50:05.057 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21275/300s
[INFO ] 2026-06-01 13:50:06.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:50:07.558 [18092] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:50:09.851 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 13:50:09.851 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425385,ok=425385,error=0, records=41
[INFO ] 2026-06-01 13:50:21.338 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:50:22.563 [18100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:50:24.859 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 13:50:24.859 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425386,ok=425386,error=0, records=41
[INFO ] 2026-06-01 13:50:24.859 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21271/300s
[INFO ] 2026-06-01 13:50:36.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:50:37.570 [18100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:50:39.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 13:50:39.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425387,ok=425387,error=0, records=41
[INFO ] 2026-06-01 13:50:41.548 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21284/300s
[INFO ] 2026-06-01 13:50:51.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:50:52.577 [18149] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:50:54.870 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 13:50:54.870 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425388,ok=425388,error=0, records=41
[INFO ] 2026-06-01 13:51:06.340 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:51:07.582 [18141] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:51:09.876 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 13:51:09.876 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425389,ok=425389,error=0, records=41
[INFO ] 2026-06-01 13:51:17.945 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21271/300s
[INFO ] 2026-06-01 13:51:21.340 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:51:22.586 [18129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:51:24.881 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 13:51:24.881 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425390,ok=425390,error=0, records=41
[INFO ] 2026-06-01 13:51:36.341 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:51:37.370 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21280/300s
[WARN ] 2026-06-01 13:51:37.591 [18182] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:51:39.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 13:51:39.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425391,ok=425391,error=0, records=41
[INFO ] 2026-06-01 13:51:51.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:51:52.596 [18159] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:51:54.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 13:51:54.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425392,ok=425392,error=0, records=41
[INFO ] 2026-06-01 13:52:06.343 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:52:06.343 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21283/300s
[WARN ] 2026-06-01 13:52:07.602 [18177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:52:07.657 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17720/300s
[INFO ] 2026-06-01 13:52:07.658 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858752},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:52:07.824 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:52:07.825 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 13:52:07.825 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:52:07.825 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:52:07.825 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:52:07.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:52:07.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:52:07.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:52:07.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:52:07.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:52:07.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:52:09.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 13:52:09.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425393,ok=425393,error=0, records=41
[INFO ] 2026-06-01 13:52:21.343 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:52:22.607 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:52:24.906 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 13:52:24.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425394,ok=425394,error=0, records=41
[INFO ] 2026-06-01 13:52:36.344 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:52:37.613 [18159] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:52:39.912 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 13:52:39.912 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425395,ok=425395,error=0, records=41
[INFO ] 2026-06-01 13:52:44.024 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21281/300s
[INFO ] 2026-06-01 13:52:46.026 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21281/300s
[INFO ] 2026-06-01 13:52:51.345 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:52:52.618 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:52:53.332 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21281/300s
[INFO ] 2026-06-01 13:52:54.917 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 13:52:54.917 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425396,ok=425396,error=0, records=41
[INFO ] 2026-06-01 13:53:06.345 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:53:07.622 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:53:09.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 13:53:09.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425397,ok=425397,error=0, records=41
[INFO ] 2026-06-01 13:53:21.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:53:22.629 [18159] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:53:24.927 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 13:53:24.927 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425398,ok=425398,error=0, records=41
[INFO ] 2026-06-01 13:53:36.347 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 13:53:36.347 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 13:53:37.634 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:53:39.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 13:53:39.933 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425399,ok=425399,error=0, records=41
[INFO ] 2026-06-01 13:53:51.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:53:51.348 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 13:53:52.640 [18205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:53:54.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 13:53:54.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425400,ok=425400,error=0, records=41
[INFO ] 2026-06-01 13:54:06.349 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:54:07.646 [18215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:54:09.947 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 13:54:09.947 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425401,ok=425401,error=0, records=41
[INFO ] 2026-06-01 13:54:21.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:54:22.650 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:54:24.953 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 13:54:24.953 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425402,ok=425402,error=0, records=41
[INFO ] 2026-06-01 13:54:36.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:54:37.655 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:54:39.958 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 13:54:39.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425403,ok=425403,error=0, records=41
[INFO ] 2026-06-01 13:54:51.351 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:54:52.662 [18159] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:54:54.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 13:54:54.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425404,ok=425404,error=0, records=41
[INFO ] 2026-06-01 13:55:01.085 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21285/300s
[INFO ] 2026-06-01 13:55:05.166 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21276/300s
[INFO ] 2026-06-01 13:55:06.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:55:07.667 [18159] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:55:07.827 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858668},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:55:08.041 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:55:08.041 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 13:55:08.041 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:55:08.041 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:55:08.041 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:55:08.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:55:08.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:55:08.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:55:08.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:55:08.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:55:08.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:55:09.976 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 13:55:09.976 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425405,ok=425405,error=0, records=41
[INFO ] 2026-06-01 13:55:21.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:55:22.672 [18205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:55:24.983 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 13:55:24.983 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425406,ok=425406,error=0, records=41
[INFO ] 2026-06-01 13:55:24.983 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21272/300s
[INFO ] 2026-06-01 13:55:36.353 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:55:37.679 [18159] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:55:39.988 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-01 13:55:39.988 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425407,ok=425407,error=0, records=41
[INFO ] 2026-06-01 13:55:41.556 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21285/300s
[INFO ] 2026-06-01 13:55:51.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:55:52.685 [18159] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:55:54.994 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 13:55:54.994 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425408,ok=425408,error=0, records=41
[INFO ] 2026-06-01 13:56:06.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:56:07.691 [18215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:56:10.001 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 13:56:10.001 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425409,ok=425409,error=0, records=41
[INFO ] 2026-06-01 13:56:18.133 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21272/300s
[INFO ] 2026-06-01 13:56:21.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:56:22.696 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:56:25.007 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 13:56:25.007 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425410,ok=425410,error=0, records=41
[INFO ] 2026-06-01 13:56:36.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:56:37.427 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21281/300s
[WARN ] 2026-06-01 13:56:37.700 [18215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:56:40.011 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 13:56:40.011 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425411,ok=425411,error=0, records=41
[INFO ] 2026-06-01 13:56:51.356 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:56:52.706 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:56:55.017 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 13:56:55.017 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425412,ok=425412,error=0, records=41
[INFO ] 2026-06-01 13:57:06.356 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 13:57:06.357 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21284/300s
[WARN ] 2026-06-01 13:57:07.712 [18215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:57:10.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 13:57:10.023 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425413,ok=425413,error=0, records=41
[INFO ] 2026-06-01 13:57:21.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:57:22.717 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:57:25.030 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 13:57:25.030 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425414,ok=425414,error=0, records=41
[INFO ] 2026-06-01 13:57:36.358 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:57:37.722 [18177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:57:40.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 13:57:40.036 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425415,ok=425415,error=0, records=41
[INFO ] 2026-06-01 13:57:44.088 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21282/300s
[INFO ] 2026-06-01 13:57:46.090 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21282/300s
[INFO ] 2026-06-01 13:57:51.358 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:57:52.727 [18205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:57:53.396 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21282/300s
[INFO ] 2026-06-01 13:57:55.043 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 13:57:55.043 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425416,ok=425416,error=0, records=41
[INFO ] 2026-06-01 13:58:06.359 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:58:07.732 [18205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:58:08.041 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17721/300s
[INFO ] 2026-06-01 13:58:08.043 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858592},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 13:58:08.207 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 13:58:08.207 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 13:58:08.207 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 13:58:08.207 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 13:58:08.207 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 13:58:08.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:58:08.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 13:58:08.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:58:08.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 13:58:08.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:58:08.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 13:58:10.112 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 13:58:10.112 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425417,ok=425417,error=0, records=41
[INFO ] 2026-06-01 13:58:21.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:58:22.738 [18159] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:58:25.121 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 13:58:25.121 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425418,ok=425418,error=0, records=41
[INFO ] 2026-06-01 13:58:36.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:58:37.744 [18205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:58:40.126 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 13:58:40.126 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425419,ok=425419,error=0, records=41
[INFO ] 2026-06-01 13:58:51.361 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:58:52.749 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:58:55.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 13:58:55.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425420,ok=425420,error=0, records=41
[INFO ] 2026-06-01 13:59:06.362 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:59:07.754 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:59:10.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 13:59:10.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425421,ok=425421,error=0, records=41
[INFO ] 2026-06-01 13:59:21.362 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:59:22.759 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:59:25.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 13:59:25.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425422,ok=425422,error=0, records=41
[INFO ] 2026-06-01 13:59:36.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:59:37.765 [18177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:59:40.183 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 13:59:40.183 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425423,ok=425423,error=0, records=41
[INFO ] 2026-06-01 13:59:51.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 13:59:52.770 [18215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 13:59:55.188 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 13:59:55.188 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425424,ok=425424,error=0, records=41
[INFO ] 2026-06-01 14:00:01.089 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21286/300s
[INFO ] 2026-06-01 14:00:05.274 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21277/300s
[INFO ] 2026-06-01 14:00:06.364 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:00:07.775 [18205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:00:10.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 14:00:10.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425425,ok=425425,error=0, records=41
[INFO ] 2026-06-01 14:00:21.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:00:22.780 [18177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:00:25.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 14:00:25.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425426,ok=425426,error=0, records=41
[INFO ] 2026-06-01 14:00:25.199 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21273/300s
[INFO ] 2026-06-01 14:00:36.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:00:37.785 [18215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:00:40.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 14:00:40.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425427,ok=425427,error=0, records=41
[INFO ] 2026-06-01 14:00:41.562 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21286/300s
[INFO ] 2026-06-01 14:00:51.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:00:52.791 [18199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:00:55.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-01 14:00:55.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425428,ok=425428,error=0, records=41
[INFO ] 2026-06-01 14:01:06.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:01:07.796 [18177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:01:08.209 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858520},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:01:08.365 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:01:08.365 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 14:01:08.365 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:01:08.365 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:01:08.365 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:01:08.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:01:08.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:01:08.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:01:08.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:01:08.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:01:08.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:01:10.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 14:01:10.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425429,ok=425429,error=0, records=41
[INFO ] 2026-06-01 14:01:18.311 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21273/300s
[INFO ] 2026-06-01 14:01:21.367 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:01:22.801 [18215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:01:25.223 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 14:01:25.223 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425430,ok=425430,error=0, records=41
[INFO ] 2026-06-01 14:01:36.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:01:37.482 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21282/300s
[WARN ] 2026-06-01 14:01:37.807 [18215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:01:40.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 14:01:40.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425431,ok=425431,error=0, records=41
[INFO ] 2026-06-01 14:01:51.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:01:52.812 [18205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:01:55.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 14:01:55.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425432,ok=425432,error=0, records=41
[INFO ] 2026-06-01 14:02:06.369 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:02:06.369 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21285/300s
[WARN ] 2026-06-01 14:02:07.817 [18205] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:02:10.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 14:02:10.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425433,ok=425433,error=0, records=41
[INFO ] 2026-06-01 14:02:21.370 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:02:22.822 [18775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:02:25.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 14:02:25.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425434,ok=425434,error=0, records=41
[INFO ] 2026-06-01 14:02:36.370 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:02:37.827 [18804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:02:40.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 14:02:40.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425435,ok=425435,error=0, records=41
[INFO ] 2026-06-01 14:02:44.153 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21283/300s
[INFO ] 2026-06-01 14:02:46.155 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21283/300s
[INFO ] 2026-06-01 14:02:51.371 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:02:52.833 [18790] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:02:53.461 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21283/300s
[INFO ] 2026-06-01 14:02:55.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 14:02:55.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425436,ok=425436,error=0, records=41
[INFO ] 2026-06-01 14:03:06.371 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:03:07.837 [18846] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:03:10.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-01 14:03:10.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425437,ok=425437,error=0, records=41
[INFO ] 2026-06-01 14:03:21.372 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:03:22.844 [18832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:03:25.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 14:03:25.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425438,ok=425438,error=0, records=41
[INFO ] 2026-06-01 14:03:36.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 14:03:36.373 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 14:03:37.849 [18832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:03:40.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 14:03:40.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425439,ok=425439,error=0, records=41
[INFO ] 2026-06-01 14:03:51.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:03:52.855 [18832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:03:55.296 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-01 14:03:55.296 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425440,ok=425440,error=0, records=41
[INFO ] 2026-06-01 14:04:06.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:04:07.861 [18884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:04:08.365 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17722/300s
[INFO ] 2026-06-01 14:04:08.367 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858436},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:04:08.524 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:04:08.524 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 14:04:08.524 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:04:08.525 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:04:08.525 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:04:08.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:04:08.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:04:08.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:04:08.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:04:08.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:04:08.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:04:10.304 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 14:04:10.304 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425441,ok=425441,error=0, records=41
[INFO ] 2026-06-01 14:04:21.375 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=29.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:04:22.866 [18846] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:04:25.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-01 14:04:25.309 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425442,ok=425442,error=0, records=41
[INFO ] 2026-06-01 14:04:36.375 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:04:37.870 [18846] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:04:40.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 14:04:40.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425443,ok=425443,error=0, records=41
[INFO ] 2026-06-01 14:04:51.376 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:04:52.875 [18832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:04:55.370 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-01 14:04:55.370 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425444,ok=425444,error=0, records=41
[INFO ] 2026-06-01 14:05:01.093 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21287/300s
[INFO ] 2026-06-01 14:05:05.379 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21278/300s
[INFO ] 2026-06-01 14:05:06.376 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:05:07.881 [18953] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:05:10.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 14:05:10.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425445,ok=425445,error=0, records=41
[INFO ] 2026-06-01 14:05:21.377 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:05:22.886 [18942] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:05:25.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 14:05:25.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425446,ok=425446,error=0, records=41
[INFO ] 2026-06-01 14:05:25.381 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21274/300s
[INFO ] 2026-06-01 14:05:36.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:05:37.890 [18991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:05:40.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 14:05:40.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425447,ok=425447,error=0, records=41
[INFO ] 2026-06-01 14:05:41.569 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21287/300s
[INFO ] 2026-06-01 14:05:51.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:05:52.896 [19012] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:05:55.391 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 14:05:55.391 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425448,ok=425448,error=0, records=41
[INFO ] 2026-06-01 14:06:06.379 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:06:07.902 [19026] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:06:10.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 14:06:10.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425449,ok=425449,error=0, records=41
[INFO ] 2026-06-01 14:06:18.495 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21274/300s
[INFO ] 2026-06-01 14:06:21.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:06:22.908 [19026] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:06:25.431 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 14:06:25.431 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425450,ok=425450,error=0, records=41
[INFO ] 2026-06-01 14:06:36.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:06:37.542 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21283/300s
[WARN ] 2026-06-01 14:06:37.914 [19026] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:06:40.436 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 14:06:40.436 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425451,ok=425451,error=0, records=41
[INFO ] 2026-06-01 14:06:51.381 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:06:52.919 [19075] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:06:55.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 14:06:55.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425452,ok=425452,error=0, records=41
[INFO ] 2026-06-01 14:07:06.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:07:06.382 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21286/300s
[WARN ] 2026-06-01 14:07:07.925 [19075] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:07:08.526 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858364},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:07:08.700 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:07:08.700 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 14:07:08.700 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:07:08.700 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:07:08.700 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:07:08.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:07:08.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:07:08.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:07:08.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:07:08.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:07:08.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:07:10.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 14:07:10.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425453,ok=425453,error=0, records=41
[INFO ] 2026-06-01 14:07:21.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:07:22.931 [19026] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:07:25.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 14:07:25.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425454,ok=425454,error=0, records=41
[INFO ] 2026-06-01 14:07:36.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:07:37.936 [19121] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:07:40.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 14:07:40.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425455,ok=425455,error=0, records=41
[INFO ] 2026-06-01 14:07:44.221 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21284/300s
[INFO ] 2026-06-01 14:07:46.223 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21284/300s
[INFO ] 2026-06-01 14:07:51.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:07:52.943 [19143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:07:53.530 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21284/300s
[INFO ] 2026-06-01 14:07:55.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 14:07:55.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425456,ok=425456,error=0, records=41
[INFO ] 2026-06-01 14:08:06.384 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:08:07.948 [19160] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:08:10.626 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-01 14:08:10.626 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425457,ok=425457,error=0, records=41
[INFO ] 2026-06-01 14:08:21.385 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:08:22.953 [19154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:08:25.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 14:08:25.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425458,ok=425458,error=0, records=41
[INFO ] 2026-06-01 14:08:36.385 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:08:37.960 [19143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:08:40.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 14:08:40.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425459,ok=425459,error=0, records=41
[INFO ] 2026-06-01 14:08:51.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:08:51.386 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 14:08:52.964 [19198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:08:55.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 14:08:55.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425460,ok=425460,error=0, records=41
[INFO ] 2026-06-01 14:09:06.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:09:07.969 [19154] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:09:10.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 14:09:10.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425461,ok=425461,error=0, records=41
[INFO ] 2026-06-01 14:09:21.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:09:22.973 [19226] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:09:25.657 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 14:09:25.657 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425462,ok=425462,error=0, records=41
[INFO ] 2026-06-01 14:09:36.389 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:09:37.979 [19212] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:09:40.663 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 14:09:40.663 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425463,ok=425463,error=0, records=41
[INFO ] 2026-06-01 14:09:51.389 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:09:52.984 [19137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:09:55.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 14:09:55.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425464,ok=425464,error=0, records=41
[INFO ] 2026-06-01 14:10:01.097 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21288/300s
[INFO ] 2026-06-01 14:10:05.488 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21279/300s
[INFO ] 2026-06-01 14:10:06.390 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:10:07.989 [19274] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:10:08.700 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17723/300s
[INFO ] 2026-06-01 14:10:08.702 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858256},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:10:08.864 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:10:08.864 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 14:10:08.864 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:10:08.864 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:10:08.864 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:10:08.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:10:08.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:10:08.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:10:08.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:10:08.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:10:08.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:10:10.713 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-01 14:10:10.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425465,ok=425465,error=0, records=41
[INFO ] 2026-06-01 14:10:21.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=29.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:10:22.994 [19274] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:10:25.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-01 14:10:25.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425466,ok=425466,error=0, records=41
[INFO ] 2026-06-01 14:10:25.720 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21275/300s
[INFO ] 2026-06-01 14:10:36.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:10:38.000 [19289] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:10:40.724 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 14:10:40.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425467,ok=425467,error=0, records=41
[INFO ] 2026-06-01 14:10:41.576 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21288/300s
[INFO ] 2026-06-01 14:10:51.392 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:10:53.006 [19137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:10:55.731 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 14:10:55.731 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425468,ok=425468,error=0, records=41
[INFO ] 2026-06-01 14:11:06.392 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:11:08.010 [19289] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:11:10.736 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-01 14:11:10.736 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425469,ok=425469,error=0, records=41
[INFO ] 2026-06-01 14:11:18.679 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21275/300s
[INFO ] 2026-06-01 14:11:21.393 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:11:23.015 [19345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:11:25.741 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 14:11:25.741 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425470,ok=425470,error=0, records=41
[INFO ] 2026-06-01 14:11:36.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:11:37.595 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21284/300s
[WARN ] 2026-06-01 14:11:38.020 [19331] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:11:40.746 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 14:11:40.746 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425471,ok=425471,error=0, records=41
[INFO ] 2026-06-01 14:11:51.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:11:53.026 [19331] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:11:55.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-01 14:11:55.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425472,ok=425472,error=0, records=41
[INFO ] 2026-06-01 14:12:06.395 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:12:06.395 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21287/300s
[WARN ] 2026-06-01 14:12:08.032 [19254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:12:10.759 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 14:12:10.759 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425473,ok=425473,error=0, records=41
[INFO ] 2026-06-01 14:12:21.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:12:23.039 [19402] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:12:25.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 14:12:25.767 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425474,ok=425474,error=0, records=41
[INFO ] 2026-06-01 14:12:36.397 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:12:38.045 [19560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:12:40.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 14:12:40.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425475,ok=425475,error=0, records=41
[INFO ] 2026-06-01 14:12:44.256 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21285/300s
[INFO ] 2026-06-01 14:12:46.255 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21285/300s
[INFO ] 2026-06-01 14:12:51.397 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:12:53.050 [19331] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:12:53.562 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21285/300s
[INFO ] 2026-06-01 14:12:55.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 14:12:55.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425476,ok=425476,error=0, records=41
[INFO ] 2026-06-01 14:13:06.398 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:13:07.555 [19583] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:13:08.865 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858160},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:13:09.022 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:13:09.022 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 14:13:09.023 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:13:09.023 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:13:09.023 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:13:09.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:13:09.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:13:09.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:13:09.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:13:09.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:13:09.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:13:10.782 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 14:13:10.782 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425477,ok=425477,error=0, records=41
[INFO ] 2026-06-01 14:13:21.398 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:13:22.560 [19589] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:13:25.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 14:13:25.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425478,ok=425478,error=0, records=41
[WARN ] 2026-06-01 14:13:32.563 [19613] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/15241/stat), No such file or directory
[WARN ] 2026-06-01 14:13:32.564 [19613] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/15242/stat), No such file or directory
[INFO ] 2026-06-01 14:13:36.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 14:13:36.399 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 14:13:37.564 [19626] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:13:40.793 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 14:13:40.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425479,ok=425479,error=0, records=41
[WARN ] 2026-06-01 14:13:47.568 [19628] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/15241/stat), No such file or directory
[WARN ] 2026-06-01 14:13:47.568 [19628] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/15242/stat), No such file or directory
[INFO ] 2026-06-01 14:13:51.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:13:52.576 [19628] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:13:55.798 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-01 14:13:55.798 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425480,ok=425480,error=0, records=41
[INFO ] 2026-06-01 14:14:06.400 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:14:07.593 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:14:10.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 14:14:10.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425481,ok=425481,error=0, records=41
[INFO ] 2026-06-01 14:14:21.401 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:14:22.604 [19613] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:14:25.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 14:14:25.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425482,ok=425482,error=0, records=41
[INFO ] 2026-06-01 14:14:36.401 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:14:37.609 [19684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:14:40.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 14:14:40.814 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425483,ok=425483,error=0, records=41
[WARN ] 2026-06-01 14:14:47.614 [19613] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19488/stat), No such file or directory
[WARN ] 2026-06-01 14:14:47.619 [19613] cloudMonitor/base_collect.cpp:241: SicGetProcessState failed, err: FeadFileContent(/proc/19489/stat), No such file or directory
[INFO ] 2026-06-01 14:14:51.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:14:52.615 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:14:55.819 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 14:14:55.819 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425484,ok=425484,error=0, records=41
[INFO ] 2026-06-01 14:15:01.101 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21289/300s
[INFO ] 2026-06-01 14:15:05.619 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21280/300s
[INFO ] 2026-06-01 14:15:06.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:15:07.621 [19628] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:15:10.823 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 14:15:10.823 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425485,ok=425485,error=0, records=41
[INFO ] 2026-06-01 14:15:21.403 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:15:22.640 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:15:25.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 14:15:25.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425486,ok=425486,error=0, records=41
[INFO ] 2026-06-01 14:15:25.829 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21276/300s
[WARN ] 2026-06-01 14:15:32.644 [19678] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19519/stat), No such file or directory
[INFO ] 2026-06-01 14:15:36.404 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:15:37.664 [19613] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:15:40.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 14:15:40.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425487,ok=425487,error=0, records=41
[INFO ] 2026-06-01 14:15:41.586 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21289/300s
[WARN ] 2026-06-01 14:15:47.678 [19678] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19519/stat), No such file or directory
[WARN ] 2026-06-01 14:15:47.678 [19678] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19520/stat), No such file or directory
[WARN ] 2026-06-01 14:15:47.678 [19678] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19521/stat), No such file or directory
[WARN ] 2026-06-01 14:15:47.678 [19678] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19467/stat), No such file or directory
[INFO ] 2026-06-01 14:15:51.404 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:15:52.678 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:15:55.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 14:15:55.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425488,ok=425488,error=0, records=41
[INFO ] 2026-06-01 14:16:06.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:16:07.705 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:16:09.023 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17724/300s
[INFO ] 2026-06-01 14:16:09.024 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853456},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:16:09.188 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:16:09.188 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 14:16:09.188 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:16:09.188 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:16:09.188 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:16:09.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:16:09.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:16:09.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:16:09.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:16:09.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:16:09.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:16:10.903 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 14:16:10.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425489,ok=425489,error=0, records=41
[INFO ] 2026-06-01 14:16:18.962 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21276/300s
[INFO ] 2026-06-01 14:16:21.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:16:22.710 [19628] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:16:25.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 14:16:25.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425490,ok=425490,error=0, records=41
[INFO ] 2026-06-01 14:16:36.406 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:16:37.693 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21285/300s
[WARN ] 2026-06-01 14:16:37.728 [19684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:16:40.933 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 14:16:40.933 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425491,ok=425491,error=0, records=41
[INFO ] 2026-06-01 14:16:51.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:16:52.741 [19684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:16:55.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 14:16:55.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425492,ok=425492,error=0, records=41
[INFO ] 2026-06-01 14:17:06.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:17:06.407 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21288/300s
[WARN ] 2026-06-01 14:17:07.748 [19684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:17:10.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 14:17:10.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425493,ok=425493,error=0, records=41
[WARN ] 2026-06-01 14:17:17.755 [19666] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19761/stat), No such file or directory
[INFO ] 2026-06-01 14:17:21.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:17:22.766 [19684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:17:25.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 14:17:25.979 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425494,ok=425494,error=0, records=41
[WARN ] 2026-06-01 14:17:32.779 [19684] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19761/stat), No such file or directory
[INFO ] 2026-06-01 14:17:36.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:17:37.777 [19628] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:17:40.984 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 14:17:40.984 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425495,ok=425495,error=0, records=41
[INFO ] 2026-06-01 14:17:44.292 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21286/300s
[INFO ] 2026-06-01 14:17:46.304 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21286/300s
[WARN ] 2026-06-01 14:17:47.808 [19684] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19761/stat), No such file or directory
[WARN ] 2026-06-01 14:17:47.808 [19684] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19815/stat), No such file or directory
[WARN ] 2026-06-01 14:17:47.808 [19684] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19756/stat), No such file or directory
[WARN ] 2026-06-01 14:17:47.808 [19684] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19757/stat), No such file or directory
[INFO ] 2026-06-01 14:17:51.409 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:17:52.809 [19628] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:17:53.608 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21286/300s
[INFO ] 2026-06-01 14:17:55.988 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 14:17:55.988 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425496,ok=425496,error=0, records=41
[INFO ] 2026-06-01 14:18:06.409 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:18:07.814 [20072] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:18:10.994 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 14:18:10.994 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425497,ok=425497,error=0, records=41
[WARN ] 2026-06-01 14:18:17.818 [19666] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19911/stat), No such file or directory
[WARN ] 2026-06-01 14:18:17.819 [19666] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19918/stat), No such file or directory
[INFO ] 2026-06-01 14:18:21.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:18:22.819 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:18:26.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 14:18:26.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425498,ok=425498,error=0, records=41
[WARN ] 2026-06-01 14:18:32.323 [20062] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19911/stat), No such file or directory
[WARN ] 2026-06-01 14:18:32.324 [20062] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19918/stat), No such file or directory
[INFO ] 2026-06-01 14:18:36.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:18:37.825 [20157] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:18:41.004 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 14:18:41.004 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425499,ok=425499,error=0, records=41
[WARN ] 2026-06-01 14:18:47.330 [19666] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19911/stat), No such file or directory
[WARN ] 2026-06-01 14:18:47.330 [19666] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19918/stat), No such file or directory
[INFO ] 2026-06-01 14:18:51.411 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:18:52.831 [20062] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:18:56.011 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 14:18:56.011 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425500,ok=425500,error=0, records=41
[INFO ] 2026-06-01 14:19:06.411 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:19:07.836 [20057] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:19:09.190 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853332},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:19:09.489 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:19:09.489 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 14:19:09.489 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:19:09.490 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:19:09.490 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:19:09.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:19:09.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:19:09.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:19:09.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:19:09.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:19:09.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:19:11.102 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 14:19:11.102 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425501,ok=425501,error=0, records=41
[INFO ] 2026-06-01 14:19:21.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:19:22.842 [20057] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:19:26.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 14:19:26.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425502,ok=425502,error=0, records=41
[INFO ] 2026-06-01 14:19:36.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:19:37.846 [20211] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:19:41.115 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 14:19:41.115 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425503,ok=425503,error=0, records=41
[INFO ] 2026-06-01 14:19:51.413 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:19:52.853 [20226] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:19:56.120 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 14:19:56.120 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425504,ok=425504,error=0, records=41
[INFO ] 2026-06-01 14:20:01.109 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21290/300s
[INFO ] 2026-06-01 14:20:05.858 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21281/300s
[INFO ] 2026-06-01 14:20:06.414 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:20:07.859 [20057] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:20:11.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 14:20:11.126 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425505,ok=425505,error=0, records=41
[INFO ] 2026-06-01 14:20:21.414 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:20:22.864 [20057] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:20:26.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 14:20:26.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425506,ok=425506,error=0, records=41
[INFO ] 2026-06-01 14:20:26.130 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21277/300s
[INFO ] 2026-06-01 14:20:36.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:20:37.869 [20226] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:20:41.136 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 14:20:41.136 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425507,ok=425507,error=0, records=41
[INFO ] 2026-06-01 14:20:41.596 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21290/300s
[INFO ] 2026-06-01 14:20:51.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:20:52.879 [20259] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:20:56.142 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 14:20:56.142 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425508,ok=425508,error=0, records=41
[INFO ] 2026-06-01 14:21:06.416 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:21:07.884 [20259] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:21:11.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 14:21:11.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425509,ok=425509,error=0, records=41
[INFO ] 2026-06-01 14:21:19.203 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21277/300s
[INFO ] 2026-06-01 14:21:21.416 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:21:22.890 [20330] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:21:26.185 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 14:21:26.185 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425510,ok=425510,error=0, records=41
[INFO ] 2026-06-01 14:21:36.417 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:21:37.787 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21286/300s
[WARN ] 2026-06-01 14:21:37.895 [20335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:21:41.196 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 14:21:41.196 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425511,ok=425511,error=0, records=41
[INFO ] 2026-06-01 14:21:51.418 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:21:52.902 [20376] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:21:56.201 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 14:21:56.202 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425512,ok=425512,error=0, records=41
[INFO ] 2026-06-01 14:22:06.418 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:22:06.418 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21289/300s
[WARN ] 2026-06-01 14:22:07.907 [20376] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:22:09.490 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17725/300s
[INFO ] 2026-06-01 14:22:09.491 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853248},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:22:09.671 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:22:09.671 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 14:22:09.671 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:22:09.671 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:22:09.671 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:22:09.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:22:09.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:22:09.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:22:09.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:22:09.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:22:09.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:22:11.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 14:22:11.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425513,ok=425513,error=0, records=41
[INFO ] 2026-06-01 14:22:21.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:22:22.915 [20371] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:22:26.212 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 14:22:26.212 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425514,ok=425514,error=0, records=41
[INFO ] 2026-06-01 14:22:36.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:22:37.921 [20371] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:22:41.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 14:22:41.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425515,ok=425515,error=0, records=41
[INFO ] 2026-06-01 14:22:44.322 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21287/300s
[INFO ] 2026-06-01 14:22:46.325 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21287/300s
[INFO ] 2026-06-01 14:22:51.420 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:22:52.927 [20436] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:22:53.631 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21287/300s
[INFO ] 2026-06-01 14:22:56.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 14:22:56.226 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425516,ok=425516,error=0, records=41
[INFO ] 2026-06-01 14:23:06.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:23:07.933 [20330] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:23:11.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 14:23:11.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425517,ok=425517,error=0, records=41
[INFO ] 2026-06-01 14:23:21.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:23:22.939 [20462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:23:26.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 14:23:26.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425518,ok=425518,error=0, records=41
[INFO ] 2026-06-01 14:23:36.422 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 14:23:36.422 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 14:23:37.945 [20489] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:23:41.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 14:23:41.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425519,ok=425519,error=0, records=41
[INFO ] 2026-06-01 14:23:51.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:23:51.423 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 14:23:52.951 [20472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:23:56.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 14:23:56.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425520,ok=425520,error=0, records=41
[INFO ] 2026-06-01 14:24:06.424 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:24:07.957 [20472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:24:11.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-01 14:24:11.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425521,ok=425521,error=0, records=41
[INFO ] 2026-06-01 14:24:21.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:24:22.962 [20527] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:24:26.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-01 14:24:26.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425522,ok=425522,error=0, records=41
[INFO ] 2026-06-01 14:24:36.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:24:37.967 [20462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:24:41.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-01 14:24:41.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425523,ok=425523,error=0, records=41
[INFO ] 2026-06-01 14:24:51.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:24:52.972 [20472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:24:56.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 14:24:56.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425524,ok=425524,error=0, records=41
[INFO ] 2026-06-01 14:25:01.112 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21291/300s
[INFO ] 2026-06-01 14:25:05.975 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21282/300s
[INFO ] 2026-06-01 14:25:06.427 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:25:07.976 [20527] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:25:09.673 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853168},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:25:09.844 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:25:09.844 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-01 14:25:09.845 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:25:09.845 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:25:09.845 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:25:09.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:25:09.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:25:09.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:25:09.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:25:09.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:25:09.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:25:11.284 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 14:25:11.284 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425525,ok=425525,error=0, records=41
[INFO ] 2026-06-01 14:25:21.427 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:25:22.982 [20462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:25:26.289 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 14:25:26.289 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425526,ok=425526,error=0, records=41
[INFO ] 2026-06-01 14:25:26.289 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21278/300s
[INFO ] 2026-06-01 14:25:36.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:25:37.986 [20597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:25:41.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 14:25:41.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425527,ok=425527,error=0, records=41
[INFO ] 2026-06-01 14:25:41.604 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21291/300s
[INFO ] 2026-06-01 14:25:51.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:25:52.991 [20555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:25:56.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-01 14:25:56.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425528,ok=425528,error=0, records=41
[INFO ] 2026-06-01 14:26:06.429 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:26:07.996 [20597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:26:11.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 14:26:11.355 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425529,ok=425529,error=0, records=41
[INFO ] 2026-06-01 14:26:19.383 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21278/300s
[INFO ] 2026-06-01 14:26:21.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:26:23.002 [20625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:26:26.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-01 14:26:26.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425530,ok=425530,error=0, records=41
[INFO ] 2026-06-01 14:26:36.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:26:37.841 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21287/300s
[WARN ] 2026-06-01 14:26:38.007 [20625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:26:41.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 14:26:41.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425531,ok=425531,error=0, records=41
[INFO ] 2026-06-01 14:26:51.431 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:26:53.012 [20597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:26:56.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-01 14:26:56.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425532,ok=425532,error=0, records=41
[INFO ] 2026-06-01 14:27:06.431 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:27:06.432 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21290/300s
[WARN ] 2026-06-01 14:27:08.016 [20597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:27:11.379 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-01 14:27:11.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425533,ok=425533,error=0, records=41
[INFO ] 2026-06-01 14:27:21.432 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:27:23.022 [20555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:27:26.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 14:27:26.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425534,ok=425534,error=0, records=41
[INFO ] 2026-06-01 14:27:36.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:27:38.027 [20667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:27:41.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 14:27:41.390 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425535,ok=425535,error=0, records=41
[INFO ] 2026-06-01 14:27:44.385 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21288/300s
[INFO ] 2026-06-01 14:27:46.387 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21288/300s
[INFO ] 2026-06-01 14:27:51.434 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:27:53.032 [20710] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:27:53.694 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21288/300s
[INFO ] 2026-06-01 14:27:56.395 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 14:27:56.395 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425536,ok=425536,error=0, records=41
[INFO ] 2026-06-01 14:28:06.434 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:28:08.038 [20710] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:28:09.845 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17726/300s
[INFO ] 2026-06-01 14:28:09.846 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853088},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:28:10.033 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:28:10.033 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 14:28:10.034 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:28:10.034 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:28:10.034 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:28:10.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:28:10.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:28:10.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:28:10.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:28:10.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:28:10.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:28:11.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 14:28:11.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425537,ok=425537,error=0, records=41
[INFO ] 2026-06-01 14:28:21.435 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:28:23.044 [20738] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:28:26.405 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 14:28:26.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425538,ok=425538,error=0, records=41
[INFO ] 2026-06-01 14:28:36.436 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:28:38.048 [20772] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:28:41.411 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 14:28:41.411 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425539,ok=425539,error=0, records=41
[INFO ] 2026-06-01 14:28:51.436 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:28:53.054 [20771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:28:56.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 14:28:56.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425540,ok=425540,error=0, records=41
[INFO ] 2026-06-01 14:29:06.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:29:07.559 [20806] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:29:11.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 14:29:11.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425541,ok=425541,error=0, records=41
[INFO ] 2026-06-01 14:29:21.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:29:22.564 [20809] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:29:26.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 14:29:26.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425542,ok=425542,error=0, records=41
[INFO ] 2026-06-01 14:29:36.438 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:29:37.569 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:29:41.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 14:29:41.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425543,ok=425543,error=0, records=41
[INFO ] 2026-06-01 14:29:51.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:29:52.574 [20862] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:29:56.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 14:29:56.465 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425544,ok=425544,error=0, records=41
[INFO ] 2026-06-01 14:30:01.116 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21292/300s
[INFO ] 2026-06-01 14:30:06.080 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21283/300s
[INFO ] 2026-06-01 14:30:06.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:30:07.581 [20881] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:30:11.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 14:30:11.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425545,ok=425545,error=0, records=41
[INFO ] 2026-06-01 14:30:21.440 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:30:22.587 [20896] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:30:26.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 14:30:26.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425546,ok=425546,error=0, records=41
[INFO ] 2026-06-01 14:30:26.479 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21279/300s
[INFO ] 2026-06-01 14:30:36.441 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:30:37.593 [20881] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:30:41.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-01 14:30:41.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425547,ok=425547,error=0, records=41
[INFO ] 2026-06-01 14:30:41.610 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21292/300s
[INFO ] 2026-06-01 14:30:51.441 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:30:52.598 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:30:56.493 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 14:30:56.493 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425548,ok=425548,error=0, records=41
[INFO ] 2026-06-01 14:31:06.442 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:31:07.603 [20912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:31:10.035 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853012},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:31:10.203 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:31:10.203 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 14:31:10.203 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:31:10.203 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:31:10.203 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:31:10.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:31:10.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:31:10.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:31:10.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:31:10.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:31:10.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:31:11.499 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10401, records=41
[INFO ] 2026-06-01 14:31:11.499 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425549,ok=425549,error=0, records=41
[INFO ] 2026-06-01 14:31:19.569 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21279/300s
[INFO ] 2026-06-01 14:31:21.443 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:31:22.609 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:31:26.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10390, records=41
[INFO ] 2026-06-01 14:31:26.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425550,ok=425550,error=0, records=41
[INFO ] 2026-06-01 14:31:36.444 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:31:37.615 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:31:37.898 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21288/300s
[INFO ] 2026-06-01 14:31:41.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 14:31:41.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425551,ok=425551,error=0, records=41
[INFO ] 2026-06-01 14:31:51.444 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:31:52.621 [20926] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:31:56.521 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10365, records=41
[INFO ] 2026-06-01 14:31:56.521 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425552,ok=425552,error=0, records=41
[INFO ] 2026-06-01 14:32:06.445 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:32:06.445 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21291/300s
[WARN ] 2026-06-01 14:32:07.627 [20926] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:32:11.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 14:32:11.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425553,ok=425553,error=0, records=41
[INFO ] 2026-06-01 14:32:21.446 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:32:22.632 [20912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:32:26.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 14:32:26.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425554,ok=425554,error=0, records=41
[INFO ] 2026-06-01 14:32:36.446 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:32:37.637 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:32:41.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 14:32:41.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425555,ok=425555,error=0, records=41
[INFO ] 2026-06-01 14:32:44.478 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21289/300s
[INFO ] 2026-06-01 14:32:46.480 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21289/300s
[INFO ] 2026-06-01 14:32:51.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:32:52.643 [20902] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:32:53.786 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21289/300s
[INFO ] 2026-06-01 14:32:56.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 14:32:56.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425556,ok=425556,error=0, records=41
[INFO ] 2026-06-01 14:33:06.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:33:07.648 [20931] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:33:11.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 14:33:11.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425557,ok=425557,error=0, records=41
[INFO ] 2026-06-01 14:33:21.448 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:33:22.653 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:33:26.678 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 14:33:26.678 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425558,ok=425558,error=0, records=41
[INFO ] 2026-06-01 14:33:36.449 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 14:33:36.449 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 14:33:37.658 [20912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:33:41.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 14:33:41.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425559,ok=425559,error=0, records=41
[INFO ] 2026-06-01 14:33:51.449 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:33:52.665 [20912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:33:56.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 14:33:56.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425560,ok=425560,error=0, records=41
[INFO ] 2026-06-01 14:34:06.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:34:07.670 [20931] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:34:10.203 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17727/300s
[INFO ] 2026-06-01 14:34:10.206 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852932},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:34:10.373 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:34:10.373 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-01 14:34:10.374 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:34:10.374 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:34:10.374 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:34:10.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:34:10.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:34:10.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:34:10.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:34:10.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:34:10.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:34:11.699 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 14:34:11.699 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425561,ok=425561,error=0, records=41
[INFO ] 2026-06-01 14:34:21.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:34:22.675 [20926] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:34:26.704 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 14:34:26.704 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425562,ok=425562,error=0, records=41
[INFO ] 2026-06-01 14:34:36.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:34:37.680 [20912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:34:41.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 14:34:41.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425563,ok=425563,error=0, records=41
[INFO ] 2026-06-01 14:34:51.452 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:34:52.685 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:34:56.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 14:34:56.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425564,ok=425564,error=0, records=41
[INFO ] 2026-06-01 14:35:01.120 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21293/300s
[INFO ] 2026-06-01 14:35:06.190 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21284/300s
[INFO ] 2026-06-01 14:35:06.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:35:07.691 [20931] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:35:11.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 14:35:11.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425565,ok=425565,error=0, records=41
[INFO ] 2026-06-01 14:35:21.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:35:22.696 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:35:26.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 14:35:26.728 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425566,ok=425566,error=0, records=41
[INFO ] 2026-06-01 14:35:26.728 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21280/300s
[INFO ] 2026-06-01 14:35:36.454 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:35:37.701 [20902] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:35:41.618 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21293/300s
[INFO ] 2026-06-01 14:35:41.735 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 14:35:41.735 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425567,ok=425567,error=0, records=41
[INFO ] 2026-06-01 14:35:51.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:35:52.707 [20912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:35:56.740 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 14:35:56.740 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425568,ok=425568,error=0, records=41
[INFO ] 2026-06-01 14:36:06.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:36:07.713 [20902] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:36:11.746 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 14:36:11.746 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425569,ok=425569,error=0, records=41
[INFO ] 2026-06-01 14:36:19.757 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21280/300s
[INFO ] 2026-06-01 14:36:21.456 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:36:22.720 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:36:26.752 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 14:36:26.752 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425570,ok=425570,error=0, records=41
[INFO ] 2026-06-01 14:36:36.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:36:37.725 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:36:37.962 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21289/300s
[INFO ] 2026-06-01 14:36:41.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 14:36:41.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425571,ok=425571,error=0, records=41
[INFO ] 2026-06-01 14:36:51.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:36:52.730 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:36:56.766 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 14:36:56.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425572,ok=425572,error=0, records=41
[INFO ] 2026-06-01 14:37:06.458 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:37:06.458 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21292/300s
[WARN ] 2026-06-01 14:37:07.735 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:37:10.376 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852856},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:37:10.517 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:37:10.517 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 14:37:10.517 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:37:10.518 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:37:10.518 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:37:10.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:37:10.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:37:10.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:37:10.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:37:10.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:37:10.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:37:11.772 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 14:37:11.772 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425573,ok=425573,error=0, records=41
[INFO ] 2026-06-01 14:37:21.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:37:22.741 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:37:26.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 14:37:26.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425574,ok=425574,error=0, records=41
[INFO ] 2026-06-01 14:37:36.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:37:37.746 [20931] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:37:41.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 14:37:41.783 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425575,ok=425575,error=0, records=41
[INFO ] 2026-06-01 14:37:44.571 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21290/300s
[INFO ] 2026-06-01 14:37:46.573 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21290/300s
[INFO ] 2026-06-01 14:37:51.460 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:37:52.751 [20912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:37:53.879 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21290/300s
[INFO ] 2026-06-01 14:37:56.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 14:37:56.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425576,ok=425576,error=0, records=41
[INFO ] 2026-06-01 14:38:06.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:38:07.757 [20931] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:38:11.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 14:38:11.842 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425577,ok=425577,error=0, records=41
[INFO ] 2026-06-01 14:38:21.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:38:22.763 [20931] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:38:26.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 14:38:26.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425578,ok=425578,error=0, records=41
[INFO ] 2026-06-01 14:38:36.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:38:37.768 [20912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:38:41.852 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 14:38:41.852 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425579,ok=425579,error=0, records=41
[INFO ] 2026-06-01 14:38:51.463 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:38:51.463 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 14:38:52.774 [20926] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:38:56.858 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 14:38:56.858 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425580,ok=425580,error=0, records=41
[INFO ] 2026-06-01 14:39:06.464 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:39:07.779 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:39:11.863 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 14:39:11.863 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425581,ok=425581,error=0, records=41
[INFO ] 2026-06-01 14:39:21.465 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:39:22.784 [20926] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:39:26.874 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 14:39:26.875 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425582,ok=425582,error=0, records=41
[INFO ] 2026-06-01 14:39:36.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:39:37.789 [20926] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:39:41.879 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 14:39:41.879 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425583,ok=425583,error=0, records=41
[INFO ] 2026-06-01 14:39:51.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:39:52.794 [20844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:39:56.886 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 14:39:56.886 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425584,ok=425584,error=0, records=41
[INFO ] 2026-06-01 14:40:01.124 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21294/300s
[INFO ] 2026-06-01 14:40:06.299 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21285/300s
[INFO ] 2026-06-01 14:40:06.467 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:40:07.801 [20912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:40:10.518 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17728/300s
[INFO ] 2026-06-01 14:40:10.519 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852776},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:40:10.655 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:40:10.655 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 14:40:10.655 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:40:10.656 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:40:10.656 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:40:10.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:40:10.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:40:10.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:40:10.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:40:10.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:40:10.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:40:11.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10394, records=41
[INFO ] 2026-06-01 14:40:11.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425585,ok=425585,error=0, records=41
[INFO ] 2026-06-01 14:40:21.467 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:40:22.806 [20931] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:40:26.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10387, records=41
[INFO ] 2026-06-01 14:40:26.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425586,ok=425586,error=0, records=41
[INFO ] 2026-06-01 14:40:26.898 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21281/300s
[INFO ] 2026-06-01 14:40:36.468 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:40:37.813 [21456] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:40:41.625 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21294/300s
[INFO ] 2026-06-01 14:40:41.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-01 14:40:41.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425587,ok=425587,error=0, records=41
[INFO ] 2026-06-01 14:40:51.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:40:52.818 [21441] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:40:56.946 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-01 14:40:56.946 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425588,ok=425588,error=0, records=41
[INFO ] 2026-06-01 14:41:06.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:41:07.823 [21466] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:41:11.953 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 14:41:11.953 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425589,ok=425589,error=0, records=41
[INFO ] 2026-06-01 14:41:19.940 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21281/300s
[INFO ] 2026-06-01 14:41:21.470 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:41:22.828 [21485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:41:26.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 14:41:26.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425590,ok=425590,error=0, records=41
[INFO ] 2026-06-01 14:41:36.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:41:37.833 [21513] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:41:38.019 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21290/300s
[INFO ] 2026-06-01 14:41:41.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 14:41:41.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425591,ok=425591,error=0, records=41
[INFO ] 2026-06-01 14:41:51.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:41:52.837 [21456] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:41:56.984 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 14:41:56.984 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425592,ok=425592,error=0, records=41
[INFO ] 2026-06-01 14:42:06.472 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:42:06.472 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21293/300s
[WARN ] 2026-06-01 14:42:07.843 [21456] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:42:12.097 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 14:42:12.098 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425593,ok=425593,error=0, records=41
[INFO ] 2026-06-01 14:42:21.473 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:42:22.848 [21527] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:42:27.104 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 14:42:27.104 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425594,ok=425594,error=0, records=41
[INFO ] 2026-06-01 14:42:36.473 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:42:37.853 [21551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:42:42.111 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 14:42:42.111 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425595,ok=425595,error=0, records=41
[INFO ] 2026-06-01 14:42:44.641 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21291/300s
[INFO ] 2026-06-01 14:42:46.643 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21291/300s
[INFO ] 2026-06-01 14:42:51.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:42:52.857 [21499] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:42:53.950 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21291/300s
[INFO ] 2026-06-01 14:42:57.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 14:42:57.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425596,ok=425596,error=0, records=41
[INFO ] 2026-06-01 14:43:06.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:43:07.863 [21597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:43:10.657 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852704},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:43:10.826 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:43:10.826 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 14:43:10.826 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:43:10.826 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:43:10.826 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:43:10.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:43:10.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:43:10.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:43:10.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:43:10.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:43:10.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:43:12.121 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 14:43:12.121 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425597,ok=425597,error=0, records=41
[INFO ] 2026-06-01 14:43:21.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:43:22.867 [21597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:43:27.127 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 14:43:27.127 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425598,ok=425598,error=0, records=41
[INFO ] 2026-06-01 14:43:36.476 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 14:43:36.476 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 14:43:37.871 [21597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:43:42.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 14:43:42.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425599,ok=425599,error=0, records=41
[INFO ] 2026-06-01 14:43:51.477 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:43:52.880 [21597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:43:57.140 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 14:43:57.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425600,ok=425600,error=0, records=41
[INFO ] 2026-06-01 14:44:06.478 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:44:07.888 [21663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:44:12.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-01 14:44:12.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425601,ok=425601,error=0, records=41
[INFO ] 2026-06-01 14:44:21.479 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:44:22.898 [21652] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:44:27.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-01 14:44:27.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425602,ok=425602,error=0, records=41
[INFO ] 2026-06-01 14:44:36.480 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:44:37.902 [21693] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:44:42.179 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 14:44:42.179 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425603,ok=425603,error=0, records=41
[INFO ] 2026-06-01 14:44:51.480 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:44:52.906 [21717] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:44:57.184 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-01 14:44:57.185 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425604,ok=425604,error=0, records=41
[INFO ] 2026-06-01 14:45:01.127 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21295/300s
[INFO ] 2026-06-01 14:45:06.412 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21286/300s
[INFO ] 2026-06-01 14:45:06.481 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:45:07.913 [21718] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:45:12.190 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 14:45:12.190 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425605,ok=425605,error=0, records=41
[INFO ] 2026-06-01 14:45:21.488 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:45:22.948 [21757] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:45:27.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-01 14:45:27.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425606,ok=425606,error=0, records=41
[INFO ] 2026-06-01 14:45:27.239 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21282/300s
[INFO ] 2026-06-01 14:45:36.489 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:45:38.057 [21740] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:45:41.634 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21295/300s
[INFO ] 2026-06-01 14:45:42.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 14:45:42.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425607,ok=425607,error=0, records=41
[WARN ] 2026-06-01 14:45:47.572 [21740] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19872/stat), No such file or directory
[WARN ] 2026-06-01 14:45:47.573 [21740] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19913/stat), No such file or directory
[INFO ] 2026-06-01 14:45:51.489 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:45:52.579 [21819] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:45:57.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 14:45:57.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425608,ok=425608,error=0, records=41
[INFO ] 2026-06-01 14:46:06.490 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:46:07.624 [21740] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:46:10.826 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17729/300s
[INFO ] 2026-06-01 14:46:10.828 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853040},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:46:10.984 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:46:10.984 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 14:46:10.984 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:46:10.984 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:46:10.984 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:46:11.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:46:11.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:46:11.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:46:11.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:46:11.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:46:11.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:46:12.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 14:46:12.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425609,ok=425609,error=0, records=41
[INFO ] 2026-06-01 14:46:20.235 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21282/300s
[INFO ] 2026-06-01 14:46:21.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:46:22.663 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:46:27.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 14:46:27.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425610,ok=425610,error=0, records=41
[WARN ] 2026-06-01 14:46:32.668 [21837] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19818/stat), No such file or directory
[INFO ] 2026-06-01 14:46:36.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:46:37.679 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:46:38.252 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21291/300s
[INFO ] 2026-06-01 14:46:42.271 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 14:46:42.271 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425611,ok=425611,error=0, records=41
[WARN ] 2026-06-01 14:46:47.691 [21825] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19779/stat), No such file or directory
[WARN ] 2026-06-01 14:46:47.693 [21825] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19818/stat), No such file or directory
[WARN ] 2026-06-01 14:46:47.693 [21825] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19824/stat), No such file or directory
[WARN ] 2026-06-01 14:46:47.693 [21825] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19877/stat), No such file or directory
[INFO ] 2026-06-01 14:46:51.492 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:46:52.703 [21825] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:46:57.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-01 14:46:57.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425612,ok=425612,error=0, records=41
[INFO ] 2026-06-01 14:47:06.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:47:06.493 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21294/300s
[WARN ] 2026-06-01 14:47:07.709 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:47:12.282 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-01 14:47:12.282 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425613,ok=425613,error=0, records=41
[INFO ] 2026-06-01 14:47:21.494 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:47:22.725 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:47:27.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 14:47:27.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425614,ok=425614,error=0, records=41
[INFO ] 2026-06-01 14:47:36.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:47:37.828 [21825] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:47:42.294 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 14:47:42.294 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425615,ok=425615,error=0, records=41
[INFO ] 2026-06-01 14:47:44.653 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21292/300s
[INFO ] 2026-06-01 14:47:46.685 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21292/300s
[WARN ] 2026-06-01 14:47:47.334 [21825] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19885/stat), No such file or directory
[INFO ] 2026-06-01 14:47:51.497 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:47:52.840 [21825] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:47:54.002 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21292/300s
[INFO ] 2026-06-01 14:47:57.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 14:47:57.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425616,ok=425616,error=0, records=41
[INFO ] 2026-06-01 14:48:06.497 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:48:07.909 [21830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:48:12.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 14:48:12.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425617,ok=425617,error=0, records=41
[WARN ] 2026-06-01 14:48:17.423 [22022] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21860/stat), No such file or directory
[INFO ] 2026-06-01 14:48:21.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:48:22.922 [22016] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:48:27.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 14:48:27.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425618,ok=425618,error=0, records=41
[WARN ] 2026-06-01 14:48:32.429 [22016] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21860/stat), No such file or directory
[INFO ] 2026-06-01 14:48:36.499 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:48:37.936 [22016] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:48:42.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 14:48:42.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425619,ok=425619,error=0, records=41
[WARN ] 2026-06-01 14:48:47.460 [22066] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21860/stat), No such file or directory
[INFO ] 2026-06-01 14:48:51.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:48:52.969 [22071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:48:57.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 14:48:57.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425620,ok=425620,error=0, records=41
[INFO ] 2026-06-01 14:49:06.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:49:07.974 [22060] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:49:10.986 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852928},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:49:11.142 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:49:11.142 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 14:49:11.142 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:49:11.142 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:49:11.142 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:49:11.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:49:11.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:49:11.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:49:11.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:49:11.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:49:11.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:49:12.407 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 14:49:12.407 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425621,ok=425621,error=0, records=41
[INFO ] 2026-06-01 14:49:21.501 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:49:22.979 [22127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:49:27.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 14:49:27.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425622,ok=425622,error=0, records=41
[WARN ] 2026-06-01 14:49:32.483 [22054] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21871/stat), No such file or directory
[INFO ] 2026-06-01 14:49:36.501 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:49:37.984 [22066] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:49:42.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 14:49:42.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425623,ok=425623,error=0, records=41
[WARN ] 2026-06-01 14:49:47.488 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21871/stat), No such file or directory
[INFO ] 2026-06-01 14:49:51.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:49:52.989 [22060] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:49:57.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 14:49:57.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425624,ok=425624,error=0, records=41
[INFO ] 2026-06-01 14:50:01.144 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21296/300s
[INFO ] 2026-06-01 14:50:06.493 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21287/300s
[INFO ] 2026-06-01 14:50:06.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:50:07.994 [22249] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:50:12.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 14:50:12.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425625,ok=425625,error=0, records=41
[WARN ] 2026-06-01 14:50:17.499 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21862/stat), No such file or directory
[WARN ] 2026-06-01 14:50:17.499 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21977/stat), No such file or directory
[WARN ] 2026-06-01 14:50:17.499 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21980/stat), No such file or directory
[WARN ] 2026-06-01 14:50:17.499 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21886/stat), No such file or directory
[WARN ] 2026-06-01 14:50:17.500 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21867/stat), No such file or directory
[INFO ] 2026-06-01 14:50:21.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:50:23.000 [22249] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:50:27.518 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 14:50:27.518 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425626,ok=425626,error=0, records=41
[INFO ] 2026-06-01 14:50:27.518 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21283/300s
[WARN ] 2026-06-01 14:50:32.503 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21858/stat), No such file or directory
[WARN ] 2026-06-01 14:50:32.503 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21862/stat), No such file or directory
[WARN ] 2026-06-01 14:50:32.503 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21977/stat), No such file or directory
[WARN ] 2026-06-01 14:50:32.503 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21980/stat), No such file or directory
[WARN ] 2026-06-01 14:50:32.503 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21886/stat), No such file or directory
[WARN ] 2026-06-01 14:50:32.504 [22175] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21867/stat), No such file or directory
[INFO ] 2026-06-01 14:50:36.504 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:50:38.004 [22175] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:50:41.652 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21296/300s
[INFO ] 2026-06-01 14:50:42.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 14:50:42.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425627,ok=425627,error=0, records=41
[WARN ] 2026-06-01 14:50:47.508 [22060] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21858/stat), No such file or directory
[WARN ] 2026-06-01 14:50:47.508 [22060] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21862/stat), No such file or directory
[WARN ] 2026-06-01 14:50:47.508 [22060] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21977/stat), No such file or directory
[WARN ] 2026-06-01 14:50:47.508 [22060] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21980/stat), No such file or directory
[WARN ] 2026-06-01 14:50:47.509 [22060] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21886/stat), No such file or directory
[WARN ] 2026-06-01 14:50:47.509 [22060] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21867/stat), No such file or directory
[INFO ] 2026-06-01 14:50:51.504 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:50:53.009 [22277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:50:57.528 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 14:50:57.528 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425628,ok=425628,error=0, records=41
[INFO ] 2026-06-01 14:51:06.505 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:51:08.014 [22249] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:51:12.533 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 14:51:12.533 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425629,ok=425629,error=0, records=41
[INFO ] 2026-06-01 14:51:20.565 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21283/300s
[INFO ] 2026-06-01 14:51:21.505 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:51:23.018 [22292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:51:27.538 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 14:51:27.538 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425630,ok=425630,error=0, records=41
[INFO ] 2026-06-01 14:51:36.506 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:51:38.024 [22319] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:51:38.515 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21292/300s
[INFO ] 2026-06-01 14:51:42.544 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-01 14:51:42.544 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425631,ok=425631,error=0, records=41
[INFO ] 2026-06-01 14:51:51.506 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:51:53.028 [22060] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:51:57.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 14:51:57.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425632,ok=425632,error=0, records=41
[INFO ] 2026-06-01 14:52:06.507 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:52:06.507 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21295/300s
[WARN ] 2026-06-01 14:52:08.034 [22361] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:52:11.143 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17730/300s
[INFO ] 2026-06-01 14:52:11.144 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852800},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:52:11.305 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:52:11.306 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 14:52:11.306 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:52:11.306 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:52:11.306 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:52:11.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:52:11.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:52:11.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:52:11.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:52:11.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:52:11.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:52:12.555 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 14:52:12.555 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425633,ok=425633,error=0, records=41
[INFO ] 2026-06-01 14:52:21.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:52:23.039 [22319] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:52:27.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-01 14:52:27.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425634,ok=425634,error=0, records=41
[INFO ] 2026-06-01 14:52:36.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:52:38.045 [22319] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:52:42.570 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 14:52:42.570 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425635,ok=425635,error=0, records=41
[INFO ] 2026-06-01 14:52:44.711 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21293/300s
[INFO ] 2026-06-01 14:52:46.713 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21293/300s
[INFO ] 2026-06-01 14:52:51.509 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:52:53.051 [22060] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:52:54.021 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21293/300s
[INFO ] 2026-06-01 14:52:57.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-01 14:52:57.581 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425636,ok=425636,error=0, records=41
[INFO ] 2026-06-01 14:53:06.510 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:53:07.558 [22400] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:53:12.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 14:53:12.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425637,ok=425637,error=0, records=41
[INFO ] 2026-06-01 14:53:21.510 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:53:22.563 [22445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:53:27.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 14:53:27.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425638,ok=425638,error=0, records=41
[INFO ] 2026-06-01 14:53:36.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 14:53:36.511 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 14:53:37.567 [22463] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:53:42.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 14:53:42.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425639,ok=425639,error=0, records=41
[INFO ] 2026-06-01 14:53:51.512 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:53:51.512 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 14:53:52.573 [22445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:53:57.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 14:53:57.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425640,ok=425640,error=0, records=41
[INFO ] 2026-06-01 14:54:06.513 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:54:07.578 [22497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:54:12.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 14:54:12.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425641,ok=425641,error=0, records=41
[INFO ] 2026-06-01 14:54:21.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:54:22.584 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:54:27.616 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-01 14:54:27.616 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425642,ok=425642,error=0, records=41
[WARN ] 2026-06-01 14:54:32.588 [22503] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22239/stat), No such file or directory
[WARN ] 2026-06-01 14:54:32.588 [22503] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22236/stat), No such file or directory
[INFO ] 2026-06-01 14:54:36.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:54:37.589 [22543] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:54:42.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 14:54:42.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425643,ok=425643,error=0, records=41
[WARN ] 2026-06-01 14:54:47.594 [22491] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22239/stat), No such file or directory
[WARN ] 2026-06-01 14:54:47.594 [22491] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22236/stat), No such file or directory
[INFO ] 2026-06-01 14:54:51.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:54:52.594 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:54:57.626 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 14:54:57.626 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425644,ok=425644,error=0, records=41
[INFO ] 2026-06-01 14:55:01.147 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21297/300s
[INFO ] 2026-06-01 14:55:06.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:55:06.599 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21288/300s
[WARN ] 2026-06-01 14:55:07.600 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:55:11.308 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852720},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:55:11.477 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:55:11.477 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 14:55:11.477 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:55:11.477 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:55:11.477 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:55:11.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:55:11.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:55:11.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:55:11.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:55:11.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:55:11.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:55:12.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 14:55:12.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425645,ok=425645,error=0, records=41
[INFO ] 2026-06-01 14:55:21.516 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:55:22.617 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:55:27.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 14:55:27.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425646,ok=425646,error=0, records=41
[INFO ] 2026-06-01 14:55:27.644 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21284/300s
[INFO ] 2026-06-01 14:55:36.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:55:37.630 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:55:41.659 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21297/300s
[INFO ] 2026-06-01 14:55:42.699 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 14:55:42.699 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425647,ok=425647,error=0, records=41
[WARN ] 2026-06-01 14:55:47.640 [22497] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22182/stat), No such file or directory
[WARN ] 2026-06-01 14:55:47.642 [22497] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22226/stat), No such file or directory
[INFO ] 2026-06-01 14:55:51.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:55:52.642 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:55:57.705 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-01 14:55:57.705 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425648,ok=425648,error=0, records=41
[INFO ] 2026-06-01 14:56:06.518 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:56:07.647 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:56:12.710 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 14:56:12.710 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425649,ok=425649,error=0, records=41
[INFO ] 2026-06-01 14:56:20.786 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21284/300s
[INFO ] 2026-06-01 14:56:21.519 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:56:22.652 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:56:27.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-01 14:56:27.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425650,ok=425650,error=0, records=41
[INFO ] 2026-06-01 14:56:36.519 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:56:37.657 [22503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:56:38.601 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21293/300s
[INFO ] 2026-06-01 14:56:42.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 14:56:42.722 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425651,ok=425651,error=0, records=41
[INFO ] 2026-06-01 14:56:51.520 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:56:52.663 [22503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:56:57.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 14:56:57.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425652,ok=425652,error=0, records=41
[INFO ] 2026-06-01 14:57:06.521 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 14:57:06.521 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21296/300s
[WARN ] 2026-06-01 14:57:07.668 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:57:12.734 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 14:57:12.734 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425653,ok=425653,error=0, records=41
[INFO ] 2026-06-01 14:57:21.521 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:57:22.672 [22497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:57:27.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-01 14:57:27.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425654,ok=425654,error=0, records=41
[INFO ] 2026-06-01 14:57:36.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:57:37.678 [22503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:57:42.745 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 14:57:42.745 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425655,ok=425655,error=0, records=41
[INFO ] 2026-06-01 14:57:44.731 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21294/300s
[INFO ] 2026-06-01 14:57:46.753 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21294/300s
[INFO ] 2026-06-01 14:57:51.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:57:52.684 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:57:54.061 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21294/300s
[INFO ] 2026-06-01 14:57:57.759 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 14:57:57.759 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425656,ok=425656,error=0, records=41
[INFO ] 2026-06-01 14:58:06.523 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:58:07.689 [22560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:58:11.477 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17731/300s
[INFO ] 2026-06-01 14:58:11.479 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852612},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 14:58:11.671 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 14:58:11.671 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 14:58:11.671 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 14:58:11.671 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 14:58:11.671 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 14:58:11.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:58:11.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 14:58:11.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:58:11.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 14:58:11.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:58:11.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 14:58:12.830 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 14:58:12.830 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425657,ok=425657,error=0, records=41
[INFO ] 2026-06-01 14:58:21.523 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:58:22.695 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:58:27.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 14:58:27.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425658,ok=425658,error=0, records=41
[INFO ] 2026-06-01 14:58:36.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:58:37.700 [22497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:58:42.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 14:58:42.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425659,ok=425659,error=0, records=41
[INFO ] 2026-06-01 14:58:51.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:58:52.705 [22503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:58:57.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 14:58:57.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425660,ok=425660,error=0, records=41
[INFO ] 2026-06-01 14:59:06.525 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:59:07.710 [22503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:59:12.849 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 14:59:12.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425661,ok=425661,error=0, records=41
[INFO ] 2026-06-01 14:59:21.526 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:59:22.716 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:59:27.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 14:59:27.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425662,ok=425662,error=0, records=41
[INFO ] 2026-06-01 14:59:36.526 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:59:37.740 [22560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:59:42.859 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 14:59:42.859 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425663,ok=425663,error=0, records=41
[WARN ] 2026-06-01 14:59:47.745 [22538] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22591/stat), No such file or directory
[WARN ] 2026-06-01 14:59:47.745 [22538] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22523/stat), No such file or directory
[INFO ] 2026-06-01 14:59:51.537 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 14:59:52.766 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 14:59:57.873 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 14:59:57.873 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425664,ok=425664,error=0, records=41
[INFO ] 2026-06-01 15:00:01.161 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21298/300s
[INFO ] 2026-06-01 15:00:06.538 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:00:06.775 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21289/300s
[WARN ] 2026-06-01 15:00:07.776 [22560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:00:12.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 15:00:12.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425665,ok=425665,error=0, records=41
[INFO ] 2026-06-01 15:00:21.538 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:00:22.781 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:00:27.885 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 15:00:27.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425666,ok=425666,error=0, records=41
[INFO ] 2026-06-01 15:00:27.885 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21285/300s
[INFO ] 2026-06-01 15:00:36.539 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:00:37.788 [22497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:00:41.667 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21298/300s
[INFO ] 2026-06-01 15:00:42.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-01 15:00:42.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425667,ok=425667,error=0, records=41
[INFO ] 2026-06-01 15:00:51.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:00:52.793 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:00:57.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 15:00:57.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425668,ok=425668,error=0, records=41
[INFO ] 2026-06-01 15:01:06.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:01:07.799 [22503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:01:11.673 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852504},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:01:11.840 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:01:11.840 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 15:01:11.840 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:01:11.840 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:01:11.840 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:01:11.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:01:11.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:01:11.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:01:11.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:01:11.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:01:11.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:01:12.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10386, records=41
[INFO ] 2026-06-01 15:01:12.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425669,ok=425669,error=0, records=41
[INFO ] 2026-06-01 15:01:21.023 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21285/300s
[INFO ] 2026-06-01 15:01:21.541 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:01:22.804 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:01:27.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-01 15:01:27.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425670,ok=425670,error=0, records=41
[INFO ] 2026-06-01 15:01:36.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:01:37.810 [23011] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:01:38.687 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21294/300s
[INFO ] 2026-06-01 15:01:42.928 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 15:01:42.928 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425671,ok=425671,error=0, records=41
[INFO ] 2026-06-01 15:01:51.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:01:52.815 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:01:57.933 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 15:01:57.933 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425672,ok=425672,error=0, records=41
[INFO ] 2026-06-01 15:02:06.543 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:02:06.543 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21297/300s
[WARN ] 2026-06-01 15:02:07.822 [22995] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:02:12.941 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 15:02:12.941 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425673,ok=425673,error=0, records=41
[INFO ] 2026-06-01 15:02:21.543 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:02:22.828 [23064] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:02:27.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 15:02:27.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425674,ok=425674,error=0, records=41
[WARN ] 2026-06-01 15:02:32.333 [23064] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22218/stat), No such file or directory
[INFO ] 2026-06-01 15:02:36.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:02:37.834 [23064] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:02:42.953 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 15:02:42.953 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425675,ok=425675,error=0, records=41
[INFO ] 2026-06-01 15:02:44.742 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21295/300s
[INFO ] 2026-06-01 15:02:46.840 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21295/300s
[WARN ] 2026-06-01 15:02:47.339 [23079] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22218/stat), No such file or directory
[WARN ] 2026-06-01 15:02:47.340 [23079] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22230/stat), No such file or directory
[INFO ] 2026-06-01 15:02:51.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:02:52.840 [23064] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:02:54.144 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21295/300s
[INFO ] 2026-06-01 15:02:57.959 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 15:02:57.959 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425676,ok=425676,error=0, records=41
[INFO ] 2026-06-01 15:03:06.545 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:03:07.846 [23021] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:03:12.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 15:03:12.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425677,ok=425677,error=0, records=41
[INFO ] 2026-06-01 15:03:21.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:03:22.852 [22491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:03:27.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 15:03:27.973 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425678,ok=425678,error=0, records=41
[INFO ] 2026-06-01 15:03:36.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 15:03:36.546 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 15:03:37.858 [23026] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:03:42.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 15:03:42.980 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425679,ok=425679,error=0, records=41
[INFO ] 2026-06-01 15:03:51.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:03:52.864 [23064] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:03:57.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 15:03:57.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425680,ok=425680,error=0, records=41
[INFO ] 2026-06-01 15:04:06.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:04:07.889 [23064] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:04:11.841 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17732/300s
[INFO ] 2026-06-01 15:04:11.842 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852396},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:04:12.002 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:04:12.002 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 15:04:12.002 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:04:12.002 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:04:12.003 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:04:12.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:04:12.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:04:12.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:04:12.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:04:12.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:04:12.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:04:12.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 15:04:12.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425681,ok=425681,error=0, records=41
[INFO ] 2026-06-01 15:04:21.548 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:04:22.897 [23177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:04:28.044 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 15:04:28.044 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425682,ok=425682,error=0, records=41
[INFO ] 2026-06-01 15:04:36.548 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:04:37.914 [23189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:04:43.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 15:04:43.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425683,ok=425683,error=0, records=41
[INFO ] 2026-06-01 15:04:51.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:04:52.920 [23178] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:04:58.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 15:04:58.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425684,ok=425684,error=0, records=41
[INFO ] 2026-06-01 15:05:01.164 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21299/300s
[INFO ] 2026-06-01 15:05:06.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:05:06.926 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21290/300s
[WARN ] 2026-06-01 15:05:07.927 [23231] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:05:13.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 15:05:13.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425685,ok=425685,error=0, records=41
[INFO ] 2026-06-01 15:05:21.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:05:22.933 [23248] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:05:28.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 15:05:28.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425686,ok=425686,error=0, records=41
[INFO ] 2026-06-01 15:05:28.067 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21286/300s
[INFO ] 2026-06-01 15:05:36.551 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:05:37.939 [23232] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:05:41.672 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21299/300s
[INFO ] 2026-06-01 15:05:43.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 15:05:43.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425687,ok=425687,error=0, records=41
[INFO ] 2026-06-01 15:05:51.551 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:05:52.946 [23293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:05:58.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 15:05:58.077 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425688,ok=425688,error=0, records=41
[INFO ] 2026-06-01 15:06:06.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:06:07.952 [23310] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:06:13.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 15:06:13.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425689,ok=425689,error=0, records=41
[INFO ] 2026-06-01 15:06:21.228 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21286/300s
[INFO ] 2026-06-01 15:06:21.561 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:06:22.961 [23282] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:06:28.090 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 15:06:28.090 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425690,ok=425690,error=0, records=41
[INFO ] 2026-06-01 15:06:36.561 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:06:37.965 [23283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:06:38.802 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21295/300s
[INFO ] 2026-06-01 15:06:43.096 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 15:06:43.096 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425691,ok=425691,error=0, records=41
[INFO ] 2026-06-01 15:06:51.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:06:52.971 [23283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:06:58.100 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 15:06:58.100 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425692,ok=425692,error=0, records=41
[INFO ] 2026-06-01 15:07:06.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:07:06.563 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21298/300s
[WARN ] 2026-06-01 15:07:07.975 [23344] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:07:12.004 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852300},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:07:12.183 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:07:12.183 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 15:07:12.183 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:07:12.183 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:07:12.183 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:07:12.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:07:12.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:07:12.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:07:12.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:07:12.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:07:12.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:07:13.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 15:07:13.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425693,ok=425693,error=0, records=41
[INFO ] 2026-06-01 15:07:21.563 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:07:22.979 [23344] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:07:28.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 15:07:28.114 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425694,ok=425694,error=0, records=41
[INFO ] 2026-06-01 15:07:36.564 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:07:37.990 [23360] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:07:43.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 15:07:43.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425695,ok=425695,error=0, records=41
[INFO ] 2026-06-01 15:07:44.787 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21296/300s
[INFO ] 2026-06-01 15:07:46.926 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21296/300s
[INFO ] 2026-06-01 15:07:51.564 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:07:53.013 [23422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:07:54.234 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21296/300s
[INFO ] 2026-06-01 15:07:58.124 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 15:07:58.124 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425696,ok=425696,error=0, records=41
[INFO ] 2026-06-01 15:08:06.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:08:08.021 [23360] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:08:13.129 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 15:08:13.129 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425697,ok=425697,error=0, records=41
[WARN ] 2026-06-01 15:08:17.526 [23316] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23051/stat), No such file or directory
[WARN ] 2026-06-01 15:08:17.527 [23316] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22598/stat), No such file or directory
[INFO ] 2026-06-01 15:08:21.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:08:23.029 [23316] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:08:28.134 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 15:08:28.134 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425698,ok=425698,error=0, records=41
[WARN ] 2026-06-01 15:08:32.534 [23375] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23051/stat), No such file or directory
[WARN ] 2026-06-01 15:08:32.534 [23375] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22598/stat), No such file or directory
[INFO ] 2026-06-01 15:08:36.566 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:08:38.034 [23316] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:08:43.138 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 15:08:43.138 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425699,ok=425699,error=0, records=41
[WARN ] 2026-06-01 15:08:47.538 [23468] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23051/stat), No such file or directory
[WARN ] 2026-06-01 15:08:47.539 [23468] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22598/stat), No such file or directory
[INFO ] 2026-06-01 15:08:51.566 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:08:51.566 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 15:08:53.038 [23468] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:08:58.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 15:08:58.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425700,ok=425700,error=0, records=41
[INFO ] 2026-06-01 15:09:06.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:09:08.043 [23503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:09:13.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 15:09:13.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425701,ok=425701,error=0, records=41
[INFO ] 2026-06-01 15:09:21.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:09:23.048 [23483] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:09:28.221 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 15:09:28.221 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425702,ok=425702,error=0, records=41
[INFO ] 2026-06-01 15:09:36.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:09:38.053 [23503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:09:43.228 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 15:09:43.228 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425703,ok=425703,error=0, records=41
[INFO ] 2026-06-01 15:09:51.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:09:52.558 [23552] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:09:58.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 15:09:58.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425704,ok=425704,error=0, records=41
[INFO ] 2026-06-01 15:10:01.166 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21300/300s
[INFO ] 2026-06-01 15:10:06.570 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:10:07.062 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21291/300s
[WARN ] 2026-06-01 15:10:07.562 [23547] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:10:12.183 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17733/300s
[INFO ] 2026-06-01 15:10:12.185 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852188},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:10:12.344 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:10:12.344 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 15:10:12.344 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:10:12.344 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:10:12.344 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:10:12.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:10:12.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:10:12.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:10:12.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:10:12.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:10:12.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:10:13.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 15:10:13.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425705,ok=425705,error=0, records=41
[INFO ] 2026-06-01 15:10:21.570 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:10:22.567 [23590] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:10:28.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 15:10:28.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425706,ok=425706,error=0, records=41
[INFO ] 2026-06-01 15:10:28.244 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21287/300s
[INFO ] 2026-06-01 15:10:36.571 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:10:37.572 [23609] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:10:41.684 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21300/300s
[INFO ] 2026-06-01 15:10:43.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 15:10:43.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425707,ok=425707,error=0, records=41
[WARN ] 2026-06-01 15:10:47.576 [23621] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23054/stat), No such file or directory
[INFO ] 2026-06-01 15:10:51.571 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:10:52.576 [23637] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:10:58.252 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-01 15:10:58.252 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425708,ok=425708,error=0, records=41
[INFO ] 2026-06-01 15:11:06.572 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:11:07.582 [23643] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:11:13.257 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 15:11:13.257 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425709,ok=425709,error=0, records=41
[INFO ] 2026-06-01 15:11:21.432 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21287/300s
[INFO ] 2026-06-01 15:11:21.573 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:11:22.587 [23666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:11:28.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 15:11:28.262 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425710,ok=425710,error=0, records=41
[INFO ] 2026-06-01 15:11:36.573 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:11:37.592 [23648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:11:38.885 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21296/300s
[INFO ] 2026-06-01 15:11:43.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 15:11:43.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425711,ok=425711,error=0, records=41
[INFO ] 2026-06-01 15:11:51.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:11:52.596 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:11:58.354 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 15:11:58.354 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425712,ok=425712,error=0, records=41
[INFO ] 2026-06-01 15:12:06.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:12:06.575 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21299/300s
[WARN ] 2026-06-01 15:12:07.603 [23621] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:12:13.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 15:12:13.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425713,ok=425713,error=0, records=41
[INFO ] 2026-06-01 15:12:21.575 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:12:22.607 [23621] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:12:28.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 15:12:28.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425714,ok=425714,error=0, records=41
[INFO ] 2026-06-01 15:12:36.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:12:37.617 [23660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:12:43.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 15:12:43.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425715,ok=425715,error=0, records=41
[INFO ] 2026-06-01 15:12:44.861 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21297/300s
[INFO ] 2026-06-01 15:12:46.963 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21297/300s
[INFO ] 2026-06-01 15:12:51.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:12:52.633 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:12:54.327 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21297/300s
[INFO ] 2026-06-01 15:12:58.419 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 15:12:58.419 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425716,ok=425716,error=0, records=41
[INFO ] 2026-06-01 15:13:06.577 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:13:07.647 [23621] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:13:12.346 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852096},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:13:12.502 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:13:12.502 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 15:13:12.503 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:13:12.503 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:13:12.503 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:13:12.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:13:12.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:13:12.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:13:12.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:13:12.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:13:12.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:13:13.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 15:13:13.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425717,ok=425717,error=0, records=41
[WARN ] 2026-06-01 15:13:17.652 [23648] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23195/stat), No such file or directory
[INFO ] 2026-06-01 15:13:21.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:13:22.658 [23648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:13:28.431 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 15:13:28.431 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425718,ok=425718,error=0, records=41
[WARN ] 2026-06-01 15:13:32.663 [23648] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23195/stat), No such file or directory
[INFO ] 2026-06-01 15:13:36.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 15:13:36.578 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 15:13:37.664 [23660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:13:43.435 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 15:13:43.435 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425719,ok=425719,error=0, records=41
[WARN ] 2026-06-01 15:13:47.669 [23648] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23195/stat), No such file or directory
[INFO ] 2026-06-01 15:13:51.579 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:13:52.670 [23621] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:13:58.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 15:13:58.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425720,ok=425720,error=0, records=41
[INFO ] 2026-06-01 15:14:06.579 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:14:07.675 [23648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:14:13.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 15:14:13.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425721,ok=425721,error=0, records=41
[INFO ] 2026-06-01 15:14:21.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:14:22.681 [23660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:14:28.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 15:14:28.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425722,ok=425722,error=0, records=41
[INFO ] 2026-06-01 15:14:36.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:14:37.685 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:14:43.457 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 15:14:43.457 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425723,ok=425723,error=0, records=41
[INFO ] 2026-06-01 15:14:51.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:14:52.691 [23666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:14:58.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 15:14:58.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425724,ok=425724,error=0, records=41
[INFO ] 2026-06-01 15:15:01.171 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21301/300s
[INFO ] 2026-06-01 15:15:06.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:15:07.198 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21292/300s
[WARN ] 2026-06-01 15:15:07.699 [23648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:15:13.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10390, records=41
[INFO ] 2026-06-01 15:15:13.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425725,ok=425725,error=0, records=41
[INFO ] 2026-06-01 15:15:21.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:15:22.705 [23648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:15:28.541 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-01 15:15:28.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425726,ok=425726,error=0, records=41
[INFO ] 2026-06-01 15:15:28.541 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21288/300s
[INFO ] 2026-06-01 15:15:36.583 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:15:37.711 [23648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:15:41.690 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21301/300s
[INFO ] 2026-06-01 15:15:43.546 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 15:15:43.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425727,ok=425727,error=0, records=41
[INFO ] 2026-06-01 15:15:51.583 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:15:52.717 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:15:58.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 15:15:58.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425728,ok=425728,error=0, records=41
[INFO ] 2026-06-01 15:16:06.584 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:16:07.723 [23621] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:16:12.503 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17734/300s
[INFO ] 2026-06-01 15:16:12.504 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852004},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:16:12.681 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:16:12.681 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 15:16:12.681 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:16:12.681 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:16:12.681 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:16:12.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:16:12.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:16:12.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:16:12.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:16:12.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:16:12.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:16:13.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 15:16:13.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425729,ok=425729,error=0, records=41
[INFO ] 2026-06-01 15:16:21.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:16:21.652 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21288/300s
[WARN ] 2026-06-01 15:16:22.728 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:16:28.567 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 15:16:28.567 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425730,ok=425730,error=0, records=41
[INFO ] 2026-06-01 15:16:36.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:16:37.735 [23660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:16:38.970 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21297/300s
[INFO ] 2026-06-01 15:16:43.572 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 15:16:43.572 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425731,ok=425731,error=0, records=41
[INFO ] 2026-06-01 15:16:51.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:16:52.741 [23621] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:16:58.624 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-01 15:16:58.624 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425732,ok=425732,error=0, records=41
[INFO ] 2026-06-01 15:17:06.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:17:06.587 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21300/300s
[WARN ] 2026-06-01 15:17:07.746 [23666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:17:13.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 15:17:13.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425733,ok=425733,error=0, records=41
[INFO ] 2026-06-01 15:17:21.587 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:17:22.751 [23660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:17:28.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 15:17:28.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425734,ok=425734,error=0, records=41
[INFO ] 2026-06-01 15:17:36.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:17:37.756 [23660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:17:43.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-01 15:17:43.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425735,ok=425735,error=0, records=41
[INFO ] 2026-06-01 15:17:44.874 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21298/300s
[INFO ] 2026-06-01 15:17:46.976 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21298/300s
[INFO ] 2026-06-01 15:17:51.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:17:52.762 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:17:54.337 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21298/300s
[INFO ] 2026-06-01 15:17:58.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 15:17:58.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425736,ok=425736,error=0, records=41
[INFO ] 2026-06-01 15:18:06.589 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:18:07.767 [23660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:18:13.681 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 15:18:13.681 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425737,ok=425737,error=0, records=41
[INFO ] 2026-06-01 15:18:21.590 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:18:22.772 [23621] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:18:28.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 15:18:28.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425738,ok=425738,error=0, records=41
[INFO ] 2026-06-01 15:18:36.590 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:18:37.777 [23621] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:18:43.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 15:18:43.691 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425739,ok=425739,error=0, records=41
[INFO ] 2026-06-01 15:18:51.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:18:52.782 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:18:58.697 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 15:18:58.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425740,ok=425740,error=0, records=41
[INFO ] 2026-06-01 15:19:06.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:19:07.787 [23660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:19:12.682 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851928},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:19:12.857 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:19:12.857 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 15:19:12.857 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:19:12.857 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:19:12.857 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:19:12.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:19:12.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:19:12.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:19:12.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:19:12.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:19:12.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:19:13.701 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 15:19:13.702 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425741,ok=425741,error=0, records=41
[INFO ] 2026-06-01 15:19:21.592 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:19:22.793 [23648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:19:28.706 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 15:19:28.706 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425742,ok=425742,error=0, records=41
[INFO ] 2026-06-01 15:19:36.592 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:19:37.798 [23648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:19:43.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 15:19:43.711 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425743,ok=425743,error=0, records=41
[INFO ] 2026-06-01 15:19:51.593 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:19:52.804 [23648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:19:58.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 15:19:58.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425744,ok=425744,error=0, records=41
[INFO ] 2026-06-01 15:20:01.173 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21302/300s
[INFO ] 2026-06-01 15:20:06.594 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:20:07.308 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21293/300s
[WARN ] 2026-06-01 15:20:07.809 [24185] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:20:13.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 15:20:13.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425745,ok=425745,error=0, records=41
[INFO ] 2026-06-01 15:20:21.594 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:20:22.815 [24165] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:20:28.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 15:20:28.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425746,ok=425746,error=0, records=41
[INFO ] 2026-06-01 15:20:28.727 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21289/300s
[INFO ] 2026-06-01 15:20:36.595 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:20:37.820 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:20:41.696 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21302/300s
[INFO ] 2026-06-01 15:20:43.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10380, records=41
[INFO ] 2026-06-01 15:20:43.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425747,ok=425747,error=0, records=41
[INFO ] 2026-06-01 15:20:51.596 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:20:52.826 [24232] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:20:58.736 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-01 15:20:58.736 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425748,ok=425748,error=0, records=41
[INFO ] 2026-06-01 15:21:06.596 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:21:07.835 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:21:13.742 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 15:21:13.742 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425749,ok=425749,error=0, records=41
[INFO ] 2026-06-01 15:21:21.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:21:21.833 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21289/300s
[WARN ] 2026-06-01 15:21:22.839 [24218] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:21:28.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 15:21:28.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425750,ok=425750,error=0, records=41
[INFO ] 2026-06-01 15:21:36.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:21:37.845 [24218] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:21:39.021 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21298/300s
[INFO ] 2026-06-01 15:21:43.752 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 15:21:43.752 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425751,ok=425751,error=0, records=41
[INFO ] 2026-06-01 15:21:51.598 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:21:52.850 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:21:58.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-01 15:21:58.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425752,ok=425752,error=0, records=41
[INFO ] 2026-06-01 15:22:06.599 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:22:06.599 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21301/300s
[WARN ] 2026-06-01 15:22:07.855 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:22:12.857 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17735/300s
[INFO ] 2026-06-01 15:22:12.859 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851848},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:22:13.043 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:22:13.043 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 15:22:13.044 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:22:13.044 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:22:13.044 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:22:13.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:22:13.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:22:13.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:22:13.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:22:13.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:22:13.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:22:13.763 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 15:22:13.763 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425753,ok=425753,error=0, records=41
[INFO ] 2026-06-01 15:22:21.599 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:22:22.860 [24249] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:22:28.769 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 15:22:28.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425754,ok=425754,error=0, records=41
[INFO ] 2026-06-01 15:22:36.600 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:22:37.865 [24300] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:22:43.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 15:22:43.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425755,ok=425755,error=0, records=41
[INFO ] 2026-06-01 15:22:44.934 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21299/300s
[INFO ] 2026-06-01 15:22:47.036 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21299/300s
[INFO ] 2026-06-01 15:22:51.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:22:52.871 [24315] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:22:54.383 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21299/300s
[INFO ] 2026-06-01 15:22:58.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 15:22:58.782 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425756,ok=425756,error=0, records=41
[INFO ] 2026-06-01 15:23:06.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:23:07.875 [23702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:23:13.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 15:23:13.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425757,ok=425757,error=0, records=41
[INFO ] 2026-06-01 15:23:21.602 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:23:22.882 [24374] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:23:28.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 15:23:28.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425758,ok=425758,error=0, records=41
[INFO ] 2026-06-01 15:23:36.603 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 15:23:36.603 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 15:23:37.887 [24390] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:23:43.796 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 15:23:43.796 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425759,ok=425759,error=0, records=41
[INFO ] 2026-06-01 15:23:51.603 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:23:51.603 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 15:23:52.892 [24390] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:23:58.801 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 15:23:58.801 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425760,ok=425760,error=0, records=41
[INFO ] 2026-06-01 15:24:06.605 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:24:07.898 [24391] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:24:13.807 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 15:24:13.807 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425761,ok=425761,error=0, records=41
[INFO ] 2026-06-01 15:24:21.605 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:24:22.904 [24446] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:24:28.811 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 15:24:28.811 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425762,ok=425762,error=0, records=41
[INFO ] 2026-06-01 15:24:36.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:24:37.909 [24446] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:24:43.817 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 15:24:43.817 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425763,ok=425763,error=0, records=41
[INFO ] 2026-06-01 15:24:51.607 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:24:52.915 [24451] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:24:58.858 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 15:24:58.858 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425764,ok=425764,error=0, records=41
[INFO ] 2026-06-01 15:25:01.177 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21303/300s
[INFO ] 2026-06-01 15:25:06.607 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:25:07.421 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21294/300s
[WARN ] 2026-06-01 15:25:07.922 [24461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:25:13.046 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851772},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:25:13.219 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:25:13.219 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 15:25:13.220 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:25:13.220 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:25:13.220 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:25:13.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:25:13.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:25:13.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:25:13.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:25:13.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:25:13.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:25:13.862 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 15:25:13.862 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425765,ok=425765,error=0, records=41
[INFO ] 2026-06-01 15:25:21.608 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:25:22.926 [24481] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:25:28.872 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 15:25:28.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425766,ok=425766,error=0, records=41
[INFO ] 2026-06-01 15:25:28.872 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21290/300s
[INFO ] 2026-06-01 15:25:36.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:25:37.931 [24523] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:25:41.703 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21303/300s
[INFO ] 2026-06-01 15:25:43.879 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 15:25:43.879 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425767,ok=425767,error=0, records=41
[INFO ] 2026-06-01 15:25:51.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:25:52.935 [24516] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:25:58.885 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 15:25:58.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425768,ok=425768,error=0, records=41
[INFO ] 2026-06-01 15:26:06.610 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:26:07.940 [24516] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:26:13.897 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 15:26:13.897 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425769,ok=425769,error=0, records=41
[INFO ] 2026-06-01 15:26:21.611 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:26:22.012 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21290/300s
[WARN ] 2026-06-01 15:26:22.946 [24545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:26:28.903 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 15:26:28.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425770,ok=425770,error=0, records=41
[INFO ] 2026-06-01 15:26:36.611 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:26:37.951 [24574] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:26:39.077 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21299/300s
[INFO ] 2026-06-01 15:26:43.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 15:26:43.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425771,ok=425771,error=0, records=41
[INFO ] 2026-06-01 15:26:51.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:26:52.959 [24556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:26:58.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 15:26:58.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425772,ok=425772,error=0, records=41
[INFO ] 2026-06-01 15:27:06.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:27:06.612 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21302/300s
[WARN ] 2026-06-01 15:27:07.964 [24589] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:27:13.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 15:27:13.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425773,ok=425773,error=0, records=41
[INFO ] 2026-06-01 15:27:21.613 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:27:22.970 [24631] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:27:28.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 15:27:28.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425774,ok=425774,error=0, records=41
[INFO ] 2026-06-01 15:27:36.614 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:27:37.976 [24589] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:27:43.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 15:27:43.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425775,ok=425775,error=0, records=41
[INFO ] 2026-06-01 15:27:45.017 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21300/300s
[INFO ] 2026-06-01 15:27:47.119 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21300/300s
[INFO ] 2026-06-01 15:27:51.614 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:27:52.980 [24556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:27:54.448 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21300/300s
[INFO ] 2026-06-01 15:27:58.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 15:27:58.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425776,ok=425776,error=0, records=41
[INFO ] 2026-06-01 15:28:06.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:28:07.986 [24589] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:28:13.220 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17736/300s
[INFO ] 2026-06-01 15:28:13.221 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851692},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:28:13.367 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:28:13.368 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 15:28:13.368 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:28:13.368 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:28:13.368 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:28:13.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:28:13.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:28:13.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:28:13.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:28:13.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:28:13.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:28:14.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 15:28:14.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425777,ok=425777,error=0, records=41
[INFO ] 2026-06-01 15:28:21.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:28:22.990 [24645] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:28:29.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 15:28:29.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425778,ok=425778,error=0, records=41
[INFO ] 2026-06-01 15:28:36.616 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:28:37.995 [24556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:28:44.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 15:28:44.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425779,ok=425779,error=0, records=41
[INFO ] 2026-06-01 15:28:51.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:28:53.000 [24645] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:28:59.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 15:28:59.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425780,ok=425780,error=0, records=41
[INFO ] 2026-06-01 15:29:06.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:29:08.005 [24659] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:29:14.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 15:29:14.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425781,ok=425781,error=0, records=41
[INFO ] 2026-06-01 15:29:21.618 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:29:23.010 [24562] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:29:29.091 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 15:29:29.091 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425782,ok=425782,error=0, records=41
[INFO ] 2026-06-01 15:29:36.619 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:29:38.016 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:29:44.096 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 15:29:44.097 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425783,ok=425783,error=0, records=41
[INFO ] 2026-06-01 15:29:51.619 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:29:53.021 [24556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:29:59.102 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 15:29:59.102 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425784,ok=425784,error=0, records=41
[INFO ] 2026-06-01 15:30:01.180 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21304/300s
[INFO ] 2026-06-01 15:30:06.620 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:30:07.525 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21295/300s
[WARN ] 2026-06-01 15:30:08.026 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:30:14.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 15:30:14.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425785,ok=425785,error=0, records=41
[INFO ] 2026-06-01 15:30:21.620 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:30:23.032 [24802] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:30:29.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 15:30:29.114 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425786,ok=425786,error=0, records=41
[INFO ] 2026-06-01 15:30:29.114 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21291/300s
[INFO ] 2026-06-01 15:30:36.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:30:38.037 [24742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:30:41.710 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21304/300s
[INFO ] 2026-06-01 15:30:44.120 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 15:30:44.120 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425787,ok=425787,error=0, records=41
[INFO ] 2026-06-01 15:30:51.622 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:30:53.042 [24834] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:30:59.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 15:30:59.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425788,ok=425788,error=0, records=41
[INFO ] 2026-06-01 15:31:06.622 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:31:08.048 [24851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:31:13.369 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851612},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:31:13.520 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:31:13.520 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 15:31:13.520 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:31:13.520 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:31:13.520 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:31:13.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:31:13.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:31:13.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:31:13.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:31:13.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:31:13.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:31:14.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 15:31:14.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425789,ok=425789,error=0, records=41
[INFO ] 2026-06-01 15:31:21.623 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:31:22.193 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21291/300s
[WARN ] 2026-06-01 15:31:23.053 [24827] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:31:29.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 15:31:29.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425790,ok=425790,error=0, records=41
[INFO ] 2026-06-01 15:31:36.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:31:37.558 [24886] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:31:39.133 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21300/300s
[INFO ] 2026-06-01 15:31:44.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 15:31:44.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425791,ok=425791,error=0, records=41
[INFO ] 2026-06-01 15:31:51.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:31:52.562 [24827] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:31:59.151 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 15:31:59.151 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425792,ok=425792,error=0, records=41
[INFO ] 2026-06-01 15:32:06.625 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:32:06.625 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21303/300s
[WARN ] 2026-06-01 15:32:07.567 [24892] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:32:14.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 15:32:14.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425793,ok=425793,error=0, records=41
[INFO ] 2026-06-01 15:32:21.625 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:32:22.571 [24892] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:32:29.162 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 15:32:29.162 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425794,ok=425794,error=0, records=41
[INFO ] 2026-06-01 15:32:36.626 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:32:37.579 [24957] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:32:44.167 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 15:32:44.167 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425795,ok=425795,error=0, records=41
[INFO ] 2026-06-01 15:32:45.095 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21301/300s
[INFO ] 2026-06-01 15:32:47.197 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21301/300s
[INFO ] 2026-06-01 15:32:51.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:32:52.583 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:32:54.504 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21301/300s
[INFO ] 2026-06-01 15:32:59.173 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 15:32:59.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425796,ok=425796,error=0, records=41
[INFO ] 2026-06-01 15:33:06.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:33:07.589 [24958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:33:14.179 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 15:33:14.179 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425797,ok=425797,error=0, records=41
[INFO ] 2026-06-01 15:33:21.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:33:22.594 [25006] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:33:29.184 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 15:33:29.184 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425798,ok=425798,error=0, records=41
[INFO ] 2026-06-01 15:33:36.629 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 15:33:36.629 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 15:33:37.599 [25016] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:33:44.192 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 15:33:44.192 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425799,ok=425799,error=0, records=41
[INFO ] 2026-06-01 15:33:51.629 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:33:52.604 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:33:59.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 15:33:59.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425800,ok=425800,error=0, records=41
[INFO ] 2026-06-01 15:34:06.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:34:07.610 [25001] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:34:13.520 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17737/300s
[INFO ] 2026-06-01 15:34:13.522 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851528},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:34:13.695 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:34:13.695 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 15:34:13.695 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:34:13.696 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:34:13.696 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:34:13.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:34:13.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:34:13.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:34:13.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:34:13.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:34:13.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:34:14.203 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10384, records=41
[INFO ] 2026-06-01 15:34:14.203 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425801,ok=425801,error=0, records=41
[INFO ] 2026-06-01 15:34:21.631 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:34:22.616 [24986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:34:29.209 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 15:34:29.209 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425802,ok=425802,error=0, records=41
[INFO ] 2026-06-01 15:34:36.631 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:34:37.621 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:34:44.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-01 15:34:44.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425803,ok=425803,error=0, records=41
[INFO ] 2026-06-01 15:34:51.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:34:52.626 [24986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:34:59.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-01 15:34:59.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425804,ok=425804,error=0, records=41
[INFO ] 2026-06-01 15:35:01.184 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21305/300s
[INFO ] 2026-06-01 15:35:06.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:35:07.631 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21296/300s
[WARN ] 2026-06-01 15:35:07.632 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:35:14.228 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 15:35:14.228 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425805,ok=425805,error=0, records=41
[INFO ] 2026-06-01 15:35:21.633 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:35:22.637 [25001] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:35:29.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 15:35:29.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425806,ok=425806,error=0, records=41
[INFO ] 2026-06-01 15:35:29.233 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21292/300s
[INFO ] 2026-06-01 15:35:36.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:35:37.643 [25001] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:35:41.717 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21305/300s
[INFO ] 2026-06-01 15:35:44.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 15:35:44.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425807,ok=425807,error=0, records=41
[INFO ] 2026-06-01 15:35:51.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:35:52.648 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:35:59.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 15:35:59.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425808,ok=425808,error=0, records=41
[INFO ] 2026-06-01 15:36:06.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:36:07.654 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:36:14.250 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 15:36:14.250 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425809,ok=425809,error=0, records=41
[INFO ] 2026-06-01 15:36:21.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:36:22.378 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21292/300s
[WARN ] 2026-06-01 15:36:22.659 [24958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:36:29.284 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 15:36:29.284 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425810,ok=425810,error=0, records=41
[INFO ] 2026-06-01 15:36:36.636 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:36:37.664 [24958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:36:39.188 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21301/300s
[INFO ] 2026-06-01 15:36:44.290 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 15:36:44.290 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425811,ok=425811,error=0, records=41
[INFO ] 2026-06-01 15:36:51.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:36:52.668 [25016] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:36:59.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 15:36:59.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425812,ok=425812,error=0, records=41
[INFO ] 2026-06-01 15:37:06.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:37:06.637 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21304/300s
[WARN ] 2026-06-01 15:37:07.673 [25001] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:37:13.697 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851452},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:37:13.854 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:37:13.854 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 15:37:13.854 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:37:13.855 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:37:13.855 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:37:13.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:37:13.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:37:13.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:37:13.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:37:13.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:37:13.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:37:14.301 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 15:37:14.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425813,ok=425813,error=0, records=41
[INFO ] 2026-06-01 15:37:21.638 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:37:22.677 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:37:29.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 15:37:29.306 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425814,ok=425814,error=0, records=41
[INFO ] 2026-06-01 15:37:36.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:37:37.681 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:37:44.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 15:37:44.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425815,ok=425815,error=0, records=41
[INFO ] 2026-06-01 15:37:45.167 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21302/300s
[INFO ] 2026-06-01 15:37:47.268 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21302/300s
[INFO ] 2026-06-01 15:37:51.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:37:52.686 [24958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:37:54.574 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21302/300s
[INFO ] 2026-06-01 15:37:59.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 15:37:59.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425816,ok=425816,error=0, records=41
[INFO ] 2026-06-01 15:38:06.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:38:07.690 [24986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:38:14.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 15:38:14.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425817,ok=425817,error=0, records=41
[INFO ] 2026-06-01 15:38:21.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:38:22.696 [24958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:38:29.332 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 15:38:29.332 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425818,ok=425818,error=0, records=41
[INFO ] 2026-06-01 15:38:36.641 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:38:37.701 [25001] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:38:44.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 15:38:44.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425819,ok=425819,error=0, records=41
[INFO ] 2026-06-01 15:38:51.642 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:38:51.642 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 15:38:52.706 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:38:59.419 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 15:38:59.419 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425820,ok=425820,error=0, records=41
[INFO ] 2026-06-01 15:39:06.645 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:39:07.711 [24986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:39:14.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 15:39:14.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425821,ok=425821,error=0, records=41
[INFO ] 2026-06-01 15:39:21.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:39:22.717 [25001] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:39:29.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 15:39:29.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425822,ok=425822,error=0, records=41
[INFO ] 2026-06-01 15:39:36.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:39:37.722 [25016] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:39:44.439 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 15:39:44.439 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425823,ok=425823,error=0, records=41
[INFO ] 2026-06-01 15:39:51.647 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:39:52.727 [24986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:39:59.445 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 15:39:59.445 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425824,ok=425824,error=0, records=41
[INFO ] 2026-06-01 15:40:01.187 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21306/300s
[INFO ] 2026-06-01 15:40:06.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:40:07.732 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21297/300s
[WARN ] 2026-06-01 15:40:07.733 [24986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:40:13.855 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17738/300s
[INFO ] 2026-06-01 15:40:13.856 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851372},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:40:14.019 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:40:14.019 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 15:40:14.019 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:40:14.019 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:40:14.019 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:40:14.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:40:14.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:40:14.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:40:14.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:40:14.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:40:14.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:40:14.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 15:40:14.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425825,ok=425825,error=0, records=41
[INFO ] 2026-06-01 15:40:21.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:40:22.738 [25001] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:40:29.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 15:40:29.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425826,ok=425826,error=0, records=41
[INFO ] 2026-06-01 15:40:29.455 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21293/300s
[INFO ] 2026-06-01 15:40:36.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:40:37.743 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:40:41.724 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21306/300s
[INFO ] 2026-06-01 15:40:44.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 15:40:44.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425827,ok=425827,error=0, records=41
[INFO ] 2026-06-01 15:40:51.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:40:52.749 [24958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:40:59.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 15:40:59.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425828,ok=425828,error=0, records=41
[INFO ] 2026-06-01 15:41:06.650 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:41:07.754 [24958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:41:14.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 15:41:14.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425829,ok=425829,error=0, records=41
[INFO ] 2026-06-01 15:41:21.650 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:41:22.563 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21293/300s
[WARN ] 2026-06-01 15:41:22.759 [24958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:41:29.480 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 15:41:29.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425830,ok=425830,error=0, records=41
[INFO ] 2026-06-01 15:41:36.651 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:41:37.763 [24958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:41:39.250 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21302/300s
[INFO ] 2026-06-01 15:41:44.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 15:41:44.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425831,ok=425831,error=0, records=41
[INFO ] 2026-06-01 15:41:51.652 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:41:52.768 [24958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:41:59.491 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 15:41:59.491 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425832,ok=425832,error=0, records=41
[INFO ] 2026-06-01 15:42:06.652 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:42:06.652 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21305/300s
[WARN ] 2026-06-01 15:42:07.773 [25016] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:42:14.497 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 15:42:14.497 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425833,ok=425833,error=0, records=41
[INFO ] 2026-06-01 15:42:21.653 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:42:22.777 [24986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:42:29.502 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 15:42:29.502 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425834,ok=425834,error=0, records=41
[INFO ] 2026-06-01 15:42:36.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:42:37.782 [25001] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:42:44.508 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 15:42:44.508 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425835,ok=425835,error=0, records=41
[INFO ] 2026-06-01 15:42:45.228 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21303/300s
[INFO ] 2026-06-01 15:42:47.330 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21303/300s
[INFO ] 2026-06-01 15:42:51.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:42:52.788 [24958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:42:54.638 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21303/300s
[INFO ] 2026-06-01 15:42:59.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 15:42:59.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425836,ok=425836,error=0, records=41
[INFO ] 2026-06-01 15:43:06.655 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:43:07.793 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:43:14.021 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851292},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:43:14.187 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:43:14.187 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 15:43:14.187 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:43:14.187 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:43:14.187 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:43:14.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:43:14.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:43:14.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:43:14.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:43:14.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:43:14.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:43:14.521 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 15:43:14.521 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425837,ok=425837,error=0, records=41
[INFO ] 2026-06-01 15:43:21.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:43:22.797 [24963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:43:29.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 15:43:29.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425838,ok=425838,error=0, records=41
[INFO ] 2026-06-01 15:43:36.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 15:43:36.656 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 15:43:37.803 [24986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:43:44.538 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 15:43:44.538 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425839,ok=425839,error=0, records=41
[INFO ] 2026-06-01 15:43:51.657 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:43:52.808 [25550] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:43:59.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 15:43:59.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425840,ok=425840,error=0, records=41
[INFO ] 2026-06-01 15:44:06.658 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:44:07.814 [25566] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:44:14.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 15:44:14.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425841,ok=425841,error=0, records=41
[INFO ] 2026-06-01 15:44:21.659 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:44:22.820 [25593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:44:29.560 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 15:44:29.560 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425842,ok=425842,error=0, records=41
[INFO ] 2026-06-01 15:44:36.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:44:37.825 [25550] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:44:44.565 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 15:44:44.565 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425843,ok=425843,error=0, records=41
[INFO ] 2026-06-01 15:44:51.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:44:52.830 [25550] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:44:59.572 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 15:44:59.572 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425844,ok=425844,error=0, records=41
[INFO ] 2026-06-01 15:45:01.191 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21307/300s
[INFO ] 2026-06-01 15:45:06.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:45:07.836 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21298/300s
[WARN ] 2026-06-01 15:45:07.836 [25635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:45:14.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 15:45:14.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425845,ok=425845,error=0, records=41
[INFO ] 2026-06-01 15:45:21.662 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:45:22.841 [25607] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:45:29.582 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 15:45:29.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425846,ok=425846,error=0, records=41
[INFO ] 2026-06-01 15:45:29.582 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21294/300s
[INFO ] 2026-06-01 15:45:36.662 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:45:37.847 [25635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:45:41.731 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21307/300s
[INFO ] 2026-06-01 15:45:44.587 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 15:45:44.587 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425847,ok=425847,error=0, records=41
[INFO ] 2026-06-01 15:45:51.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:45:52.853 [25607] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:45:59.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 15:45:59.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425848,ok=425848,error=0, records=41
[INFO ] 2026-06-01 15:46:06.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:46:07.858 [25673] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:46:14.188 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17739/300s
[INFO ] 2026-06-01 15:46:14.189 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851216},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:46:14.346 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:46:14.346 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 15:46:14.346 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:46:14.346 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:46:14.346 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:46:14.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:46:14.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:46:14.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:46:14.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:46:14.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:46:14.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:46:14.599 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 15:46:14.599 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425849,ok=425849,error=0, records=41
[INFO ] 2026-06-01 15:46:21.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:46:22.745 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21294/300s
[WARN ] 2026-06-01 15:46:22.862 [25635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:46:29.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 15:46:29.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425850,ok=425850,error=0, records=41
[INFO ] 2026-06-01 15:46:36.665 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:46:37.866 [25659] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:46:39.308 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21303/300s
[INFO ] 2026-06-01 15:46:44.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 15:46:44.612 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425851,ok=425851,error=0, records=41
[INFO ] 2026-06-01 15:46:51.666 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:46:52.871 [25687] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:46:59.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 15:46:59.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425852,ok=425852,error=0, records=41
[INFO ] 2026-06-01 15:47:06.666 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:47:06.666 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21306/300s
[WARN ] 2026-06-01 15:47:07.875 [25687] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:47:14.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 15:47:14.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425853,ok=425853,error=0, records=41
[INFO ] 2026-06-01 15:47:21.667 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:47:22.880 [25702] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:47:29.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 15:47:29.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425854,ok=425854,error=0, records=41
[INFO ] 2026-06-01 15:47:36.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:47:37.884 [25762] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:47:44.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 15:47:44.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425855,ok=425855,error=0, records=41
[INFO ] 2026-06-01 15:47:45.296 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21304/300s
[INFO ] 2026-06-01 15:47:47.398 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21304/300s
[INFO ] 2026-06-01 15:47:51.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:47:52.890 [25799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:47:54.704 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21304/300s
[INFO ] 2026-06-01 15:47:59.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 15:47:59.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425856,ok=425856,error=0, records=41
[INFO ] 2026-06-01 15:48:06.669 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:48:07.895 [25811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:48:14.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 15:48:14.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425857,ok=425857,error=0, records=41
[INFO ] 2026-06-01 15:48:21.670 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:48:22.900 [25788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:48:29.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 15:48:29.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425858,ok=425858,error=0, records=41
[INFO ] 2026-06-01 15:48:36.670 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:48:37.905 [25843] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:48:44.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 15:48:44.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425859,ok=425859,error=0, records=41
[INFO ] 2026-06-01 15:48:51.671 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:48:52.911 [25867] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:48:59.674 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 15:48:59.674 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425860,ok=425860,error=0, records=41
[INFO ] 2026-06-01 15:49:06.672 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:49:07.917 [25883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:49:14.347 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851128},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:49:14.506 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:49:14.506 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 15:49:14.506 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:49:14.506 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:49:14.506 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:49:14.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:49:14.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:49:14.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:49:14.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:49:14.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:49:14.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:49:14.686 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 15:49:14.686 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425861,ok=425861,error=0, records=41
[INFO ] 2026-06-01 15:49:21.672 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:49:22.922 [25901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:49:29.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 15:49:29.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425862,ok=425862,error=0, records=41
[INFO ] 2026-06-01 15:49:36.673 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:49:37.928 [25913] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:49:44.697 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 15:49:44.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425863,ok=425863,error=0, records=41
[INFO ] 2026-06-01 15:49:51.673 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:49:52.935 [25888] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:49:59.703 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 15:49:59.703 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425864,ok=425864,error=0, records=41
[INFO ] 2026-06-01 15:50:01.194 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21308/300s
[INFO ] 2026-06-01 15:50:06.674 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:50:07.940 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21299/300s
[WARN ] 2026-06-01 15:50:07.941 [25888] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:50:14.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 15:50:14.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425865,ok=425865,error=0, records=41
[INFO ] 2026-06-01 15:50:21.675 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:50:22.949 [25951] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:50:29.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 15:50:29.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425866,ok=425866,error=0, records=41
[INFO ] 2026-06-01 15:50:29.715 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21295/300s
[INFO ] 2026-06-01 15:50:36.675 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:50:37.953 [25923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:50:41.738 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21308/300s
[INFO ] 2026-06-01 15:50:44.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 15:50:44.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425867,ok=425867,error=0, records=41
[INFO ] 2026-06-01 15:50:51.676 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:50:52.959 [25965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:50:59.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 15:50:59.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425868,ok=425868,error=0, records=41
[INFO ] 2026-06-01 15:51:06.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:51:07.963 [25939] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:51:14.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 15:51:14.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425869,ok=425869,error=0, records=41
[INFO ] 2026-06-01 15:51:21.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:51:22.930 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21295/300s
[WARN ] 2026-06-01 15:51:22.967 [25939] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:51:29.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 15:51:29.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425870,ok=425870,error=0, records=41
[INFO ] 2026-06-01 15:51:36.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:51:37.972 [25939] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:51:39.365 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21304/300s
[INFO ] 2026-06-01 15:51:44.752 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 15:51:44.752 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425871,ok=425871,error=0, records=41
[INFO ] 2026-06-01 15:51:51.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:51:52.978 [25966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:51:59.757 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 15:51:59.757 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425872,ok=425872,error=0, records=41
[INFO ] 2026-06-01 15:52:06.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:52:06.679 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21307/300s
[WARN ] 2026-06-01 15:52:07.982 [25965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:52:14.506 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17740/300s
[INFO ] 2026-06-01 15:52:14.508 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851052},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:52:14.662 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:52:14.662 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 15:52:14.662 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:52:14.663 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:52:14.663 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:52:14.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:52:14.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:52:14.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:52:14.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:52:14.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:52:14.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:52:14.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 15:52:14.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425873,ok=425873,error=0, records=41
[INFO ] 2026-06-01 15:52:21.680 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:52:22.986 [25965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:52:29.772 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 15:52:29.772 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425874,ok=425874,error=0, records=41
[INFO ] 2026-06-01 15:52:36.680 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:52:37.992 [26051] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:52:44.778 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 15:52:44.778 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425875,ok=425875,error=0, records=41
[INFO ] 2026-06-01 15:52:45.368 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21305/300s
[INFO ] 2026-06-01 15:52:47.470 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21305/300s
[INFO ] 2026-06-01 15:52:51.681 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:52:52.996 [26080] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:52:54.776 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21305/300s
[INFO ] 2026-06-01 15:52:59.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 15:52:59.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425876,ok=425876,error=0, records=41
[INFO ] 2026-06-01 15:53:06.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:53:08.003 [26108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:53:14.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 15:53:14.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425877,ok=425877,error=0, records=41
[INFO ] 2026-06-01 15:53:21.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:53:23.008 [26108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:53:29.796 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 15:53:29.796 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425878,ok=425878,error=0, records=41
[INFO ] 2026-06-01 15:53:36.683 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 15:53:36.683 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 15:53:38.013 [26150] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:53:44.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 15:53:44.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425879,ok=425879,error=0, records=41
[INFO ] 2026-06-01 15:53:51.684 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:53:51.684 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 15:53:53.019 [26136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:53:59.820 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 15:53:59.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425880,ok=425880,error=0, records=41
[INFO ] 2026-06-01 15:54:06.685 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:54:08.024 [26094] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:54:14.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 15:54:14.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425881,ok=425881,error=0, records=41
[INFO ] 2026-06-01 15:54:21.686 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:54:23.029 [26150] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:54:29.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 15:54:29.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425882,ok=425882,error=0, records=41
[INFO ] 2026-06-01 15:54:36.687 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:54:38.035 [26178] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:54:44.884 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10142, records=41
[INFO ] 2026-06-01 15:54:44.884 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425883,ok=425883,error=0, records=41
[INFO ] 2026-06-01 15:54:51.687 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:54:53.040 [26150] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:54:59.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10144, records=41
[INFO ] 2026-06-01 15:54:59.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425884,ok=425884,error=0, records=41
[INFO ] 2026-06-01 15:55:01.198 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21309/300s
[INFO ] 2026-06-01 15:55:06.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:55:08.046 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21300/300s
[WARN ] 2026-06-01 15:55:08.046 [26210] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:55:14.664 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850976},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:55:14.840 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:55:14.840 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 15:55:14.840 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:55:14.840 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:55:14.840 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:55:14.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:55:14.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:55:14.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:55:14.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:55:14.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:55:14.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:55:14.896 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-01 15:55:14.896 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425885,ok=425885,error=0, records=41
[INFO ] 2026-06-01 15:55:21.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:55:23.051 [26255] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:55:29.901 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 15:55:29.901 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425886,ok=425886,error=0, records=41
[INFO ] 2026-06-01 15:55:29.901 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21296/300s
[INFO ] 2026-06-01 15:55:36.689 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:55:37.556 [26272] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:55:41.745 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21309/300s
[INFO ] 2026-06-01 15:55:44.906 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 15:55:44.906 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425887,ok=425887,error=0, records=41
[INFO ] 2026-06-01 15:55:51.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:55:52.561 [26290] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:55:59.911 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 15:55:59.911 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425888,ok=425888,error=0, records=41
[INFO ] 2026-06-01 15:56:06.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:56:07.567 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:56:14.916 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 15:56:14.916 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425889,ok=425889,error=0, records=41
[INFO ] 2026-06-01 15:56:21.691 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:56:22.573 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:56:23.110 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21296/300s
[INFO ] 2026-06-01 15:56:29.921 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 15:56:29.921 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425890,ok=425890,error=0, records=41
[INFO ] 2026-06-01 15:56:36.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:56:37.577 [26339] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:56:39.423 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21305/300s
[INFO ] 2026-06-01 15:56:44.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 15:56:44.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425891,ok=425891,error=0, records=41
[INFO ] 2026-06-01 15:56:51.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:56:52.583 [26362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:56:59.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 15:56:59.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425892,ok=425892,error=0, records=41
[INFO ] 2026-06-01 15:57:06.693 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 15:57:06.693 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21308/300s
[WARN ] 2026-06-01 15:57:07.589 [26363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:57:14.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 15:57:14.950 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425893,ok=425893,error=0, records=41
[INFO ] 2026-06-01 15:57:21.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:57:22.595 [26381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:57:30.022 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 15:57:30.022 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425894,ok=425894,error=0, records=41
[INFO ] 2026-06-01 15:57:36.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:57:37.599 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:57:45.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 15:57:45.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425895,ok=425895,error=0, records=41
[INFO ] 2026-06-01 15:57:45.432 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21306/300s
[INFO ] 2026-06-01 15:57:47.534 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21306/300s
[INFO ] 2026-06-01 15:57:51.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:57:52.604 [26396] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:57:54.841 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21306/300s
[INFO ] 2026-06-01 15:58:00.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 15:58:00.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425896,ok=425896,error=0, records=41
[INFO ] 2026-06-01 15:58:06.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:58:07.610 [26381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:58:14.840 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17741/300s
[INFO ] 2026-06-01 15:58:14.842 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850900},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 15:58:15.012 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 15:58:15.012 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 15:58:15.012 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 15:58:15.013 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 15:58:15.013 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 15:58:15.041 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 15:58:15.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425897,ok=425897,error=0, records=41
[INFO ] 2026-06-01 15:58:15.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:58:15.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 15:58:15.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:58:15.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 15:58:15.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:58:15.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 15:58:21.696 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:58:22.615 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:58:30.046 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-01 15:58:30.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425898,ok=425898,error=0, records=41
[INFO ] 2026-06-01 15:58:36.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:58:37.620 [26363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:58:45.052 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 15:58:45.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425899,ok=425899,error=0, records=41
[INFO ] 2026-06-01 15:58:51.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:58:52.625 [26363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:59:00.057 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 15:59:00.057 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425900,ok=425900,error=0, records=41
[INFO ] 2026-06-01 15:59:06.698 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:59:07.631 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:59:15.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-01 15:59:15.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425901,ok=425901,error=0, records=41
[INFO ] 2026-06-01 15:59:21.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:59:22.636 [26406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:59:30.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 15:59:30.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425902,ok=425902,error=0, records=41
[INFO ] 2026-06-01 15:59:36.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:59:37.641 [26381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 15:59:45.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 15:59:45.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425903,ok=425903,error=0, records=41
[INFO ] 2026-06-01 15:59:51.700 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 15:59:52.646 [26406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:00:00.080 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-01 16:00:00.080 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425904,ok=425904,error=0, records=41
[INFO ] 2026-06-01 16:00:01.202 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21310/300s
[INFO ] 2026-06-01 16:00:06.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:00:07.652 [26396] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:00:08.152 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21301/300s
[INFO ] 2026-06-01 16:00:15.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 16:00:15.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425905,ok=425905,error=0, records=41
[INFO ] 2026-06-01 16:00:21.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:00:22.657 [26396] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:00:30.092 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 16:00:30.092 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425906,ok=425906,error=0, records=41
[INFO ] 2026-06-01 16:00:30.092 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21297/300s
[INFO ] 2026-06-01 16:00:36.702 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:00:37.663 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:00:41.752 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21310/300s
[INFO ] 2026-06-01 16:00:45.098 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 16:00:45.098 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425907,ok=425907,error=0, records=41
[INFO ] 2026-06-01 16:00:51.702 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:00:52.667 [26363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:01:00.104 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 16:01:00.104 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425908,ok=425908,error=0, records=41
[INFO ] 2026-06-01 16:01:06.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:01:07.671 [26363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:01:15.014 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850808},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:01:15.110 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 16:01:15.110 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425909,ok=425909,error=0, records=41
[INFO ] 2026-06-01 16:01:15.184 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:01:15.184 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 16:01:15.184 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:01:15.184 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:01:15.184 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:01:15.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:01:15.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:01:15.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:01:15.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:01:15.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:01:15.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:01:21.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:01:22.676 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:01:23.294 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21297/300s
[INFO ] 2026-06-01 16:01:30.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 16:01:30.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425910,ok=425910,error=0, records=41
[INFO ] 2026-06-01 16:01:36.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:01:37.681 [26363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:01:39.481 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21306/300s
[INFO ] 2026-06-01 16:01:45.191 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 16:01:45.192 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425911,ok=425911,error=0, records=41
[INFO ] 2026-06-01 16:01:51.705 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:01:52.687 [26381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:02:00.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 16:02:00.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425912,ok=425912,error=0, records=41
[INFO ] 2026-06-01 16:02:06.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:02:06.706 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21309/300s
[WARN ] 2026-06-01 16:02:07.692 [26406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:02:15.202 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 16:02:15.202 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425913,ok=425913,error=0, records=41
[INFO ] 2026-06-01 16:02:21.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:02:22.696 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:02:30.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 16:02:30.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425914,ok=425914,error=0, records=41
[INFO ] 2026-06-01 16:02:36.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:02:37.701 [26381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:02:45.218 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 16:02:45.218 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425915,ok=425915,error=0, records=41
[INFO ] 2026-06-01 16:02:45.510 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21307/300s
[INFO ] 2026-06-01 16:02:47.612 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21307/300s
[INFO ] 2026-06-01 16:02:51.708 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:02:52.706 [26406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:02:54.918 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21307/300s
[INFO ] 2026-06-01 16:03:00.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 16:03:00.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425916,ok=425916,error=0, records=41
[INFO ] 2026-06-01 16:03:06.708 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:03:07.712 [26381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:03:15.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 16:03:15.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425917,ok=425917,error=0, records=41
[INFO ] 2026-06-01 16:03:21.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:03:22.718 [26381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:03:30.236 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 16:03:30.236 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425918,ok=425918,error=0, records=41
[INFO ] 2026-06-01 16:03:36.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 16:03:36.710 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 16:03:37.723 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:03:45.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 16:03:45.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425919,ok=425919,error=0, records=41
[INFO ] 2026-06-01 16:03:51.710 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:03:52.728 [26396] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:04:00.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 16:04:00.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425920,ok=425920,error=0, records=41
[INFO ] 2026-06-01 16:04:06.711 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:04:07.735 [26396] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:04:15.184 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17742/300s
[INFO ] 2026-06-01 16:04:15.186 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850732},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:04:15.255 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 16:04:15.255 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425921,ok=425921,error=0, records=41
[INFO ] 2026-06-01 16:04:15.342 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:04:15.342 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 16:04:15.342 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:04:15.342 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:04:15.342 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:04:15.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:04:15.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:04:15.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:04:15.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:04:15.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:04:15.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:04:21.712 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:04:22.743 [26381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:04:30.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 16:04:30.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425922,ok=425922,error=0, records=41
[INFO ] 2026-06-01 16:04:36.712 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:04:37.747 [26406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:04:45.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10137, records=41
[INFO ] 2026-06-01 16:04:45.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425923,ok=425923,error=0, records=41
[INFO ] 2026-06-01 16:04:51.713 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:04:52.753 [26406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:05:00.273 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-01 16:05:00.273 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425924,ok=425924,error=0, records=41
[INFO ] 2026-06-01 16:05:01.205 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21311/300s
[INFO ] 2026-06-01 16:05:06.713 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:05:07.758 [26406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:05:08.257 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21302/300s
[INFO ] 2026-06-01 16:05:15.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10381, records=41
[INFO ] 2026-06-01 16:05:15.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425925,ok=425925,error=0, records=41
[INFO ] 2026-06-01 16:05:21.714 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:05:22.762 [26363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:05:30.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 16:05:30.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425926,ok=425926,error=0, records=41
[INFO ] 2026-06-01 16:05:30.285 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21298/300s
[INFO ] 2026-06-01 16:05:36.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:05:37.767 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:05:41.759 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21311/300s
[INFO ] 2026-06-01 16:05:45.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 16:05:45.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425927,ok=425927,error=0, records=41
[INFO ] 2026-06-01 16:05:51.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:05:52.774 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:06:00.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 16:06:00.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425928,ok=425928,error=0, records=41
[INFO ] 2026-06-01 16:06:06.716 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:06:07.780 [26406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:06:15.305 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 16:06:15.305 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425929,ok=425929,error=0, records=41
[INFO ] 2026-06-01 16:06:21.717 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:06:22.784 [26396] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:06:23.479 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21298/300s
[INFO ] 2026-06-01 16:06:30.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 16:06:30.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425930,ok=425930,error=0, records=41
[INFO ] 2026-06-01 16:06:36.717 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:06:37.794 [26396] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:06:39.541 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21307/300s
[INFO ] 2026-06-01 16:06:45.317 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 16:06:45.317 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425931,ok=425931,error=0, records=41
[INFO ] 2026-06-01 16:06:51.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:06:52.800 [26406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:07:00.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 16:07:00.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425932,ok=425932,error=0, records=41
[INFO ] 2026-06-01 16:07:06.719 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:07:06.719 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21310/300s
[WARN ] 2026-06-01 16:07:07.806 [26396] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:07:15.331 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 16:07:15.331 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425933,ok=425933,error=0, records=41
[INFO ] 2026-06-01 16:07:15.344 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850652},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:07:15.529 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:07:15.529 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 16:07:15.529 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:07:15.529 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:07:15.529 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:07:15.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:07:15.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:07:15.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:07:15.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:07:15.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:07:15.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:07:21.719 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:07:22.811 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:07:30.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 16:07:30.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425934,ok=425934,error=0, records=41
[INFO ] 2026-06-01 16:07:36.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:07:37.816 [26948] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:07:45.341 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 16:07:45.341 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425935,ok=425935,error=0, records=41
[INFO ] 2026-06-01 16:07:45.582 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21308/300s
[INFO ] 2026-06-01 16:07:47.684 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21308/300s
[INFO ] 2026-06-01 16:07:51.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:07:52.821 [26968] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:07:54.991 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21308/300s
[INFO ] 2026-06-01 16:08:00.346 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 16:08:00.346 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425936,ok=425936,error=0, records=41
[INFO ] 2026-06-01 16:08:06.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:08:07.826 [26968] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:08:15.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 16:08:15.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425937,ok=425937,error=0, records=41
[INFO ] 2026-06-01 16:08:21.722 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:08:22.832 [26982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:08:30.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-01 16:08:30.357 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425938,ok=425938,error=0, records=41
[INFO ] 2026-06-01 16:08:36.722 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:08:37.836 [27024] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:08:45.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-01 16:08:45.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425939,ok=425939,error=0, records=41
[INFO ] 2026-06-01 16:08:51.723 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:08:51.723 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 16:08:52.840 [26996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:09:00.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-01 16:09:00.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425940,ok=425940,error=0, records=41
[INFO ] 2026-06-01 16:09:06.724 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:09:07.847 [26982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:09:15.373 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 16:09:15.373 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425941,ok=425941,error=0, records=41
[INFO ] 2026-06-01 16:09:21.725 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:09:22.853 [26996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:09:30.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 16:09:30.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425942,ok=425942,error=0, records=41
[INFO ] 2026-06-01 16:09:36.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:09:37.858 [26996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:09:45.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 16:09:45.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425943,ok=425943,error=0, records=41
[INFO ] 2026-06-01 16:09:51.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:09:52.862 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:10:00.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 16:10:00.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425944,ok=425944,error=0, records=41
[INFO ] 2026-06-01 16:10:01.209 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21312/300s
[INFO ] 2026-06-01 16:10:06.727 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:10:07.867 [26293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:10:08.367 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21303/300s
[INFO ] 2026-06-01 16:10:15.401 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 16:10:15.401 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425945,ok=425945,error=0, records=41
[INFO ] 2026-06-01 16:10:15.530 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17743/300s
[INFO ] 2026-06-01 16:10:15.531 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850576},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:10:15.684 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:10:15.684 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 16:10:15.684 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:10:15.684 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:10:15.684 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:10:15.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:10:15.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:10:15.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:10:15.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:10:15.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:10:15.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:10:21.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:10:22.872 [27024] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:10:30.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 16:10:30.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425946,ok=425946,error=0, records=41
[INFO ] 2026-06-01 16:10:30.408 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21299/300s
[INFO ] 2026-06-01 16:10:36.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:10:37.878 [27089] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:10:41.766 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21312/300s
[INFO ] 2026-06-01 16:10:45.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 16:10:45.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425947,ok=425947,error=0, records=41
[INFO ] 2026-06-01 16:10:51.729 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:10:52.883 [27160] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:11:00.419 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 16:11:00.419 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425948,ok=425948,error=0, records=41
[INFO ] 2026-06-01 16:11:06.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:11:07.889 [27176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:11:15.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10470, records=41
[INFO ] 2026-06-01 16:11:15.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425949,ok=425949,error=0, records=41
[INFO ] 2026-06-01 16:11:21.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:11:22.894 [27148] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:11:23.664 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21299/300s
[INFO ] 2026-06-01 16:11:30.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10458, records=41
[INFO ] 2026-06-01 16:11:30.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425950,ok=425950,error=0, records=41
[INFO ] 2026-06-01 16:11:35.415 [27166] sic/src/linux_system_information_collector.cpp:1324: /bin/udevadm info --query=property --name=/dev/vda1
DEVLINKS=/dev/disk/by-id/virtio-j6c1gqesu0zk5kutcqel-part1 /dev/disk/by-path/pci-0000:00:04.0-part1 /dev/disk/by-path/virtio-pci-0000:00:04.0-part1 /dev/disk/by-uuid/87ba1103-a0d7-49ef-a8ae-6ce1d3fd2453
DEVNAME=/dev/vda1
DEVPATH=/devices/pci0000:00/0000:00:04.0/virtio1/block/vda/vda1
DEVTYPE=partition
ID_FS_TYPE=ext4
ID_FS_USAGE=filesystem
ID_FS_UUID=87ba1103-a0d7-49ef-a8ae-6ce1d3fd2453
ID_FS_UUID_ENC=87ba1103-a0d7-49ef-a8ae-6ce1d3fd2453
ID_FS_VERSION=1.0
ID_PART_ENTRY_DISK=253:0
ID_PART_ENTRY_FLAGS=0x80
ID_PART_ENTRY_NUMBER=1
ID_PART_ENTRY_OFFSET=2048
ID_PART_ENTRY_SCHEME=dos
ID_PART_ENTRY_SIZE=209713119
ID_PART_ENTRY_TYPE=0x83
ID_PART_TABLE_TYPE=dos
ID_PATH=pci-0000:00:04.0
ID_PATH_TAG=pci-0000_00_04_0
ID_SERIAL=j6c1gqesu0zk5kutcqel
MAJOR=253
MINOR=1
SUBSYSTEM=block
TAGS=:systemd:
USEC_INITIALIZED=25124
[INFO ] 2026-06-01 16:11:35.415 [27166] sic/src/linux_system_information_collector.cpp:1335: queryDevSerialId: {"/dev/vda1":"j6c1gqesu0zk5kutcqel"}
[INFO ] 2026-06-01 16:11:36.731 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:11:37.900 [27160] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:11:39.598 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21308/300s
[INFO ] 2026-06-01 16:11:45.443 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10448, records=41
[INFO ] 2026-06-01 16:11:45.443 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425951,ok=425951,error=0, records=41
[INFO ] 2026-06-01 16:11:51.731 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:11:52.906 [27166] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:12:00.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10487, records=41
[INFO ] 2026-06-01 16:12:00.448 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425952,ok=425952,error=0, records=41
[INFO ] 2026-06-01 16:12:06.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:12:06.732 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21311/300s
[WARN ] 2026-06-01 16:12:07.911 [27228] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:12:15.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-01 16:12:15.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425953,ok=425953,error=0, records=41
[INFO ] 2026-06-01 16:12:21.733 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:12:22.917 [27228] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:12:30.461 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 16:12:30.461 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425954,ok=425954,error=0, records=41
[INFO ] 2026-06-01 16:12:36.733 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:12:37.923 [27243] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:12:45.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 16:12:45.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425955,ok=425955,error=0, records=41
[INFO ] 2026-06-01 16:12:45.637 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21309/300s
[INFO ] 2026-06-01 16:12:47.738 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21309/300s
[INFO ] 2026-06-01 16:12:51.734 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:12:52.929 [27287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:12:55.043 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21309/300s
[INFO ] 2026-06-01 16:13:00.473 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-01 16:13:00.473 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425956,ok=425956,error=0, records=41
[INFO ] 2026-06-01 16:13:06.734 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:13:07.934 [27304] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:13:15.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-01 16:13:15.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425957,ok=425957,error=0, records=41
[INFO ] 2026-06-01 16:13:15.685 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850484},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:13:15.844 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:13:15.844 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 16:13:15.844 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:13:15.844 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:13:15.844 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:13:15.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:13:15.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:13:15.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:13:15.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:13:15.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:13:15.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:13:21.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:13:22.941 [27259] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:13:30.487 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 16:13:30.487 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425958,ok=425958,error=0, records=41
[INFO ] 2026-06-01 16:13:36.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 16:13:36.736 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 16:13:37.946 [27336] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:13:45.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 16:13:45.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425959,ok=425959,error=0, records=41
[INFO ] 2026-06-01 16:13:51.736 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:13:52.951 [27348] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:14:00.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 16:14:00.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425960,ok=425960,error=0, records=41
[INFO ] 2026-06-01 16:14:06.737 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:14:07.956 [27342] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:14:15.505 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 16:14:15.505 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425961,ok=425961,error=0, records=41
[INFO ] 2026-06-01 16:14:21.737 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:14:22.962 [27342] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:14:30.511 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 16:14:30.511 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425962,ok=425962,error=0, records=41
[INFO ] 2026-06-01 16:14:36.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:14:37.968 [27311] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:14:45.518 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 16:14:45.518 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425963,ok=425963,error=0, records=41
[INFO ] 2026-06-01 16:14:51.739 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:14:52.972 [27311] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:15:00.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 16:15:00.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425964,ok=425964,error=0, records=41
[INFO ] 2026-06-01 16:15:01.212 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21313/300s
[INFO ] 2026-06-01 16:15:06.739 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:15:07.978 [27348] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:15:08.478 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21304/300s
[INFO ] 2026-06-01 16:15:15.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 16:15:15.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425965,ok=425965,error=0, records=41
[INFO ] 2026-06-01 16:15:21.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:15:22.983 [27386] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:15:30.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 16:15:30.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425966,ok=425966,error=0, records=41
[INFO ] 2026-06-01 16:15:30.535 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21300/300s
[INFO ] 2026-06-01 16:15:36.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:15:37.987 [27428] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:15:41.772 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21313/300s
[INFO ] 2026-06-01 16:15:45.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 16:15:45.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425967,ok=425967,error=0, records=41
[INFO ] 2026-06-01 16:15:51.741 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:15:52.992 [27470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:16:00.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 16:16:00.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425968,ok=425968,error=0, records=41
[INFO ] 2026-06-01 16:16:06.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:16:07.996 [27386] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:16:15.550 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 16:16:15.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425969,ok=425969,error=0, records=41
[INFO ] 2026-06-01 16:16:15.844 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17744/300s
[INFO ] 2026-06-01 16:16:15.845 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850396},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:16:16.025 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:16:16.026 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 16:16:16.026 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:16:16.026 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:16:16.026 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:16:16.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:16:16.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:16:16.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:16:16.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:16:16.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:16:16.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:16:21.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:16:23.002 [27500] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:16:23.841 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21300/300s
[INFO ] 2026-06-01 16:16:30.555 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 16:16:30.555 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425970,ok=425970,error=0, records=41
[INFO ] 2026-06-01 16:16:36.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:16:38.007 [27514] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:16:39.649 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21309/300s
[INFO ] 2026-06-01 16:16:45.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 16:16:45.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425971,ok=425971,error=0, records=41
[INFO ] 2026-06-01 16:16:51.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:16:53.012 [27456] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:17:00.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 16:17:00.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425972,ok=425972,error=0, records=41
[INFO ] 2026-06-01 16:17:06.744 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:17:06.744 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21312/300s
[WARN ] 2026-06-01 16:17:08.018 [27456] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:17:15.616 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-01 16:17:15.616 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425973,ok=425973,error=0, records=41
[INFO ] 2026-06-01 16:17:21.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:17:23.024 [27484] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:17:30.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 16:17:30.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425974,ok=425974,error=0, records=41
[INFO ] 2026-06-01 16:17:36.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:17:38.029 [27386] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:17:45.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 16:17:45.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425975,ok=425975,error=0, records=41
[INFO ] 2026-06-01 16:17:45.673 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21310/300s
[INFO ] 2026-06-01 16:17:47.775 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21310/300s
[INFO ] 2026-06-01 16:17:51.746 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:17:53.034 [27554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:17:55.081 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21310/300s
[INFO ] 2026-06-01 16:18:00.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 16:18:00.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425976,ok=425976,error=0, records=41
[INFO ] 2026-06-01 16:18:06.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:18:08.039 [27554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:18:15.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 16:18:15.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425977,ok=425977,error=0, records=41
[INFO ] 2026-06-01 16:18:21.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:18:23.046 [27470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:18:30.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 16:18:30.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425978,ok=425978,error=0, records=41
[INFO ] 2026-06-01 16:18:36.748 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:18:38.052 [27627] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:18:45.657 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 16:18:45.657 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425979,ok=425979,error=0, records=41
[INFO ] 2026-06-01 16:18:51.749 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:18:52.557 [27596] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:19:00.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 16:19:00.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425980,ok=425980,error=0, records=41
[INFO ] 2026-06-01 16:19:06.749 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:19:07.563 [27654] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:19:15.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 16:19:15.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425981,ok=425981,error=0, records=41
[INFO ] 2026-06-01 16:19:16.027 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850324},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:19:16.208 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:19:16.208 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 16:19:16.208 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:19:16.208 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:19:16.208 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:19:16.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:19:16.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:19:16.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:19:16.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:19:16.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:19:16.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:19:21.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:19:22.568 [27678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:19:30.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 16:19:30.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425982,ok=425982,error=0, records=41
[INFO ] 2026-06-01 16:19:36.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:19:37.573 [27701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:19:45.678 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 16:19:45.678 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425983,ok=425983,error=0, records=41
[INFO ] 2026-06-01 16:19:51.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:19:52.578 [27677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:20:00.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 16:20:00.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425984,ok=425984,error=0, records=41
[INFO ] 2026-06-01 16:20:01.216 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21314/300s
[INFO ] 2026-06-01 16:20:06.752 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:20:07.584 [27737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:20:08.584 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21305/300s
[INFO ] 2026-06-01 16:20:15.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 16:20:15.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425985,ok=425985,error=0, records=41
[INFO ] 2026-06-01 16:20:21.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:20:22.590 [27755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:20:30.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 16:20:30.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425986,ok=425986,error=0, records=41
[INFO ] 2026-06-01 16:20:30.720 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21301/300s
[INFO ] 2026-06-01 16:20:36.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:20:37.595 [27777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:20:41.779 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21314/300s
[INFO ] 2026-06-01 16:20:45.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 16:20:45.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425987,ok=425987,error=0, records=41
[INFO ] 2026-06-01 16:20:51.754 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:20:52.600 [27772] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:21:00.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 16:21:00.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425988,ok=425988,error=0, records=41
[INFO ] 2026-06-01 16:21:06.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:21:07.606 [27748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:21:15.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 16:21:15.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425989,ok=425989,error=0, records=41
[INFO ] 2026-06-01 16:21:21.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:21:22.611 [27777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:21:24.023 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21301/300s
[INFO ] 2026-06-01 16:21:30.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-01 16:21:30.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425990,ok=425990,error=0, records=41
[INFO ] 2026-06-01 16:21:36.756 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:21:37.616 [27787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:21:39.702 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21310/300s
[INFO ] 2026-06-01 16:21:45.749 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 16:21:45.749 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425991,ok=425991,error=0, records=41
[INFO ] 2026-06-01 16:21:51.756 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:21:52.621 [27787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:22:00.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 16:22:00.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425992,ok=425992,error=0, records=41
[INFO ] 2026-06-01 16:22:06.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:22:06.757 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21313/300s
[WARN ] 2026-06-01 16:22:07.626 [27777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:22:15.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 16:22:15.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425993,ok=425993,error=0, records=41
[INFO ] 2026-06-01 16:22:16.208 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17745/300s
[INFO ] 2026-06-01 16:22:16.209 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850236},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:22:16.359 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:22:16.359 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 16:22:16.359 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:22:16.359 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:22:16.359 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:22:16.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:22:16.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:22:16.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:22:16.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:22:16.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:22:16.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 16:22:17.631 [27777] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23161/stat), No such file or directory
[INFO ] 2026-06-01 16:22:21.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:22:22.632 [27777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:22:30.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 16:22:30.767 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425994,ok=425994,error=0, records=41
[WARN ] 2026-06-01 16:22:32.636 [27777] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23161/stat), No such file or directory
[INFO ] 2026-06-01 16:22:36.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:22:37.637 [27748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:22:45.728 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21311/300s
[INFO ] 2026-06-01 16:22:45.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 16:22:45.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425995,ok=425995,error=0, records=41
[WARN ] 2026-06-01 16:22:47.642 [27787] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23161/stat), No such file or directory
[INFO ] 2026-06-01 16:22:47.830 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21311/300s
[INFO ] 2026-06-01 16:22:51.759 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:22:52.642 [27777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:22:55.136 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21311/300s
[INFO ] 2026-06-01 16:23:00.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 16:23:00.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425996,ok=425996,error=0, records=41
[INFO ] 2026-06-01 16:23:06.759 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:23:07.648 [27772] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:23:15.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 16:23:15.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425997,ok=425997,error=0, records=41
[INFO ] 2026-06-01 16:23:21.760 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:23:22.653 [27772] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:23:30.792 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 16:23:30.792 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425998,ok=425998,error=0, records=41
[INFO ] 2026-06-01 16:23:36.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 16:23:36.761 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 16:23:37.663 [27737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:23:45.798 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 16:23:45.798 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=425999,ok=425999,error=0, records=41
[INFO ] 2026-06-01 16:23:51.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:23:51.762 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 16:23:52.667 [27748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:24:00.807 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 16:24:00.807 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426000,ok=426000,error=0, records=41
[INFO ] 2026-06-01 16:24:06.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:24:07.673 [27737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:24:15.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 16:24:15.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426001,ok=426001,error=0, records=41
[INFO ] 2026-06-01 16:24:21.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:24:22.678 [27737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:24:30.818 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10143, records=41
[INFO ] 2026-06-01 16:24:30.818 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426002,ok=426002,error=0, records=41
[INFO ] 2026-06-01 16:24:36.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:24:37.683 [27748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:24:45.826 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 16:24:45.826 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426003,ok=426003,error=0, records=41
[INFO ] 2026-06-01 16:24:51.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:24:52.688 [27748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:25:00.833 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-01 16:25:00.833 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426004,ok=426004,error=0, records=41
[INFO ] 2026-06-01 16:25:01.219 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21315/300s
[INFO ] 2026-06-01 16:25:06.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:25:07.693 [27787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:25:08.693 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21306/300s
[INFO ] 2026-06-01 16:25:15.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 16:25:15.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426005,ok=426005,error=0, records=41
[INFO ] 2026-06-01 16:25:16.361 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850160},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:25:16.540 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:25:16.540 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 16:25:16.541 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:25:16.541 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:25:16.541 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:25:16.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:25:16.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:25:16.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:25:16.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:25:16.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:25:16.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:25:21.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:25:22.700 [27777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:25:30.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 16:25:30.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426006,ok=426006,error=0, records=41
[INFO ] 2026-06-01 16:25:30.847 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21302/300s
[INFO ] 2026-06-01 16:25:36.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:25:37.704 [27787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:25:41.785 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21315/300s
[INFO ] 2026-06-01 16:25:45.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 16:25:45.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426007,ok=426007,error=0, records=41
[INFO ] 2026-06-01 16:25:51.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:25:52.710 [27787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:26:00.860 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 16:26:00.860 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426008,ok=426008,error=0, records=41
[INFO ] 2026-06-01 16:26:06.768 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:26:07.715 [27777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:26:15.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 16:26:15.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426009,ok=426009,error=0, records=41
[INFO ] 2026-06-01 16:26:21.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:26:22.720 [27777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:26:24.203 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21302/300s
[INFO ] 2026-06-01 16:26:30.871 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-01 16:26:30.871 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426010,ok=426010,error=0, records=41
[INFO ] 2026-06-01 16:26:36.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:26:37.724 [27737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:26:39.761 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21311/300s
[INFO ] 2026-06-01 16:26:45.876 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 16:26:45.876 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426011,ok=426011,error=0, records=41
[INFO ] 2026-06-01 16:26:51.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:26:52.728 [27748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:27:00.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-01 16:27:00.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426012,ok=426012,error=0, records=41
[INFO ] 2026-06-01 16:27:06.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:27:06.771 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21314/300s
[WARN ] 2026-06-01 16:27:07.733 [27787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:27:15.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 16:27:15.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426013,ok=426013,error=0, records=41
[INFO ] 2026-06-01 16:27:21.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:27:22.740 [27748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:27:30.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 16:27:30.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426014,ok=426014,error=0, records=41
[INFO ] 2026-06-01 16:27:36.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:27:37.745 [27787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:27:45.793 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21312/300s
[INFO ] 2026-06-01 16:27:45.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 16:27:45.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426015,ok=426015,error=0, records=41
[INFO ] 2026-06-01 16:27:47.895 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21312/300s
[INFO ] 2026-06-01 16:27:51.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:27:52.751 [27737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:27:55.192 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21312/300s
[INFO ] 2026-06-01 16:28:00.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 16:28:00.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426016,ok=426016,error=0, records=41
[INFO ] 2026-06-01 16:28:06.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:28:07.756 [27748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:28:15.929 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 16:28:15.929 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426017,ok=426017,error=0, records=41
[INFO ] 2026-06-01 16:28:16.541 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17746/300s
[INFO ] 2026-06-01 16:28:16.542 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850088},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:28:16.685 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:28:16.685 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 16:28:16.686 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:28:16.686 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:28:16.686 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:28:16.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:28:16.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:28:16.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:28:16.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:28:16.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:28:16.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:28:21.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:28:22.762 [27748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:28:30.934 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 16:28:30.935 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426018,ok=426018,error=0, records=41
[INFO ] 2026-06-01 16:28:36.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:28:37.768 [27777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:28:45.939 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-01 16:28:45.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426019,ok=426019,error=0, records=41
[INFO ] 2026-06-01 16:28:51.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:28:52.772 [27737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:29:00.945 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-01 16:29:00.945 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426020,ok=426020,error=0, records=41
[INFO ] 2026-06-01 16:29:06.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:29:07.778 [27777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:29:15.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-01 16:29:15.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426021,ok=426021,error=0, records=41
[INFO ] 2026-06-01 16:29:21.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:29:22.784 [27772] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:29:30.957 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-01 16:29:30.957 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426022,ok=426022,error=0, records=41
[INFO ] 2026-06-01 16:29:36.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:29:37.789 [27772] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:29:45.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-01 16:29:45.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426023,ok=426023,error=0, records=41
[INFO ] 2026-06-01 16:29:51.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:29:52.794 [27787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:30:00.970 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 16:30:00.971 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426024,ok=426024,error=0, records=41
[INFO ] 2026-06-01 16:30:01.223 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21316/300s
[INFO ] 2026-06-01 16:30:06.778 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:30:07.800 [27772] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:30:08.800 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21307/300s
[INFO ] 2026-06-01 16:30:15.976 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 16:30:15.976 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426025,ok=426025,error=0, records=41
[INFO ] 2026-06-01 16:30:21.778 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:30:22.804 [28297] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:30:30.980 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 16:30:30.980 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426026,ok=426026,error=0, records=41
[INFO ] 2026-06-01 16:30:30.980 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21303/300s
[INFO ] 2026-06-01 16:30:36.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:30:37.810 [27748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:30:41.791 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21316/300s
[INFO ] 2026-06-01 16:30:45.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 16:30:45.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426027,ok=426027,error=0, records=41
[INFO ] 2026-06-01 16:30:51.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:30:52.815 [28297] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:31:00.990 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 16:31:00.990 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426028,ok=426028,error=0, records=41
[INFO ] 2026-06-01 16:31:06.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:31:07.821 [27777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:31:15.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 16:31:15.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426029,ok=426029,error=0, records=41
[INFO ] 2026-06-01 16:31:16.687 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850012},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:31:16.830 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:31:16.830 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 16:31:16.830 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:31:16.830 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:31:16.830 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:31:16.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:31:16.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:31:16.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:31:16.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:31:16.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:31:16.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:31:21.781 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:31:22.825 [27772] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:31:24.381 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21303/300s
[INFO ] 2026-06-01 16:31:31.001 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-01 16:31:31.001 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426030,ok=426030,error=0, records=41
[INFO ] 2026-06-01 16:31:36.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:31:37.829 [28346] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:31:39.816 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21312/300s
[INFO ] 2026-06-01 16:31:46.007 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 16:31:46.007 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426031,ok=426031,error=0, records=41
[INFO ] 2026-06-01 16:31:51.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:31:52.834 [28375] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:32:01.014 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 16:32:01.014 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426032,ok=426032,error=0, records=41
[INFO ] 2026-06-01 16:32:06.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:32:06.783 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21315/300s
[WARN ] 2026-06-01 16:32:07.840 [28389] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:32:16.020 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 16:32:16.020 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426033,ok=426033,error=0, records=41
[INFO ] 2026-06-01 16:32:21.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:32:22.846 [28375] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:32:31.025 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 16:32:31.025 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426034,ok=426034,error=0, records=41
[INFO ] 2026-06-01 16:32:36.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:32:37.850 [28332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:32:45.850 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21313/300s
[INFO ] 2026-06-01 16:32:46.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 16:32:46.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426035,ok=426035,error=0, records=41
[INFO ] 2026-06-01 16:32:47.952 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21313/300s
[INFO ] 2026-06-01 16:32:51.785 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:32:52.855 [28346] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:32:55.258 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21313/300s
[INFO ] 2026-06-01 16:33:01.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 16:33:01.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426036,ok=426036,error=0, records=41
[INFO ] 2026-06-01 16:33:06.785 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:33:07.860 [28452] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:33:16.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 16:33:16.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426037,ok=426037,error=0, records=41
[INFO ] 2026-06-01 16:33:21.786 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:33:22.865 [28375] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:33:31.071 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 16:33:31.071 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426038,ok=426038,error=0, records=41
[INFO ] 2026-06-01 16:33:36.787 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 16:33:36.787 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 16:33:37.870 [28466] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:33:46.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 16:33:46.077 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426039,ok=426039,error=0, records=41
[INFO ] 2026-06-01 16:33:51.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:33:52.874 [27748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:34:01.113 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 16:34:01.113 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426040,ok=426040,error=0, records=41
[INFO ] 2026-06-01 16:34:06.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:34:07.881 [28466] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:34:16.120 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 16:34:16.120 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426041,ok=426041,error=0, records=41
[INFO ] 2026-06-01 16:34:16.830 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17747/300s
[INFO ] 2026-06-01 16:34:16.832 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849948},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:34:16.994 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:34:16.994 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 16:34:16.994 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:34:16.994 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:34:16.994 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:34:17.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:34:17.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:34:17.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:34:17.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:34:17.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:34:17.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:34:21.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:34:22.887 [28510] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:34:31.127 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 16:34:31.127 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426042,ok=426042,error=0, records=41
[INFO ] 2026-06-01 16:34:36.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:34:37.892 [28527] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:34:46.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 16:34:46.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426043,ok=426043,error=0, records=41
[INFO ] 2026-06-01 16:34:51.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:34:52.897 [28561] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:35:01.141 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 16:35:01.141 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426044,ok=426044,error=0, records=41
[INFO ] 2026-06-01 16:35:01.227 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21317/300s
[INFO ] 2026-06-01 16:35:06.791 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:35:07.902 [28577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:35:08.903 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21308/300s
[INFO ] 2026-06-01 16:35:16.146 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 16:35:16.146 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426045,ok=426045,error=0, records=41
[INFO ] 2026-06-01 16:35:21.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:35:22.909 [28594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:35:31.151 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 16:35:31.151 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426046,ok=426046,error=0, records=41
[INFO ] 2026-06-01 16:35:31.151 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21304/300s
[INFO ] 2026-06-01 16:35:36.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:35:37.914 [28611] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:35:41.798 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21317/300s
[INFO ] 2026-06-01 16:35:46.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 16:35:46.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426047,ok=426047,error=0, records=41
[INFO ] 2026-06-01 16:35:51.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:35:52.919 [28628] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:36:01.162 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 16:36:01.162 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426048,ok=426048,error=0, records=41
[INFO ] 2026-06-01 16:36:06.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:36:07.924 [28650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:36:16.196 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 16:36:16.196 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426049,ok=426049,error=0, records=41
[INFO ] 2026-06-01 16:36:21.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:36:22.930 [28662] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:36:24.565 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21304/300s
[INFO ] 2026-06-01 16:36:31.201 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 16:36:31.201 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426050,ok=426050,error=0, records=41
[INFO ] 2026-06-01 16:36:36.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:36:37.935 [28661] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:36:39.874 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21313/300s
[INFO ] 2026-06-01 16:36:46.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 16:36:46.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426051,ok=426051,error=0, records=41
[INFO ] 2026-06-01 16:36:51.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:36:52.942 [28689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:37:01.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 16:37:01.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426052,ok=426052,error=0, records=41
[INFO ] 2026-06-01 16:37:06.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:37:06.796 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21316/300s
[WARN ] 2026-06-01 16:37:07.947 [28717] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:37:16.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 16:37:16.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426053,ok=426053,error=0, records=41
[INFO ] 2026-06-01 16:37:16.996 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849868},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:37:17.161 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:37:17.161 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 16:37:17.161 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:37:17.161 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:37:17.161 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:37:17.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:37:17.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:37:17.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:37:17.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:37:17.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:37:17.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:37:21.797 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:37:22.953 [28679] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:37:31.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 16:37:31.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426054,ok=426054,error=0, records=41
[INFO ] 2026-06-01 16:37:36.797 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:37:37.959 [28711] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:37:45.914 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21314/300s
[INFO ] 2026-06-01 16:37:46.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 16:37:46.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426055,ok=426055,error=0, records=41
[INFO ] 2026-06-01 16:37:48.016 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21314/300s
[INFO ] 2026-06-01 16:37:51.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:37:52.964 [28679] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:37:55.322 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21314/300s
[INFO ] 2026-06-01 16:38:01.241 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 16:38:01.241 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426056,ok=426056,error=0, records=41
[INFO ] 2026-06-01 16:38:06.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:38:07.968 [28717] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:38:16.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 16:38:16.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426057,ok=426057,error=0, records=41
[INFO ] 2026-06-01 16:38:21.799 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:38:22.973 [28782] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:38:31.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 16:38:31.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426058,ok=426058,error=0, records=41
[INFO ] 2026-06-01 16:38:36.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:38:37.978 [28796] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:38:46.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 16:38:46.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426059,ok=426059,error=0, records=41
[INFO ] 2026-06-01 16:38:51.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:38:51.800 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 16:38:52.983 [28717] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:39:01.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 16:39:01.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426060,ok=426060,error=0, records=41
[INFO ] 2026-06-01 16:39:06.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:39:07.988 [28824] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:39:16.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 16:39:16.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426061,ok=426061,error=0, records=41
[INFO ] 2026-06-01 16:39:21.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:39:22.993 [28838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:39:31.282 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 16:39:31.282 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426062,ok=426062,error=0, records=41
[INFO ] 2026-06-01 16:39:36.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:39:37.998 [28852] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:39:46.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 16:39:46.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426063,ok=426063,error=0, records=41
[INFO ] 2026-06-01 16:39:51.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:39:53.003 [28796] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:40:01.231 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21318/300s
[INFO ] 2026-06-01 16:40:01.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 16:40:01.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426064,ok=426064,error=0, records=41
[INFO ] 2026-06-01 16:40:06.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:40:08.009 [28717] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:40:09.009 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21309/300s
[INFO ] 2026-06-01 16:40:16.329 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 16:40:16.329 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426065,ok=426065,error=0, records=41
[INFO ] 2026-06-01 16:40:17.161 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17748/300s
[INFO ] 2026-06-01 16:40:17.163 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849788},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:40:17.320 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:40:17.320 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 16:40:17.320 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:40:17.320 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:40:17.320 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:40:17.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:40:17.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:40:17.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:40:17.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:40:17.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:40:17.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:40:21.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:40:23.016 [28866] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:40:31.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 16:40:31.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426066,ok=426066,error=0, records=41
[INFO ] 2026-06-01 16:40:31.335 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21305/300s
[INFO ] 2026-06-01 16:40:36.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:40:38.021 [28885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:40:41.805 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21318/300s
[INFO ] 2026-06-01 16:40:46.340 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 16:40:46.340 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426067,ok=426067,error=0, records=41
[INFO ] 2026-06-01 16:40:51.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:40:53.026 [28900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:41:01.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-01 16:41:01.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426068,ok=426068,error=0, records=41
[INFO ] 2026-06-01 16:41:06.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:41:08.031 [28914] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:41:16.354 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 16:41:16.354 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426069,ok=426069,error=0, records=41
[INFO ] 2026-06-01 16:41:21.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:41:23.037 [28942] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:41:24.749 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21305/300s
[INFO ] 2026-06-01 16:41:31.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 16:41:31.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426070,ok=426070,error=0, records=41
[INFO ] 2026-06-01 16:41:36.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:41:38.042 [28942] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:41:39.931 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21314/300s
[INFO ] 2026-06-01 16:41:46.366 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 16:41:46.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426071,ok=426071,error=0, records=41
[INFO ] 2026-06-01 16:41:51.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:41:53.047 [28991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:42:01.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 16:42:01.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426072,ok=426072,error=0, records=41
[INFO ] 2026-06-01 16:42:06.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:42:06.809 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21317/300s
[WARN ] 2026-06-01 16:42:08.052 [28991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:42:16.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 16:42:16.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426073,ok=426073,error=0, records=41
[INFO ] 2026-06-01 16:42:21.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:42:22.557 [29007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:42:31.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 16:42:31.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426074,ok=426074,error=0, records=41
[INFO ] 2026-06-01 16:42:36.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:42:37.566 [29007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:42:45.977 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21315/300s
[INFO ] 2026-06-01 16:42:46.392 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 16:42:46.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426075,ok=426075,error=0, records=41
[WARN ] 2026-06-01 16:42:47.570 [29012] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/22853/stat), No such file or directory
[INFO ] 2026-06-01 16:42:48.079 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21315/300s
[INFO ] 2026-06-01 16:42:51.811 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:42:52.571 [29062] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:42:55.382 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21315/300s
[INFO ] 2026-06-01 16:43:01.397 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-01 16:43:01.397 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426076,ok=426076,error=0, records=41
[INFO ] 2026-06-01 16:43:06.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:43:07.576 [29051] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:43:16.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-01 16:43:16.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426077,ok=426077,error=0, records=41
[INFO ] 2026-06-01 16:43:17.322 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849704},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:43:17.491 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:43:17.491 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 16:43:17.491 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:43:17.491 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:43:17.491 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:43:17.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:43:17.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:43:17.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:43:17.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:43:17.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:43:17.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:43:21.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:43:22.582 [29051] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:43:31.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-01 16:43:31.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426078,ok=426078,error=0, records=41
[WARN ] 2026-06-01 16:43:32.587 [29111] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23748/stat), No such file or directory
[INFO ] 2026-06-01 16:43:36.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 16:43:36.813 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 16:43:37.587 [29100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:43:46.413 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 16:43:46.413 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426079,ok=426079,error=0, records=41
[WARN ] 2026-06-01 16:43:47.592 [29130] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23748/stat), No such file or directory
[INFO ] 2026-06-01 16:43:51.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:43:52.592 [29130] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:44:01.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 16:44:01.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426080,ok=426080,error=0, records=41
[INFO ] 2026-06-01 16:44:06.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:44:07.597 [29135] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:44:16.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 16:44:16.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426081,ok=426081,error=0, records=41
[INFO ] 2026-06-01 16:44:21.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:44:22.602 [29100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:44:31.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 16:44:31.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426082,ok=426082,error=0, records=41
[INFO ] 2026-06-01 16:44:36.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:44:37.608 [29145] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:44:46.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 16:44:46.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426083,ok=426083,error=0, records=41
[INFO ] 2026-06-01 16:44:51.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:44:52.613 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:45:01.235 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21319/300s
[INFO ] 2026-06-01 16:45:01.453 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 16:45:01.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426084,ok=426084,error=0, records=41
[INFO ] 2026-06-01 16:45:06.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:45:07.619 [29105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:45:09.119 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21310/300s
[INFO ] 2026-06-01 16:45:16.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 16:45:16.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426085,ok=426085,error=0, records=41
[INFO ] 2026-06-01 16:45:21.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:45:22.624 [29105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:45:31.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-01 16:45:31.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426086,ok=426086,error=0, records=41
[INFO ] 2026-06-01 16:45:31.466 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21306/300s
[INFO ] 2026-06-01 16:45:36.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:45:37.629 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:45:41.811 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21319/300s
[INFO ] 2026-06-01 16:45:46.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10143, records=41
[INFO ] 2026-06-01 16:45:46.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426087,ok=426087,error=0, records=41
[INFO ] 2026-06-01 16:45:51.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:45:52.634 [29105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:46:01.477 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-01 16:46:01.477 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426088,ok=426088,error=0, records=41
[INFO ] 2026-06-01 16:46:06.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:46:07.638 [29100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:46:16.482 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 16:46:16.482 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426089,ok=426089,error=0, records=41
[INFO ] 2026-06-01 16:46:17.491 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17749/300s
[INFO ] 2026-06-01 16:46:17.493 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849628},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:46:17.644 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:46:17.644 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 16:46:17.644 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:46:17.644 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:46:17.644 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:46:17.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:46:17.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:46:17.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:46:17.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:46:17.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:46:17.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:46:21.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:46:22.643 [29100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:46:24.930 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21306/300s
[INFO ] 2026-06-01 16:46:31.487 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 16:46:31.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426090,ok=426090,error=0, records=41
[WARN ] 2026-06-01 16:46:32.647 [29145] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19765/stat), No such file or directory
[INFO ] 2026-06-01 16:46:36.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:46:37.649 [29105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:46:39.986 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21315/300s
[INFO ] 2026-06-01 16:46:46.497 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 16:46:46.497 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426091,ok=426091,error=0, records=41
[WARN ] 2026-06-01 16:46:47.653 [29100] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19765/stat), No such file or directory
[INFO ] 2026-06-01 16:46:51.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:46:52.654 [29135] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:47:01.502 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 16:47:01.502 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426092,ok=426092,error=0, records=41
[INFO ] 2026-06-01 16:47:06.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:47:06.822 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21318/300s
[WARN ] 2026-06-01 16:47:07.660 [29105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:47:16.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 16:47:16.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426093,ok=426093,error=0, records=41
[INFO ] 2026-06-01 16:47:21.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:47:22.666 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:47:31.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 16:47:31.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426094,ok=426094,error=0, records=41
[INFO ] 2026-06-01 16:47:36.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:47:37.671 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:47:46.031 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21316/300s
[INFO ] 2026-06-01 16:47:46.518 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 16:47:46.518 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426095,ok=426095,error=0, records=41
[INFO ] 2026-06-01 16:47:48.133 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21316/300s
[INFO ] 2026-06-01 16:47:51.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:47:52.677 [29100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:47:55.439 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21316/300s
[INFO ] 2026-06-01 16:48:01.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 16:48:01.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426096,ok=426096,error=0, records=41
[INFO ] 2026-06-01 16:48:06.824 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:48:07.683 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:48:16.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 16:48:16.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426097,ok=426097,error=0, records=41
[INFO ] 2026-06-01 16:48:21.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:48:22.687 [29145] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:48:31.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 16:48:31.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426098,ok=426098,error=0, records=41
[INFO ] 2026-06-01 16:48:36.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:48:37.692 [29100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:48:46.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 16:48:46.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426099,ok=426099,error=0, records=41
[INFO ] 2026-06-01 16:48:51.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:48:52.698 [29135] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:49:01.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 16:49:01.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426100,ok=426100,error=0, records=41
[INFO ] 2026-06-01 16:49:06.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:49:07.702 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:49:16.555 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 16:49:16.555 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426101,ok=426101,error=0, records=41
[INFO ] 2026-06-01 16:49:17.646 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849556},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:49:17.800 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:49:17.800 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 16:49:17.800 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:49:17.800 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:49:17.800 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:49:17.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:49:17.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:49:17.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:49:17.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:49:17.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:49:17.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:49:21.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:49:22.707 [29145] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:49:31.560 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 16:49:31.560 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426102,ok=426102,error=0, records=41
[INFO ] 2026-06-01 16:49:36.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:49:37.713 [29145] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:49:46.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 16:49:46.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426103,ok=426103,error=0, records=41
[INFO ] 2026-06-01 16:49:51.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:49:52.718 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:50:01.238 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21320/300s
[INFO ] 2026-06-01 16:50:01.570 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 16:50:01.570 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426104,ok=426104,error=0, records=41
[INFO ] 2026-06-01 16:50:06.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:50:07.724 [29145] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:50:09.224 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21311/300s
[INFO ] 2026-06-01 16:50:16.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 16:50:16.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426105,ok=426105,error=0, records=41
[INFO ] 2026-06-01 16:50:21.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:50:22.730 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:50:31.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 16:50:31.581 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426106,ok=426106,error=0, records=41
[INFO ] 2026-06-01 16:50:31.581 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21307/300s
[INFO ] 2026-06-01 16:50:36.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:50:37.735 [29105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:50:41.818 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21320/300s
[INFO ] 2026-06-01 16:50:46.587 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 16:50:46.587 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426107,ok=426107,error=0, records=41
[INFO ] 2026-06-01 16:50:51.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:50:52.739 [29135] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:51:01.592 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 16:51:01.592 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426108,ok=426108,error=0, records=41
[INFO ] 2026-06-01 16:51:06.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:51:07.743 [29105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:51:16.598 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 16:51:16.598 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426109,ok=426109,error=0, records=41
[INFO ] 2026-06-01 16:51:21.832 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:51:22.748 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:51:25.111 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21307/300s
[INFO ] 2026-06-01 16:51:31.606 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 16:51:31.606 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426110,ok=426110,error=0, records=41
[INFO ] 2026-06-01 16:51:36.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:51:37.752 [29105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:51:40.043 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21316/300s
[INFO ] 2026-06-01 16:51:46.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 16:51:46.612 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426111,ok=426111,error=0, records=41
[INFO ] 2026-06-01 16:51:51.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:51:52.758 [29135] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:52:01.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 16:52:01.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426112,ok=426112,error=0, records=41
[INFO ] 2026-06-01 16:52:06.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:52:06.834 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21319/300s
[WARN ] 2026-06-01 16:52:07.762 [29105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:52:16.626 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-01 16:52:16.626 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426113,ok=426113,error=0, records=41
[INFO ] 2026-06-01 16:52:17.800 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17750/300s
[INFO ] 2026-06-01 16:52:17.802 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849468},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:52:17.962 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:52:17.962 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 16:52:17.962 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:52:17.962 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:52:17.962 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:52:18.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:52:18.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:52:18.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:52:18.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:52:18.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:52:18.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:52:21.835 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:52:22.767 [29105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:52:31.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-01 16:52:31.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426114,ok=426114,error=0, records=41
[INFO ] 2026-06-01 16:52:36.835 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:52:37.771 [29145] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:52:46.097 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21317/300s
[INFO ] 2026-06-01 16:52:46.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 16:52:46.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426115,ok=426115,error=0, records=41
[INFO ] 2026-06-01 16:52:48.199 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21317/300s
[INFO ] 2026-06-01 16:52:51.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:52:52.776 [29100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:52:55.506 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21317/300s
[INFO ] 2026-06-01 16:53:01.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 16:53:01.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426116,ok=426116,error=0, records=41
[INFO ] 2026-06-01 16:53:06.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:53:07.783 [29100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:53:16.648 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 16:53:16.648 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426117,ok=426117,error=0, records=41
[INFO ] 2026-06-01 16:53:21.837 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:53:22.788 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:53:31.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 16:53:31.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426118,ok=426118,error=0, records=41
[INFO ] 2026-06-01 16:53:36.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 16:53:36.838 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 16:53:37.793 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:53:46.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 16:53:46.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426119,ok=426119,error=0, records=41
[INFO ] 2026-06-01 16:53:51.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:53:51.838 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 16:53:52.797 [29135] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:54:01.666 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 16:54:01.666 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426120,ok=426120,error=0, records=41
[INFO ] 2026-06-01 16:54:06.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:54:07.802 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:54:16.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 16:54:16.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426121,ok=426121,error=0, records=41
[INFO ] 2026-06-01 16:54:21.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:54:22.806 [29697] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:54:31.678 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 16:54:31.678 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426122,ok=426122,error=0, records=41
[INFO ] 2026-06-01 16:54:36.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:54:37.811 [29712] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:54:46.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 16:54:46.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426123,ok=426123,error=0, records=41
[INFO ] 2026-06-01 16:54:51.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:54:52.817 [29707] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:55:01.242 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21321/300s
[INFO ] 2026-06-01 16:55:01.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 16:55:01.693 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426124,ok=426124,error=0, records=41
[INFO ] 2026-06-01 16:55:06.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:55:07.823 [29100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:55:09.324 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21312/300s
[INFO ] 2026-06-01 16:55:16.699 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 16:55:16.699 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426125,ok=426125,error=0, records=41
[INFO ] 2026-06-01 16:55:17.964 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849352},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:55:18.103 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:55:18.103 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 16:55:18.103 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:55:18.103 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:55:18.103 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:55:18.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:55:18.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:55:18.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:55:18.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:55:18.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:55:18.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:55:21.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:55:22.829 [29754] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:55:31.704 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 16:55:31.704 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426126,ok=426126,error=0, records=41
[INFO ] 2026-06-01 16:55:31.704 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21308/300s
[INFO ] 2026-06-01 16:55:36.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:55:37.834 [29697] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:55:41.824 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21321/300s
[INFO ] 2026-06-01 16:55:46.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 16:55:46.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426127,ok=426127,error=0, records=41
[INFO ] 2026-06-01 16:55:51.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:55:52.839 [29722] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:56:01.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 16:56:01.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426128,ok=426128,error=0, records=41
[INFO ] 2026-06-01 16:56:06.845 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:56:07.845 [29712] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:56:16.726 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 16:56:16.726 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426129,ok=426129,error=0, records=41
[INFO ] 2026-06-01 16:56:21.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:56:22.850 [29100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:56:25.296 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21308/300s
[INFO ] 2026-06-01 16:56:31.731 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 16:56:31.731 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426130,ok=426130,error=0, records=41
[INFO ] 2026-06-01 16:56:36.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:56:37.856 [29804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:56:40.101 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21317/300s
[INFO ] 2026-06-01 16:56:46.736 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 16:56:46.736 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426131,ok=426131,error=0, records=41
[INFO ] 2026-06-01 16:56:51.847 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:56:52.861 [29697] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:57:01.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 16:57:01.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426132,ok=426132,error=0, records=41
[INFO ] 2026-06-01 16:57:06.847 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 16:57:06.847 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21320/300s
[WARN ] 2026-06-01 16:57:07.866 [29846] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:57:16.749 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 16:57:16.749 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426133,ok=426133,error=0, records=41
[INFO ] 2026-06-01 16:57:21.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:57:22.870 [29832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:57:31.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 16:57:31.754 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426134,ok=426134,error=0, records=41
[INFO ] 2026-06-01 16:57:36.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:57:37.874 [29879] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:57:46.165 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21318/300s
[INFO ] 2026-06-01 16:57:46.761 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 16:57:46.761 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426135,ok=426135,error=0, records=41
[INFO ] 2026-06-01 16:57:48.267 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21318/300s
[INFO ] 2026-06-01 16:57:51.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:57:52.879 [29832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:57:55.572 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21318/300s
[INFO ] 2026-06-01 16:58:01.769 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10146, records=41
[INFO ] 2026-06-01 16:58:01.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426136,ok=426136,error=0, records=41
[INFO ] 2026-06-01 16:58:06.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:58:07.883 [29912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:58:16.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 16:58:16.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426137,ok=426137,error=0, records=41
[INFO ] 2026-06-01 16:58:18.103 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17751/300s
[INFO ] 2026-06-01 16:58:18.105 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849192},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 16:58:18.250 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 16:58:18.251 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 16:58:18.251 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 16:58:18.251 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 16:58:18.251 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 16:58:18.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:58:18.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 16:58:18.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:58:18.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 16:58:18.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:58:18.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 16:58:21.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:58:22.887 [29912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:58:31.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 16:58:31.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426138,ok=426138,error=0, records=41
[INFO ] 2026-06-01 16:58:36.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:58:37.893 [29912] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:58:46.793 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 16:58:46.793 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426139,ok=426139,error=0, records=41
[INFO ] 2026-06-01 16:58:51.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:58:52.897 [29923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:59:01.799 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 16:59:01.799 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426140,ok=426140,error=0, records=41
[INFO ] 2026-06-01 16:59:06.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:59:07.902 [29978] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:59:16.804 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 16:59:16.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426141,ok=426141,error=0, records=41
[INFO ] 2026-06-01 16:59:21.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:59:22.906 [29983] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:59:31.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 16:59:31.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426142,ok=426142,error=0, records=41
[INFO ] 2026-06-01 16:59:36.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:59:37.912 [30004] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 16:59:46.816 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 16:59:46.816 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426143,ok=426143,error=0, records=41
[INFO ] 2026-06-01 16:59:51.854 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 16:59:52.917 [30027] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:00:01.245 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21322/300s
[INFO ] 2026-06-01 17:00:01.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 17:00:01.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426144,ok=426144,error=0, records=41
[INFO ] 2026-06-01 17:00:06.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:00:07.922 [30049] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:00:09.423 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21313/300s
[INFO ] 2026-06-01 17:00:16.826 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 17:00:16.826 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426145,ok=426145,error=0, records=41
[INFO ] 2026-06-01 17:00:21.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:00:22.930 [30044] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:00:31.831 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 17:00:31.832 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426146,ok=426146,error=0, records=41
[INFO ] 2026-06-01 17:00:31.832 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21309/300s
[INFO ] 2026-06-01 17:00:36.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:00:37.935 [30074] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:00:41.831 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21322/300s
[INFO ] 2026-06-01 17:00:46.837 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 17:00:46.837 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426147,ok=426147,error=0, records=41
[INFO ] 2026-06-01 17:00:51.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:00:52.939 [30064] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:01:01.843 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 17:01:01.843 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426148,ok=426148,error=0, records=41
[INFO ] 2026-06-01 17:01:06.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:01:07.944 [30120] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:01:16.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 17:01:16.851 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426149,ok=426149,error=0, records=41
[INFO ] 2026-06-01 17:01:18.252 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849112},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:01:18.425 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:01:18.425 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 17:01:18.425 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:01:18.425 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:01:18.425 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:01:18.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:01:18.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:01:18.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:01:18.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:01:18.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:01:18.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:01:21.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:01:22.950 [30075] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:01:25.478 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21309/300s
[INFO ] 2026-06-01 17:01:31.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 17:01:31.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426150,ok=426150,error=0, records=41
[INFO ] 2026-06-01 17:01:36.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:01:37.955 [30151] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:01:40.158 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21318/300s
[INFO ] 2026-06-01 17:01:46.863 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 17:01:46.863 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426151,ok=426151,error=0, records=41
[INFO ] 2026-06-01 17:01:51.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:01:52.960 [30151] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:02:01.872 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 17:02:01.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426152,ok=426152,error=0, records=41
[INFO ] 2026-06-01 17:02:06.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:02:06.859 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21321/300s
[WARN ] 2026-06-01 17:02:07.965 [30129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:02:16.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-01 17:02:16.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426153,ok=426153,error=0, records=41
[INFO ] 2026-06-01 17:02:21.860 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:02:22.971 [30193] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:02:31.971 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-01 17:02:31.971 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426154,ok=426154,error=0, records=41
[INFO ] 2026-06-01 17:02:36.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:02:37.976 [30129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:02:46.216 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21319/300s
[INFO ] 2026-06-01 17:02:46.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-01 17:02:46.981 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426155,ok=426155,error=0, records=41
[INFO ] 2026-06-01 17:02:48.318 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21319/300s
[INFO ] 2026-06-01 17:02:51.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:02:52.980 [30207] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:02:55.624 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21319/300s
[INFO ] 2026-06-01 17:03:01.986 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 17:03:01.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426156,ok=426156,error=0, records=41
[INFO ] 2026-06-01 17:03:06.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:03:07.986 [30207] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:03:17.042 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 17:03:17.043 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426157,ok=426157,error=0, records=41
[INFO ] 2026-06-01 17:03:21.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:03:22.991 [30135] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:03:32.048 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 17:03:32.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426158,ok=426158,error=0, records=41
[INFO ] 2026-06-01 17:03:36.863 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 17:03:36.863 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 17:03:37.996 [30262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:03:47.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 17:03:47.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426159,ok=426159,error=0, records=41
[INFO ] 2026-06-01 17:03:51.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:03:53.001 [30262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:04:02.059 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 17:04:02.059 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426160,ok=426160,error=0, records=41
[INFO ] 2026-06-01 17:04:06.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:04:08.007 [30151] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:04:17.064 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 17:04:17.064 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426161,ok=426161,error=0, records=41
[INFO ] 2026-06-01 17:04:18.425 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17752/300s
[INFO ] 2026-06-01 17:04:18.427 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849032},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:04:18.577 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:04:18.577 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 17:04:18.577 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:04:18.577 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:04:18.577 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:04:18.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:04:18.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:04:18.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:04:18.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:04:18.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:04:18.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:04:21.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:04:23.013 [30221] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:04:32.070 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 17:04:32.070 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426162,ok=426162,error=0, records=41
[INFO ] 2026-06-01 17:04:36.866 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:04:38.018 [30303] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:04:47.076 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 17:04:47.076 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426163,ok=426163,error=0, records=41
[INFO ] 2026-06-01 17:04:51.866 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:04:53.022 [30332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:05:01.249 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21323/300s
[INFO ] 2026-06-01 17:05:02.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 17:05:02.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426164,ok=426164,error=0, records=41
[INFO ] 2026-06-01 17:05:06.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:05:08.028 [30221] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:05:09.528 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21314/300s
[INFO ] 2026-06-01 17:05:17.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 17:05:17.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426165,ok=426165,error=0, records=41
[INFO ] 2026-06-01 17:05:21.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:05:23.034 [30318] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:05:32.124 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 17:05:32.124 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426166,ok=426166,error=0, records=41
[INFO ] 2026-06-01 17:05:32.124 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21310/300s
[INFO ] 2026-06-01 17:05:36.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:05:38.039 [30359] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:05:41.838 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21323/300s
[INFO ] 2026-06-01 17:05:47.131 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 17:05:47.131 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426167,ok=426167,error=0, records=41
[INFO ] 2026-06-01 17:05:51.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:05:53.045 [30359] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:06:02.136 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 17:06:02.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426168,ok=426168,error=0, records=41
[INFO ] 2026-06-01 17:06:06.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:06:08.049 [30409] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:06:17.141 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 17:06:17.141 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426169,ok=426169,error=0, records=41
[INFO ] 2026-06-01 17:06:21.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:06:22.554 [30431] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:06:25.660 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21310/300s
[INFO ] 2026-06-01 17:06:32.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 17:06:32.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426170,ok=426170,error=0, records=41
[INFO ] 2026-06-01 17:06:36.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:06:37.559 [30448] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:06:40.211 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21319/300s
[INFO ] 2026-06-01 17:06:47.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 17:06:47.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426171,ok=426171,error=0, records=41
[INFO ] 2026-06-01 17:06:51.871 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:06:52.564 [30466] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:07:02.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 17:07:02.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426172,ok=426172,error=0, records=41
[INFO ] 2026-06-01 17:07:06.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:07:06.872 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21322/300s
[WARN ] 2026-06-01 17:07:07.569 [30484] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:07:17.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 17:07:17.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426173,ok=426173,error=0, records=41
[INFO ] 2026-06-01 17:07:18.579 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848952},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:07:18.719 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:07:18.719 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 17:07:18.719 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:07:18.719 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:07:18.719 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:07:18.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:07:18.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:07:18.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:07:18.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:07:18.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:07:18.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:07:21.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:07:22.574 [30502] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:07:32.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 17:07:32.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426174,ok=426174,error=0, records=41
[INFO ] 2026-06-01 17:07:36.873 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:07:37.580 [30517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:07:46.283 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21320/300s
[INFO ] 2026-06-01 17:07:47.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 17:07:47.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426175,ok=426175,error=0, records=41
[INFO ] 2026-06-01 17:07:48.385 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21320/300s
[INFO ] 2026-06-01 17:07:51.873 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:07:52.585 [30513] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:07:55.691 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21320/300s
[INFO ] 2026-06-01 17:08:02.241 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 17:08:02.242 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426176,ok=426176,error=0, records=41
[INFO ] 2026-06-01 17:08:06.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:08:07.591 [30549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:08:17.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 17:08:17.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426177,ok=426177,error=0, records=41
[INFO ] 2026-06-01 17:08:21.875 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:08:22.597 [30549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:08:32.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 17:08:32.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426178,ok=426178,error=0, records=41
[INFO ] 2026-06-01 17:08:36.875 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:08:37.602 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:08:47.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 17:08:47.262 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426179,ok=426179,error=0, records=41
[INFO ] 2026-06-01 17:08:51.876 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:08:51.876 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 17:08:52.608 [30521] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:09:02.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 17:09:02.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426180,ok=426180,error=0, records=41
[INFO ] 2026-06-01 17:09:06.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:09:07.614 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:09:17.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 17:09:17.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426181,ok=426181,error=0, records=41
[INFO ] 2026-06-01 17:09:21.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:09:22.620 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:09:32.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 17:09:32.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426182,ok=426182,error=0, records=41
[INFO ] 2026-06-01 17:09:36.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:09:37.626 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:09:47.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 17:09:47.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426183,ok=426183,error=0, records=41
[INFO ] 2026-06-01 17:09:51.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:09:52.632 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:10:01.253 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21324/300s
[INFO ] 2026-06-01 17:10:02.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 17:10:02.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426184,ok=426184,error=0, records=41
[INFO ] 2026-06-01 17:10:06.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:10:07.638 [30521] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:10:09.638 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21315/300s
[INFO ] 2026-06-01 17:10:17.299 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 17:10:17.299 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426185,ok=426185,error=0, records=41
[INFO ] 2026-06-01 17:10:18.719 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17753/300s
[INFO ] 2026-06-01 17:10:18.721 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848876},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:10:18.882 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:10:18.882 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 17:10:18.883 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:10:18.883 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:10:18.883 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:10:18.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:10:18.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:10:18.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:10:18.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:10:18.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:10:18.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:10:21.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:10:22.642 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:10:32.304 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 17:10:32.305 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426186,ok=426186,error=0, records=41
[INFO ] 2026-06-01 17:10:32.305 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21311/300s
[INFO ] 2026-06-01 17:10:36.882 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:10:37.647 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:10:41.846 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21324/300s
[INFO ] 2026-06-01 17:10:47.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 17:10:47.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426187,ok=426187,error=0, records=41
[INFO ] 2026-06-01 17:10:51.882 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:10:52.652 [30531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:11:02.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 17:11:02.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426188,ok=426188,error=0, records=41
[INFO ] 2026-06-01 17:11:06.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:11:07.657 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:11:17.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 17:11:17.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426189,ok=426189,error=0, records=41
[INFO ] 2026-06-01 17:11:21.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:11:22.662 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:11:25.845 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21311/300s
[INFO ] 2026-06-01 17:11:32.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 17:11:32.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426190,ok=426190,error=0, records=41
[INFO ] 2026-06-01 17:11:36.884 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:11:37.668 [30531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:11:40.271 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21320/300s
[INFO ] 2026-06-01 17:11:47.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 17:11:47.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426191,ok=426191,error=0, records=41
[INFO ] 2026-06-01 17:11:51.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:11:52.672 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:12:02.341 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 17:12:02.341 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426192,ok=426192,error=0, records=41
[INFO ] 2026-06-01 17:12:06.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:12:06.885 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21323/300s
[WARN ] 2026-06-01 17:12:07.677 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:12:17.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 17:12:17.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426193,ok=426193,error=0, records=41
[INFO ] 2026-06-01 17:12:21.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:12:22.682 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:12:32.354 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 17:12:32.354 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426194,ok=426194,error=0, records=41
[INFO ] 2026-06-01 17:12:36.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:12:37.687 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:12:46.362 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21321/300s
[INFO ] 2026-06-01 17:12:47.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 17:12:47.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426195,ok=426195,error=0, records=41
[INFO ] 2026-06-01 17:12:48.463 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21321/300s
[INFO ] 2026-06-01 17:12:51.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:12:52.692 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:12:55.769 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21321/300s
[INFO ] 2026-06-01 17:13:02.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 17:13:02.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426196,ok=426196,error=0, records=41
[INFO ] 2026-06-01 17:13:06.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:13:07.697 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:13:17.373 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 17:13:17.373 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426197,ok=426197,error=0, records=41
[INFO ] 2026-06-01 17:13:18.884 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848796},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:13:19.062 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:13:19.062 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 17:13:19.062 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:13:19.062 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:13:19.062 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:13:19.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:13:19.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:13:19.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:13:19.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:13:19.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:13:19.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:13:21.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:13:22.703 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:13:32.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 17:13:32.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426198,ok=426198,error=0, records=41
[INFO ] 2026-06-01 17:13:36.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 17:13:36.889 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 17:13:37.707 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:13:47.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 17:13:47.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426199,ok=426199,error=0, records=41
[INFO ] 2026-06-01 17:13:51.890 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:13:52.713 [30531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:14:02.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 17:14:02.532 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426200,ok=426200,error=0, records=41
[INFO ] 2026-06-01 17:14:06.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:14:07.720 [30521] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:14:17.538 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 17:14:17.538 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426201,ok=426201,error=0, records=41
[INFO ] 2026-06-01 17:14:21.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:14:22.726 [30531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:14:32.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 17:14:32.545 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426202,ok=426202,error=0, records=41
[INFO ] 2026-06-01 17:14:36.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:14:37.731 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:14:47.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 17:14:47.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426203,ok=426203,error=0, records=41
[INFO ] 2026-06-01 17:14:51.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:14:52.736 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:15:01.256 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21325/300s
[INFO ] 2026-06-01 17:15:02.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 17:15:02.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426204,ok=426204,error=0, records=41
[INFO ] 2026-06-01 17:15:06.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:15:07.742 [30521] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:15:09.742 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21316/300s
[INFO ] 2026-06-01 17:15:17.568 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 17:15:17.568 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426205,ok=426205,error=0, records=41
[INFO ] 2026-06-01 17:15:21.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:15:22.747 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:15:32.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 17:15:32.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426206,ok=426206,error=0, records=41
[INFO ] 2026-06-01 17:15:32.575 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21312/300s
[INFO ] 2026-06-01 17:15:36.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:15:37.753 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:15:41.853 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21325/300s
[INFO ] 2026-06-01 17:15:47.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 17:15:47.581 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426207,ok=426207,error=0, records=41
[INFO ] 2026-06-01 17:15:51.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:15:52.758 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:16:02.587 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 17:16:02.587 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426208,ok=426208,error=0, records=41
[INFO ] 2026-06-01 17:16:06.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:16:07.764 [30521] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:16:17.593 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 17:16:17.593 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426209,ok=426209,error=0, records=41
[INFO ] 2026-06-01 17:16:19.062 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17754/300s
[INFO ] 2026-06-01 17:16:19.064 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848724},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:16:19.232 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:16:19.232 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 17:16:19.232 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:16:19.232 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:16:19.232 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:16:19.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:16:19.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:16:19.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:16:19.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:16:19.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:16:19.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:16:21.897 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:16:22.768 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:16:26.029 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21312/300s
[INFO ] 2026-06-01 17:16:32.599 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 17:16:32.599 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426210,ok=426210,error=0, records=41
[INFO ] 2026-06-01 17:16:36.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:16:37.774 [30564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:16:40.328 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21321/300s
[INFO ] 2026-06-01 17:16:47.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 17:16:47.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426211,ok=426211,error=0, records=41
[INFO ] 2026-06-01 17:16:51.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:16:52.779 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:17:02.610 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 17:17:02.610 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426212,ok=426212,error=0, records=41
[INFO ] 2026-06-01 17:17:06.899 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:17:06.899 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21324/300s
[WARN ] 2026-06-01 17:17:07.785 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:17:17.616 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 17:17:17.616 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426213,ok=426213,error=0, records=41
[INFO ] 2026-06-01 17:17:21.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:17:22.791 [30521] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:17:32.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-01 17:17:32.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426214,ok=426214,error=0, records=41
[INFO ] 2026-06-01 17:17:36.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:17:37.797 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:17:46.436 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21322/300s
[INFO ] 2026-06-01 17:17:47.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-01 17:17:47.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426215,ok=426215,error=0, records=41
[INFO ] 2026-06-01 17:17:48.538 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21322/300s
[INFO ] 2026-06-01 17:17:51.901 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:17:52.803 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:17:55.844 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21322/300s
[INFO ] 2026-06-01 17:18:02.706 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 17:18:02.706 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426216,ok=426216,error=0, records=41
[INFO ] 2026-06-01 17:18:06.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:18:07.809 [30531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:18:17.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 17:18:17.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426217,ok=426217,error=0, records=41
[INFO ] 2026-06-01 17:18:21.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:18:22.813 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:18:32.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 17:18:32.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426218,ok=426218,error=0, records=41
[INFO ] 2026-06-01 17:18:36.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:18:37.819 [31115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:18:47.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=13274, records=49
[INFO ] 2026-06-01 17:18:47.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426219,ok=426219,error=0, records=49
[INFO ] 2026-06-01 17:18:51.904 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:18:52.825 [31130] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:19:02.745 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 17:19:02.745 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426220,ok=426220,error=0, records=41
[INFO ] 2026-06-01 17:19:06.904 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:19:07.830 [31081] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:19:17.751 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 17:19:17.751 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426221,ok=426221,error=0, records=41
[INFO ] 2026-06-01 17:19:19.234 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848648},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:19:19.404 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:19:19.404 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 17:19:19.404 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:19:19.404 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:19:19.404 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:19:19.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:19:19.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:19:19.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:19:19.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:19:19.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:19:19.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:19:21.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:19:22.836 [31081] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:19:32.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 17:19:32.757 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426222,ok=426222,error=0, records=41
[INFO ] 2026-06-01 17:19:36.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:19:37.840 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:19:47.766 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 17:19:47.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426223,ok=426223,error=0, records=41
[INFO ] 2026-06-01 17:19:51.906 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:19:52.847 [30548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:20:01.260 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21326/300s
[INFO ] 2026-06-01 17:20:02.772 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 17:20:02.772 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426224,ok=426224,error=0, records=41
[INFO ] 2026-06-01 17:20:06.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:20:07.853 [30542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:20:09.853 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21317/300s
[INFO ] 2026-06-01 17:20:17.778 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 17:20:17.778 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426225,ok=426225,error=0, records=41
[INFO ] 2026-06-01 17:20:21.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:20:22.858 [31115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:20:32.830 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 17:20:32.830 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426226,ok=426226,error=0, records=41
[INFO ] 2026-06-01 17:20:32.830 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21313/300s
[INFO ] 2026-06-01 17:20:36.908 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:20:37.863 [31228] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:20:41.859 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21326/300s
[INFO ] 2026-06-01 17:20:47.836 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 17:20:47.836 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426227,ok=426227,error=0, records=41
[INFO ] 2026-06-01 17:20:51.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:20:52.868 [31228] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:21:02.842 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 17:21:02.842 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426228,ok=426228,error=0, records=41
[INFO ] 2026-06-01 17:21:06.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:21:07.873 [31181] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:21:17.848 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 17:21:17.848 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426229,ok=426229,error=0, records=41
[INFO ] 2026-06-01 17:21:21.910 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:21:22.878 [31228] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:21:26.213 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21313/300s
[INFO ] 2026-06-01 17:21:32.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 17:21:32.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426230,ok=426230,error=0, records=41
[INFO ] 2026-06-01 17:21:36.910 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:21:37.884 [31081] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:21:40.385 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21322/300s
[INFO ] 2026-06-01 17:21:47.858 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 17:21:47.858 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426231,ok=426231,error=0, records=41
[INFO ] 2026-06-01 17:21:51.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:21:52.889 [31304] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:22:02.864 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 17:22:02.864 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426232,ok=426232,error=0, records=41
[INFO ] 2026-06-01 17:22:06.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:22:06.911 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21325/300s
[WARN ] 2026-06-01 17:22:07.894 [31271] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:22:17.869 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 17:22:17.869 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426233,ok=426233,error=0, records=41
[INFO ] 2026-06-01 17:22:19.404 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17755/300s
[INFO ] 2026-06-01 17:22:19.405 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848572},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:22:19.587 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:22:19.587 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 17:22:19.587 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:22:19.587 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:22:19.587 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:22:19.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:22:19.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:22:19.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:22:19.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:22:19.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:22:19.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:22:21.912 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:22:22.900 [31314] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:22:32.875 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 17:22:32.875 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426234,ok=426234,error=0, records=41
[INFO ] 2026-06-01 17:22:36.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:22:37.905 [31360] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:22:46.500 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21323/300s
[INFO ] 2026-06-01 17:22:47.882 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 17:22:47.882 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426235,ok=426235,error=0, records=41
[INFO ] 2026-06-01 17:22:48.602 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21323/300s
[INFO ] 2026-06-01 17:22:51.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:22:52.911 [31370] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:22:55.909 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21323/300s
[INFO ] 2026-06-01 17:23:02.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 17:23:02.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426236,ok=426236,error=0, records=41
[INFO ] 2026-06-01 17:23:06.914 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:23:07.917 [31392] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:23:17.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 17:23:17.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426237,ok=426237,error=0, records=41
[INFO ] 2026-06-01 17:23:21.914 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:23:22.922 [31409] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:23:32.897 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 17:23:32.897 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426238,ok=426238,error=0, records=41
[INFO ] 2026-06-01 17:23:36.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 17:23:36.915 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 17:23:37.927 [31420] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:23:47.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 17:23:47.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426239,ok=426239,error=0, records=41
[INFO ] 2026-06-01 17:23:51.916 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:23:51.916 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 17:23:52.934 [31426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:24:02.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 17:24:02.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426240,ok=426240,error=0, records=41
[INFO ] 2026-06-01 17:24:06.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:24:07.942 [31457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:24:17.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 17:24:17.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426241,ok=426241,error=0, records=41
[INFO ] 2026-06-01 17:24:21.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:24:22.948 [31450] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:24:32.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 17:24:32.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426242,ok=426242,error=0, records=41
[INFO ] 2026-06-01 17:24:36.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:24:37.953 [31420] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:24:47.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 17:24:47.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426243,ok=426243,error=0, records=41
[INFO ] 2026-06-01 17:24:51.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:24:52.958 [31457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:25:01.263 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21327/300s
[INFO ] 2026-06-01 17:25:02.939 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 17:25:02.939 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426244,ok=426244,error=0, records=41
[INFO ] 2026-06-01 17:25:06.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:25:07.963 [31480] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:25:09.963 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21318/300s
[INFO ] 2026-06-01 17:25:17.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 17:25:17.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426245,ok=426245,error=0, records=41
[INFO ] 2026-06-01 17:25:19.589 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848488},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:25:19.756 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:25:19.756 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 17:25:19.756 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:25:19.756 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:25:19.756 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:25:19.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:25:19.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:25:19.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:25:19.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:25:19.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:25:19.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:25:21.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:25:22.967 [31480] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:25:32.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 17:25:32.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426246,ok=426246,error=0, records=41
[INFO ] 2026-06-01 17:25:32.949 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21314/300s
[INFO ] 2026-06-01 17:25:36.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:25:37.971 [31494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:25:41.865 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21327/300s
[INFO ] 2026-06-01 17:25:47.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 17:25:47.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426247,ok=426247,error=0, records=41
[INFO ] 2026-06-01 17:25:51.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:25:52.977 [31457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:26:02.961 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 17:26:02.962 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426248,ok=426248,error=0, records=41
[INFO ] 2026-06-01 17:26:06.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:26:07.981 [31480] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:26:17.967 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 17:26:17.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426249,ok=426249,error=0, records=41
[INFO ] 2026-06-01 17:26:21.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:26:22.986 [31480] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:26:26.399 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21314/300s
[INFO ] 2026-06-01 17:26:32.975 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 17:26:32.975 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426250,ok=426250,error=0, records=41
[INFO ] 2026-06-01 17:26:36.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:26:37.991 [31565] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:26:40.452 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21323/300s
[INFO ] 2026-06-01 17:26:47.982 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 17:26:47.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426251,ok=426251,error=0, records=41
[INFO ] 2026-06-01 17:26:51.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:26:52.997 [31565] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:27:02.987 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 17:27:02.987 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426252,ok=426252,error=0, records=41
[INFO ] 2026-06-01 17:27:06.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:27:06.925 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21326/300s
[WARN ] 2026-06-01 17:27:08.001 [31579] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:27:17.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 17:27:17.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426253,ok=426253,error=0, records=41
[INFO ] 2026-06-01 17:27:21.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:27:23.008 [31607] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:27:32.999 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 17:27:32.999 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426254,ok=426254,error=0, records=41
[INFO ] 2026-06-01 17:27:36.926 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:27:38.012 [31607] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:27:46.574 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21324/300s
[INFO ] 2026-06-01 17:27:48.005 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 17:27:48.005 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426255,ok=426255,error=0, records=41
[INFO ] 2026-06-01 17:27:48.675 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21324/300s
[INFO ] 2026-06-01 17:27:51.926 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:27:53.018 [31649] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:27:55.981 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21324/300s
[INFO ] 2026-06-01 17:28:03.011 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 17:28:03.011 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426256,ok=426256,error=0, records=41
[INFO ] 2026-06-01 17:28:06.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:28:08.024 [31593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:28:18.016 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 17:28:18.016 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426257,ok=426257,error=0, records=41
[INFO ] 2026-06-01 17:28:19.756 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17756/300s
[INFO ] 2026-06-01 17:28:19.758 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848412},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:28:19.920 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:28:19.921 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 17:28:19.921 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:28:19.921 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:28:19.921 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:28:19.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:28:19.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:28:19.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:28:19.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:28:19.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:28:19.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:28:21.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:28:23.028 [31635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:28:33.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 17:28:33.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426258,ok=426258,error=0, records=41
[INFO ] 2026-06-01 17:28:36.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:28:38.033 [31706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:28:48.030 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11224, records=44
[INFO ] 2026-06-01 17:28:48.030 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426259,ok=426259,error=0, records=44
[INFO ] 2026-06-01 17:28:51.929 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:28:53.039 [31593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:29:03.039 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-01 17:29:03.039 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426260,ok=426260,error=0, records=41
[INFO ] 2026-06-01 17:29:06.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:29:08.044 [31744] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:29:18.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 17:29:18.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426261,ok=426261,error=0, records=41
[INFO ] 2026-06-01 17:29:21.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:29:23.052 [31755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:29:33.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 17:29:33.050 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426262,ok=426262,error=0, records=41
[INFO ] 2026-06-01 17:29:36.931 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:29:37.556 [31749] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:29:48.058 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 17:29:48.058 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426263,ok=426263,error=0, records=41
[INFO ] 2026-06-01 17:29:51.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:29:52.562 [31788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:30:01.266 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21328/300s
[INFO ] 2026-06-01 17:30:03.064 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 17:30:03.064 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426264,ok=426264,error=0, records=41
[INFO ] 2026-06-01 17:30:06.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:30:07.566 [31810] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:30:10.067 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21319/300s
[INFO ] 2026-06-01 17:30:18.070 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 17:30:18.070 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426265,ok=426265,error=0, records=41
[INFO ] 2026-06-01 17:30:21.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=33.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:30:22.572 [31836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:30:33.076 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 17:30:33.076 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426266,ok=426266,error=0, records=41
[INFO ] 2026-06-01 17:30:33.076 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21315/300s
[INFO ] 2026-06-01 17:30:36.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:30:37.577 [31853] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:30:41.872 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21328/300s
[INFO ] 2026-06-01 17:30:48.082 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 17:30:48.082 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426267,ok=426267,error=0, records=41
[INFO ] 2026-06-01 17:30:51.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=33.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:30:52.584 [31869] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:31:03.087 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 17:31:03.087 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426268,ok=426268,error=0, records=41
[INFO ] 2026-06-01 17:31:06.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:31:07.591 [31869] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:31:18.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 17:31:18.102 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426269,ok=426269,error=0, records=41
[INFO ] 2026-06-01 17:31:19.922 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848328},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:31:20.088 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:31:20.088 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 17:31:20.088 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:31:20.088 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:31:20.088 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:31:20.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:31:20.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:31:20.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:31:20.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:31:20.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:31:20.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:31:21.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:31:22.596 [31833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:31:26.585 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21315/300s
[INFO ] 2026-06-01 17:31:33.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 17:31:33.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426270,ok=426270,error=0, records=41
[INFO ] 2026-06-01 17:31:36.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:31:37.601 [31893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:31:40.509 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21324/300s
[INFO ] 2026-06-01 17:31:48.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 17:31:48.114 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426271,ok=426271,error=0, records=41
[INFO ] 2026-06-01 17:31:51.937 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:31:52.608 [31833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:32:03.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-01 17:32:03.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426272,ok=426272,error=0, records=41
[INFO ] 2026-06-01 17:32:06.937 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:32:06.937 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21327/300s
[WARN ] 2026-06-01 17:32:07.613 [31900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:32:18.124 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 17:32:18.124 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426273,ok=426273,error=0, records=41
[INFO ] 2026-06-01 17:32:21.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:32:22.618 [31828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:32:33.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-01 17:32:33.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426274,ok=426274,error=0, records=41
[INFO ] 2026-06-01 17:32:36.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:32:37.624 [31900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:32:46.638 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21325/300s
[INFO ] 2026-06-01 17:32:48.149 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-01 17:32:48.149 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426275,ok=426275,error=0, records=41
[INFO ] 2026-06-01 17:32:48.740 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21325/300s
[INFO ] 2026-06-01 17:32:51.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:32:52.631 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:32:56.047 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21325/300s
[INFO ] 2026-06-01 17:33:03.156 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 17:33:03.156 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426276,ok=426276,error=0, records=41
[INFO ] 2026-06-01 17:33:06.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:33:07.635 [31900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:33:18.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 17:33:18.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426277,ok=426277,error=0, records=41
[INFO ] 2026-06-01 17:33:21.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:33:22.640 [31833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:33:33.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 17:33:33.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426278,ok=426278,error=0, records=41
[INFO ] 2026-06-01 17:33:36.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 17:33:36.941 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 17:33:37.645 [31900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:33:48.182 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 17:33:48.182 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426279,ok=426279,error=0, records=41
[INFO ] 2026-06-01 17:33:51.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:33:52.651 [31833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:34:03.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10391, records=41
[INFO ] 2026-06-01 17:34:03.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426280,ok=426280,error=0, records=41
[INFO ] 2026-06-01 17:34:06.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:34:07.657 [31900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:34:18.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-01 17:34:18.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426281,ok=426281,error=0, records=41
[INFO ] 2026-06-01 17:34:20.088 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17757/300s
[INFO ] 2026-06-01 17:34:20.090 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848256},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:34:20.225 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:34:20.225 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 17:34:20.225 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:34:20.225 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:34:20.225 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:34:20.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:34:20.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:34:20.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:34:20.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:34:20.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:34:20.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:34:21.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:34:22.661 [31833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:34:33.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-01 17:34:33.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426282,ok=426282,error=0, records=41
[INFO ] 2026-06-01 17:34:36.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:34:37.666 [31833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:34:48.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-01 17:34:48.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426283,ok=426283,error=0, records=41
[INFO ] 2026-06-01 17:34:51.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:34:52.670 [31828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:35:01.269 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21329/300s
[INFO ] 2026-06-01 17:35:03.216 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 17:35:03.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426284,ok=426284,error=0, records=41
[INFO ] 2026-06-01 17:35:06.945 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:35:07.675 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:35:10.176 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21320/300s
[INFO ] 2026-06-01 17:35:18.220 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 17:35:18.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426285,ok=426285,error=0, records=41
[INFO ] 2026-06-01 17:35:21.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:35:22.680 [31828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:35:33.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 17:35:33.226 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426286,ok=426286,error=0, records=41
[INFO ] 2026-06-01 17:35:33.226 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21316/300s
[INFO ] 2026-06-01 17:35:36.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:35:37.684 [31828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:35:41.879 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21329/300s
[INFO ] 2026-06-01 17:35:48.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 17:35:48.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426287,ok=426287,error=0, records=41
[INFO ] 2026-06-01 17:35:51.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:35:52.689 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:36:03.236 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 17:36:03.236 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426288,ok=426288,error=0, records=41
[INFO ] 2026-06-01 17:36:06.948 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:36:07.696 [31900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:36:18.242 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 17:36:18.242 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426289,ok=426289,error=0, records=41
[INFO ] 2026-06-01 17:36:21.948 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:36:22.701 [31900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:36:26.728 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21316/300s
[INFO ] 2026-06-01 17:36:33.247 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 17:36:33.247 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426290,ok=426290,error=0, records=41
[INFO ] 2026-06-01 17:36:36.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:36:37.707 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:36:40.567 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21325/300s
[INFO ] 2026-06-01 17:36:48.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 17:36:48.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426291,ok=426291,error=0, records=41
[INFO ] 2026-06-01 17:36:51.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:36:52.712 [31893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:37:03.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 17:37:03.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426292,ok=426292,error=0, records=41
[INFO ] 2026-06-01 17:37:06.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:37:06.950 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21328/300s
[WARN ] 2026-06-01 17:37:07.716 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:37:18.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 17:37:18.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426293,ok=426293,error=0, records=41
[INFO ] 2026-06-01 17:37:20.227 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848172},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:37:20.399 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:37:20.399 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 17:37:20.399 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:37:20.399 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:37:20.399 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:37:20.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:37:20.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:37:20.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:37:20.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:37:20.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:37:20.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:37:21.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:37:22.722 [31828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:37:33.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 17:37:33.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426294,ok=426294,error=0, records=41
[INFO ] 2026-06-01 17:37:36.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:37:37.726 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:37:46.707 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21326/300s
[INFO ] 2026-06-01 17:37:48.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 17:37:48.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426295,ok=426295,error=0, records=41
[INFO ] 2026-06-01 17:37:48.810 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21326/300s
[INFO ] 2026-06-01 17:37:51.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:37:52.732 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:37:56.116 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21326/300s
[INFO ] 2026-06-01 17:38:03.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 17:38:03.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426296,ok=426296,error=0, records=41
[INFO ] 2026-06-01 17:38:06.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:38:07.739 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:38:18.290 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 17:38:18.291 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426297,ok=426297,error=0, records=41
[INFO ] 2026-06-01 17:38:21.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:38:22.744 [31833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:38:33.296 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 17:38:33.296 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426298,ok=426298,error=0, records=41
[INFO ] 2026-06-01 17:38:36.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:38:37.748 [31833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:38:48.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 17:38:48.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426299,ok=426299,error=0, records=41
[INFO ] 2026-06-01 17:38:51.955 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:38:51.955 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 17:38:52.753 [31828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:39:03.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 17:39:03.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426300,ok=426300,error=0, records=41
[INFO ] 2026-06-01 17:39:06.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:39:07.757 [31893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:39:18.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 17:39:18.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426301,ok=426301,error=0, records=41
[INFO ] 2026-06-01 17:39:21.957 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:39:22.763 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:39:33.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 17:39:33.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426302,ok=426302,error=0, records=41
[INFO ] 2026-06-01 17:39:36.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:39:37.768 [31900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:39:48.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 17:39:48.445 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426303,ok=426303,error=0, records=41
[INFO ] 2026-06-01 17:39:51.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:39:52.773 [31828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:40:01.272 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21330/300s
[INFO ] 2026-06-01 17:40:03.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 17:40:03.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426304,ok=426304,error=0, records=41
[INFO ] 2026-06-01 17:40:06.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:40:07.779 [31900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:40:10.280 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21321/300s
[INFO ] 2026-06-01 17:40:18.457 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 17:40:18.457 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426305,ok=426305,error=0, records=41
[INFO ] 2026-06-01 17:40:20.399 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17758/300s
[INFO ] 2026-06-01 17:40:20.401 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848096},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:40:20.565 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:40:20.565 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 17:40:20.566 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:40:20.566 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:40:20.566 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:40:20.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:40:20.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:40:20.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:40:20.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:40:20.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:40:20.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:40:21.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:40:22.785 [31900] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:40:33.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 17:40:33.462 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426306,ok=426306,error=0, records=41
[INFO ] 2026-06-01 17:40:33.462 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21317/300s
[INFO ] 2026-06-01 17:40:36.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:40:37.789 [31893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:40:41.885 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21330/300s
[INFO ] 2026-06-01 17:40:48.467 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 17:40:48.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426307,ok=426307,error=0, records=41
[INFO ] 2026-06-01 17:40:51.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:40:52.794 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:41:03.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 17:41:03.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426308,ok=426308,error=0, records=41
[INFO ] 2026-06-01 17:41:06.961 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:41:07.799 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:41:18.480 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 17:41:18.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426309,ok=426309,error=0, records=41
[INFO ] 2026-06-01 17:41:21.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:41:22.804 [31828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:41:26.907 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21317/300s
[INFO ] 2026-06-01 17:41:33.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 17:41:33.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426310,ok=426310,error=0, records=41
[INFO ] 2026-06-01 17:41:36.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:41:37.809 [31882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:41:40.621 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21326/300s
[INFO ] 2026-06-01 17:41:48.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 17:41:48.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426311,ok=426311,error=0, records=41
[INFO ] 2026-06-01 17:41:51.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:41:52.815 [32450] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:42:03.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 17:42:03.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426312,ok=426312,error=0, records=41
[INFO ] 2026-06-01 17:42:06.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:42:06.964 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21329/300s
[WARN ] 2026-06-01 17:42:07.821 [32450] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:42:18.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 17:42:18.501 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426313,ok=426313,error=0, records=41
[INFO ] 2026-06-01 17:42:21.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:42:22.825 [32496] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:42:33.517 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 17:42:33.517 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426314,ok=426314,error=0, records=41
[INFO ] 2026-06-01 17:42:36.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:42:37.830 [32496] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:42:46.761 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21327/300s
[INFO ] 2026-06-01 17:42:48.525 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 17:42:48.525 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426315,ok=426315,error=0, records=41
[INFO ] 2026-06-01 17:42:48.863 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21327/300s
[INFO ] 2026-06-01 17:42:51.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:42:52.835 [32482] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:42:56.168 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21327/300s
[INFO ] 2026-06-01 17:43:03.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 17:43:03.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426316,ok=426316,error=0, records=41
[INFO ] 2026-06-01 17:43:06.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:43:07.842 [31893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:43:18.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 17:43:18.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426317,ok=426317,error=0, records=41
[INFO ] 2026-06-01 17:43:20.567 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848012},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:43:20.732 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:43:20.732 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 17:43:20.732 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:43:20.732 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:43:20.732 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:43:20.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:43:20.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:43:20.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:43:20.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:43:20.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:43:20.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:43:21.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:43:22.847 [32510] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:43:33.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 17:43:33.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426318,ok=426318,error=0, records=41
[INFO ] 2026-06-01 17:43:36.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 17:43:36.968 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 17:43:37.852 [32510] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:43:48.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 17:43:48.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426319,ok=426319,error=0, records=41
[INFO ] 2026-06-01 17:43:51.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:43:52.856 [32574] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:44:03.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 17:44:03.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426320,ok=426320,error=0, records=41
[INFO ] 2026-06-01 17:44:06.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:44:07.863 [31893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:44:18.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 17:44:18.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426321,ok=426321,error=0, records=41
[INFO ] 2026-06-01 17:44:21.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:44:22.867 [32601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:44:33.567 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 17:44:33.567 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426322,ok=426322,error=0, records=41
[INFO ] 2026-06-01 17:44:36.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:44:37.871 [32615] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:44:48.576 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 17:44:48.576 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426323,ok=426323,error=0, records=41
[INFO ] 2026-06-01 17:44:51.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:44:52.875 [32635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:45:01.276 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21331/300s
[INFO ] 2026-06-01 17:45:03.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10382, records=41
[INFO ] 2026-06-01 17:45:03.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426324,ok=426324,error=0, records=41
[INFO ] 2026-06-01 17:45:06.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:45:07.885 [32531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:45:10.387 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21322/300s
[INFO ] 2026-06-01 17:45:18.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 17:45:18.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426325,ok=426325,error=0, records=41
[INFO ] 2026-06-01 17:45:21.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:45:22.892 [31893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:45:33.640 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-01 17:45:33.640 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426326,ok=426326,error=0, records=41
[INFO ] 2026-06-01 17:45:33.640 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21318/300s
[INFO ] 2026-06-01 17:45:36.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:45:37.898 [32661] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:45:41.892 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21331/300s
[INFO ] 2026-06-01 17:45:49.666 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10372, records=41
[INFO ] 2026-06-01 17:45:49.666 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426327,ok=426327,error=0, records=41
[INFO ] 2026-06-01 17:45:51.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:45:52.904 [32701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:46:04.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 17:46:04.765 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426328,ok=426328,error=0, records=41
[INFO ] 2026-06-01 17:46:06.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:46:07.909 [32674] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:46:19.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 17:46:19.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426329,ok=426329,error=0, records=41
[INFO ] 2026-06-01 17:46:20.732 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17759/300s
[INFO ] 2026-06-01 17:46:20.734 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847936},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:46:20.886 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:46:20.886 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 17:46:20.886 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:46:20.886 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:46:20.886 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:46:20.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:46:20.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:46:20.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:46:20.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:46:20.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:46:20.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:46:21.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:46:22.914 [32734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:46:27.088 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21318/300s
[INFO ] 2026-06-01 17:46:34.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 17:46:34.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426330,ok=426330,error=0, records=41
[INFO ] 2026-06-01 17:46:36.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:46:37.919 [31893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:46:40.677 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21327/300s
[INFO ] 2026-06-01 17:46:49.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 17:46:49.783 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426331,ok=426331,error=0, records=41
[INFO ] 2026-06-01 17:46:51.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:46:52.924 [32755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:47:04.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 17:47:04.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426332,ok=426332,error=0, records=41
[INFO ] 2026-06-01 17:47:06.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:47:06.976 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21330/300s
[WARN ] 2026-06-01 17:47:07.931 [309  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:47:19.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 17:47:19.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426333,ok=426333,error=0, records=41
[INFO ] 2026-06-01 17:47:21.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:47:22.937 [32755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:47:34.798 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 17:47:34.798 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426334,ok=426334,error=0, records=41
[INFO ] 2026-06-01 17:47:36.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:47:37.942 [32755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:47:46.827 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21328/300s
[INFO ] 2026-06-01 17:47:48.929 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21328/300s
[INFO ] 2026-06-01 17:47:49.804 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 17:47:49.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426335,ok=426335,error=0, records=41
[INFO ] 2026-06-01 17:47:51.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:47:52.949 [32755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:47:56.234 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21328/300s
[INFO ] 2026-06-01 17:48:04.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 17:48:04.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426336,ok=426336,error=0, records=41
[INFO ] 2026-06-01 17:48:06.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:48:07.954 [327  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:48:19.818 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 17:48:19.818 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426337,ok=426337,error=0, records=41
[INFO ] 2026-06-01 17:48:21.979 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:48:22.960 [385  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:48:34.825 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 17:48:34.825 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426338,ok=426338,error=0, records=41
[INFO ] 2026-06-01 17:48:36.979 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:48:37.965 [32750] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:48:49.833 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 17:48:49.833 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426339,ok=426339,error=0, records=41
[INFO ] 2026-06-01 17:48:51.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:48:52.970 [327  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:49:04.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 17:49:04.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426340,ok=426340,error=0, records=41
[INFO ] 2026-06-01 17:49:06.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:49:07.975 [32750] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:49:19.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-01 17:49:19.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426341,ok=426341,error=0, records=41
[INFO ] 2026-06-01 17:49:20.888 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847864},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:49:21.052 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:49:21.052 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 17:49:21.052 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:49:21.052 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:49:21.052 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:49:21.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:49:21.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:49:21.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:49:21.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:49:21.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:49:21.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:49:21.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:49:22.980 [310  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:49:34.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 17:49:34.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426342,ok=426342,error=0, records=41
[INFO ] 2026-06-01 17:49:36.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:49:37.985 [385  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:49:49.860 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 17:49:49.860 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426343,ok=426343,error=0, records=41
[INFO ] 2026-06-01 17:49:51.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:49:52.991 [456  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:50:01.279 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21332/300s
[INFO ] 2026-06-01 17:50:04.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10736, records=44
[INFO ] 2026-06-01 17:50:04.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426344,ok=426344,error=0, records=44
[INFO ] 2026-06-01 17:50:06.983 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:50:07.996 [310  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:50:10.497 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21323/300s
[INFO ] 2026-06-01 17:50:19.870 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 17:50:19.870 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426345,ok=426345,error=0, records=41
[INFO ] 2026-06-01 17:50:21.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:50:23.002 [310  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:50:34.876 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 17:50:34.876 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426346,ok=426346,error=0, records=41
[INFO ] 2026-06-01 17:50:34.876 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21319/300s
[INFO ] 2026-06-01 17:50:36.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:50:38.007 [310  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:50:41.899 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21332/300s
[INFO ] 2026-06-01 17:50:49.881 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 17:50:49.881 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426347,ok=426347,error=0, records=41
[INFO ] 2026-06-01 17:50:51.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:50:53.012 [472  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:51:04.886 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 17:51:04.886 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426348,ok=426348,error=0, records=41
[INFO ] 2026-06-01 17:51:06.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:51:08.017 [508  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:51:19.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 17:51:19.893 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426349,ok=426349,error=0, records=41
[INFO ] 2026-06-01 17:51:21.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:51:23.022 [508  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:51:27.269 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21319/300s
[INFO ] 2026-06-01 17:51:34.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 17:51:34.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426350,ok=426350,error=0, records=41
[INFO ] 2026-06-01 17:51:36.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:51:38.027 [560  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:51:40.731 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21328/300s
[INFO ] 2026-06-01 17:51:49.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 17:51:49.905 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426351,ok=426351,error=0, records=41
[INFO ] 2026-06-01 17:51:51.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:51:53.032 [310  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:52:04.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 17:52:04.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426352,ok=426352,error=0, records=41
[INFO ] 2026-06-01 17:52:06.988 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:52:06.988 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21331/300s
[WARN ] 2026-06-01 17:52:08.038 [494  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:52:19.921 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 17:52:19.921 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426353,ok=426353,error=0, records=41
[INFO ] 2026-06-01 17:52:21.052 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17760/300s
[INFO ] 2026-06-01 17:52:21.054 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847780},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:52:21.220 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:52:21.220 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 17:52:21.220 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:52:21.221 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:52:21.221 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:52:21.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:52:21.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:52:21.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:52:21.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:52:21.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:52:21.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:52:21.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:52:23.043 [603  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:52:34.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 17:52:34.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426354,ok=426354,error=0, records=41
[INFO ] 2026-06-01 17:52:36.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:52:38.048 [639  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:52:46.887 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21329/300s
[INFO ] 2026-06-01 17:52:48.989 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21329/300s
[INFO ] 2026-06-01 17:52:49.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 17:52:49.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426355,ok=426355,error=0, records=41
[INFO ] 2026-06-01 17:52:51.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:52:53.053 [649  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:52:56.296 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21329/300s
[INFO ] 2026-06-01 17:53:04.935 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 17:53:04.935 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426356,ok=426356,error=0, records=41
[INFO ] 2026-06-01 17:53:06.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:53:07.558 [669  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:53:19.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 17:53:19.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426357,ok=426357,error=0, records=41
[INFO ] 2026-06-01 17:53:21.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:53:22.563 [701  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:53:34.946 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 17:53:34.946 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426358,ok=426358,error=0, records=41
[INFO ] 2026-06-01 17:53:36.992 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 17:53:36.992 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 17:53:37.568 [717  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:53:49.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 17:53:49.952 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426359,ok=426359,error=0, records=41
[INFO ] 2026-06-01 17:53:51.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:53:51.993 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 17:53:52.573 [700  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:54:04.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 17:54:04.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426360,ok=426360,error=0, records=41
[INFO ] 2026-06-01 17:54:06.994 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:54:07.578 [725  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:54:19.975 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 17:54:19.975 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426361,ok=426361,error=0, records=41
[INFO ] 2026-06-01 17:54:21.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:54:22.582 [771  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:54:34.980 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 17:54:34.980 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426362,ok=426362,error=0, records=41
[INFO ] 2026-06-01 17:54:36.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:54:37.587 [783  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:54:49.986 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 17:54:49.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426363,ok=426363,error=0, records=41
[INFO ] 2026-06-01 17:54:51.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:54:52.592 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:55:01.283 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21333/300s
[INFO ] 2026-06-01 17:55:04.992 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 17:55:04.992 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426364,ok=426364,error=0, records=41
[INFO ] 2026-06-01 17:55:06.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:55:07.597 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:55:10.598 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21324/300s
[INFO ] 2026-06-01 17:55:20.005 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 17:55:20.005 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426365,ok=426365,error=0, records=41
[INFO ] 2026-06-01 17:55:21.222 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847708},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:55:21.376 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:55:21.376 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 17:55:21.376 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:55:21.376 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:55:21.376 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:55:21.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:55:21.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:55:21.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:55:21.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:55:21.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:55:21.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:55:21.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:55:22.602 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:55:35.012 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 17:55:35.012 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426366,ok=426366,error=0, records=41
[INFO ] 2026-06-01 17:55:35.012 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21320/300s
[INFO ] 2026-06-01 17:55:36.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:55:37.608 [777  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:55:41.906 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21333/300s
[INFO ] 2026-06-01 17:55:50.017 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 17:55:50.017 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426367,ok=426367,error=0, records=41
[INFO ] 2026-06-01 17:55:51.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:55:52.612 [725  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:56:05.025 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 17:56:05.025 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426368,ok=426368,error=0, records=41
[INFO ] 2026-06-01 17:56:06.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:56:07.617 [777  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:56:20.030 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 17:56:20.030 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426369,ok=426369,error=0, records=41
[INFO ] 2026-06-01 17:56:22.000 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:56:22.622 [777  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:56:27.454 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21320/300s
[INFO ] 2026-06-01 17:56:35.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 17:56:35.036 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426370,ok=426370,error=0, records=41
[INFO ] 2026-06-01 17:56:37.000 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:56:37.627 [814  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:56:40.787 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21329/300s
[INFO ] 2026-06-01 17:56:50.044 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 17:56:50.044 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426371,ok=426371,error=0, records=41
[INFO ] 2026-06-01 17:56:52.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:56:52.631 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:57:05.050 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-01 17:57:05.050 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426372,ok=426372,error=0, records=41
[INFO ] 2026-06-01 17:57:07.002 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 17:57:07.002 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21332/300s
[WARN ] 2026-06-01 17:57:07.637 [725  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:57:20.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 17:57:20.055 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426373,ok=426373,error=0, records=41
[INFO ] 2026-06-01 17:57:22.002 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:57:22.642 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:57:35.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-01 17:57:35.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426374,ok=426374,error=0, records=41
[INFO ] 2026-06-01 17:57:37.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:57:37.646 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:57:46.980 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21330/300s
[INFO ] 2026-06-01 17:57:49.066 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21330/300s
[INFO ] 2026-06-01 17:57:50.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-01 17:57:50.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426375,ok=426375,error=0, records=41
[INFO ] 2026-06-01 17:57:52.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:57:52.653 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:57:56.371 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21330/300s
[INFO ] 2026-06-01 17:58:05.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 17:58:05.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426376,ok=426376,error=0, records=41
[INFO ] 2026-06-01 17:58:07.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:58:07.659 [777  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:58:20.084 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 17:58:20.084 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426377,ok=426377,error=0, records=41
[INFO ] 2026-06-01 17:58:21.376 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17761/300s
[INFO ] 2026-06-01 17:58:21.378 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847608},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 17:58:21.522 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 17:58:21.522 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 17:58:21.522 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 17:58:21.522 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 17:58:21.522 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 17:58:21.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:58:21.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 17:58:21.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:58:21.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 17:58:21.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:58:21.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 17:58:22.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:58:22.664 [788  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 17:58:32.668 [788  ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23752/stat), No such file or directory
[INFO ] 2026-06-01 17:58:35.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 17:58:35.089 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426378,ok=426378,error=0, records=41
[INFO ] 2026-06-01 17:58:37.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:58:37.669 [777  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 17:58:47.672 [725  ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/27824/stat), No such file or directory
[WARN ] 2026-06-01 17:58:47.672 [725  ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23752/stat), No such file or directory
[WARN ] 2026-06-01 17:58:47.672 [725  ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/29047/stat), No such file or directory
[INFO ] 2026-06-01 17:58:50.097 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 17:58:50.097 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426379,ok=426379,error=0, records=41
[INFO ] 2026-06-01 17:58:52.006 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:58:52.673 [814  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:59:05.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 17:59:05.103 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426380,ok=426380,error=0, records=41
[INFO ] 2026-06-01 17:59:07.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:59:07.677 [725  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:59:20.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 17:59:20.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426381,ok=426381,error=0, records=41
[INFO ] 2026-06-01 17:59:22.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:59:22.682 [814  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:59:35.115 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 17:59:35.115 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426382,ok=426382,error=0, records=41
[INFO ] 2026-06-01 17:59:37.008 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:59:37.687 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 17:59:50.122 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 17:59:50.122 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426383,ok=426383,error=0, records=41
[INFO ] 2026-06-01 17:59:52.008 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 17:59:52.692 [725  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:00:01.286 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21334/300s
[INFO ] 2026-06-01 18:00:05.228 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 18:00:05.228 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426384,ok=426384,error=0, records=41
[INFO ] 2026-06-01 18:00:07.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:00:07.698 [777  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:00:10.698 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21325/300s
[INFO ] 2026-06-01 18:00:20.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=12998, records=54
[INFO ] 2026-06-01 18:00:20.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426385,ok=426385,error=0, records=54
[INFO ] 2026-06-01 18:00:22.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:00:22.703 [814  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:00:35.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 18:00:35.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426386,ok=426386,error=0, records=41
[INFO ] 2026-06-01 18:00:35.239 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21321/300s
[INFO ] 2026-06-01 18:00:37.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:00:37.709 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:00:41.914 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21334/300s
[INFO ] 2026-06-01 18:00:50.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 18:00:50.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426387,ok=426387,error=0, records=41
[INFO ] 2026-06-01 18:00:52.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:00:52.714 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:01:05.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 18:01:05.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426388,ok=426388,error=0, records=41
[INFO ] 2026-06-01 18:01:07.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:01:07.720 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:01:20.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 18:01:20.262 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426389,ok=426389,error=0, records=41
[INFO ] 2026-06-01 18:01:21.524 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847524},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:01:21.688 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:01:21.688 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 18:01:21.688 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:01:21.688 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:01:21.688 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:01:21.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:01:21.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:01:21.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:01:21.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:01:21.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:01:21.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:01:22.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:01:22.727 [788  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:01:27.631 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21321/300s
[INFO ] 2026-06-01 18:01:35.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 18:01:35.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426390,ok=426390,error=0, records=41
[INFO ] 2026-06-01 18:01:37.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:01:37.733 [777  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:01:40.838 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21330/300s
[INFO ] 2026-06-01 18:01:50.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 18:01:50.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426391,ok=426391,error=0, records=41
[INFO ] 2026-06-01 18:01:52.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:01:52.738 [725  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:02:05.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 18:02:05.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426392,ok=426392,error=0, records=41
[INFO ] 2026-06-01 18:02:07.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:02:07.013 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21333/300s
[WARN ] 2026-06-01 18:02:07.744 [725  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:02:20.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 18:02:20.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426393,ok=426393,error=0, records=41
[INFO ] 2026-06-01 18:02:22.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:02:22.749 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:02:35.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 18:02:35.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426394,ok=426394,error=0, records=41
[INFO ] 2026-06-01 18:02:37.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:02:37.754 [814  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:02:47.021 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21331/300s
[INFO ] 2026-06-01 18:02:49.095 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21331/300s
[INFO ] 2026-06-01 18:02:50.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 18:02:50.304 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426395,ok=426395,error=0, records=41
[INFO ] 2026-06-01 18:02:52.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:02:52.759 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:02:56.401 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21331/300s
[INFO ] 2026-06-01 18:03:05.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 18:03:05.309 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426396,ok=426396,error=0, records=41
[INFO ] 2026-06-01 18:03:07.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:03:07.763 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:03:20.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 18:03:20.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426397,ok=426397,error=0, records=41
[INFO ] 2026-06-01 18:03:22.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:03:22.767 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:03:35.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 18:03:35.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426398,ok=426398,error=0, records=41
[INFO ] 2026-06-01 18:03:37.017 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 18:03:37.017 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 18:03:37.771 [814  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:03:50.327 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 18:03:50.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426399,ok=426399,error=0, records=41
[INFO ] 2026-06-01 18:03:52.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:03:52.777 [777  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:04:05.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 18:04:05.337 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426400,ok=426400,error=0, records=41
[INFO ] 2026-06-01 18:04:07.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:04:07.784 [788  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:04:20.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-01 18:04:20.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426401,ok=426401,error=0, records=41
[INFO ] 2026-06-01 18:04:21.688 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17762/300s
[INFO ] 2026-06-01 18:04:21.690 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847448},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:04:21.866 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:04:21.866 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 18:04:21.866 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:04:21.866 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:04:21.866 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:04:21.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:04:21.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:04:21.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:04:21.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:04:21.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:04:21.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:04:22.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:04:22.789 [725  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:04:35.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 18:04:35.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426402,ok=426402,error=0, records=41
[INFO ] 2026-06-01 18:04:37.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:04:37.793 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:04:50.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 18:04:50.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426403,ok=426403,error=0, records=41
[INFO ] 2026-06-01 18:04:52.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:04:52.798 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:05:01.289 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21335/300s
[INFO ] 2026-06-01 18:05:05.359 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 18:05:05.359 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426404,ok=426404,error=0, records=41
[INFO ] 2026-06-01 18:05:07.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:05:07.803 [788  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:05:10.803 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21326/300s
[INFO ] 2026-06-01 18:05:20.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 18:05:20.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426405,ok=426405,error=0, records=41
[INFO ] 2026-06-01 18:05:22.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:05:22.808 [1514 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:05:35.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 18:05:35.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426406,ok=426406,error=0, records=41
[INFO ] 2026-06-01 18:05:35.371 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21322/300s
[INFO ] 2026-06-01 18:05:37.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:05:37.813 [1514 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:05:41.920 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21335/300s
[INFO ] 2026-06-01 18:05:50.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 18:05:50.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426407,ok=426407,error=0, records=41
[INFO ] 2026-06-01 18:05:52.023 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:05:52.818 [1514 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:06:05.382 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-01 18:06:05.382 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426408,ok=426408,error=0, records=41
[INFO ] 2026-06-01 18:06:07.023 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:06:07.824 [808  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:06:20.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-01 18:06:20.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426409,ok=426409,error=0, records=41
[INFO ] 2026-06-01 18:06:22.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:06:22.829 [1547 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:06:27.812 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21322/300s
[INFO ] 2026-06-01 18:06:35.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 18:06:35.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426410,ok=426410,error=0, records=41
[INFO ] 2026-06-01 18:06:37.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:06:37.835 [1547 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:06:40.889 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21331/300s
[INFO ] 2026-06-01 18:06:50.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 18:06:50.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426411,ok=426411,error=0, records=41
[INFO ] 2026-06-01 18:06:52.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:06:52.839 [1595 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:07:05.407 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 18:07:05.407 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426412,ok=426412,error=0, records=41
[INFO ] 2026-06-01 18:07:07.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:07:07.026 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21334/300s
[WARN ] 2026-06-01 18:07:07.845 [1581 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:07:20.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 18:07:20.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426413,ok=426413,error=0, records=41
[INFO ] 2026-06-01 18:07:21.868 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847368},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:07:22.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-01 18:07:22.047 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:07:22.047 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 18:07:22.048 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:07:22.048 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:07:22.048 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:07:22.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:07:22.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:07:22.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:07:22.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:07:22.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:07:22.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 18:07:22.849 [1595 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:07:35.418 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 18:07:35.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426414,ok=426414,error=0, records=41
[INFO ] 2026-06-01 18:07:37.027 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:07:37.858 [1617 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:07:47.081 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21332/300s
[INFO ] 2026-06-01 18:07:49.143 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21332/300s
[INFO ] 2026-06-01 18:07:50.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 18:07:50.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426415,ok=426415,error=0, records=41
[INFO ] 2026-06-01 18:07:52.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:07:52.864 [1595 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:07:56.447 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21332/300s
[INFO ] 2026-06-01 18:08:05.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 18:08:05.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426416,ok=426416,error=0, records=41
[INFO ] 2026-06-01 18:08:07.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:08:07.870 [1595 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:08:20.434 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-01 18:08:20.434 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426417,ok=426417,error=0, records=41
[INFO ] 2026-06-01 18:08:22.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:08:22.875 [1595 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:08:35.439 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-01 18:08:35.439 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426418,ok=426418,error=0, records=41
[INFO ] 2026-06-01 18:08:37.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:08:37.880 [1547 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:08:50.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 18:08:50.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426419,ok=426419,error=0, records=41
[INFO ] 2026-06-01 18:08:52.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:08:52.030 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 18:08:52.885 [1719 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:09:05.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 18:09:05.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426420,ok=426420,error=0, records=41
[INFO ] 2026-06-01 18:09:07.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:09:07.891 [1743 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:09:20.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 18:09:20.456 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426421,ok=426421,error=0, records=41
[INFO ] 2026-06-01 18:09:22.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:09:22.896 [1743 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:09:35.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 18:09:35.462 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426422,ok=426422,error=0, records=41
[INFO ] 2026-06-01 18:09:37.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:09:37.902 [1775 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:09:50.467 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 18:09:50.467 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426423,ok=426423,error=0, records=41
[INFO ] 2026-06-01 18:09:52.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:09:52.906 [1786 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:10:01.292 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21336/300s
[INFO ] 2026-06-01 18:10:05.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 18:10:05.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426424,ok=426424,error=0, records=41
[INFO ] 2026-06-01 18:10:07.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:10:07.911 [1816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:10:10.912 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21327/300s
[INFO ] 2026-06-01 18:10:20.480 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 18:10:20.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426425,ok=426425,error=0, records=41
[INFO ] 2026-06-01 18:10:22.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:10:22.048 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17763/300s
[INFO ] 2026-06-01 18:10:22.049 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847288},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:10:22.229 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:10:22.229 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 18:10:22.229 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:10:22.229 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:10:22.229 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:10:22.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:10:22.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:10:22.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:10:22.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:10:22.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:10:22.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 18:10:22.917 [1809 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:10:35.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 18:10:35.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426426,ok=426426,error=0, records=41
[INFO ] 2026-06-01 18:10:35.486 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21323/300s
[INFO ] 2026-06-01 18:10:37.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:10:37.924 [1737 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:10:41.927 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21336/300s
[INFO ] 2026-06-01 18:10:50.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 18:10:50.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426427,ok=426427,error=0, records=41
[INFO ] 2026-06-01 18:10:52.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:10:52.930 [1857 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:11:05.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-06-01 18:11:05.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426428,ok=426428,error=0, records=41
[INFO ] 2026-06-01 18:11:07.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:11:07.935 [1826 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:11:20.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 18:11:20.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426429,ok=426429,error=0, records=41
[INFO ] 2026-06-01 18:11:22.037 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:11:22.941 [1737 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:11:27.993 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21323/300s
[INFO ] 2026-06-01 18:11:35.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-01 18:11:35.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426430,ok=426430,error=0, records=41
[INFO ] 2026-06-01 18:11:37.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:11:37.946 [1910 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:11:40.943 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21332/300s
[INFO ] 2026-06-01 18:11:50.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-01 18:11:50.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426431,ok=426431,error=0, records=41
[INFO ] 2026-06-01 18:11:52.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:11:52.951 [1737 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:12:05.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 18:12:05.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426432,ok=426432,error=0, records=41
[INFO ] 2026-06-01 18:12:07.039 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:12:07.039 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21335/300s
[WARN ] 2026-06-01 18:12:07.955 [1737 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:12:20.607 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 18:12:20.607 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426433,ok=426433,error=0, records=41
[INFO ] 2026-06-01 18:12:22.039 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:12:22.960 [1905 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:12:35.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 18:12:35.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426434,ok=426434,error=0, records=41
[INFO ] 2026-06-01 18:12:37.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:12:37.965 [1905 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:12:47.140 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21333/300s
[INFO ] 2026-06-01 18:12:49.188 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21333/300s
[INFO ] 2026-06-01 18:12:50.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 18:12:50.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426435,ok=426435,error=0, records=41
[INFO ] 2026-06-01 18:12:52.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:12:52.969 [1910 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:12:56.494 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21333/300s
[INFO ] 2026-06-01 18:13:05.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 18:13:05.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426436,ok=426436,error=0, records=41
[INFO ] 2026-06-01 18:13:07.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:13:07.974 [1982 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:13:20.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 18:13:20.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426437,ok=426437,error=0, records=41
[INFO ] 2026-06-01 18:13:22.042 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:13:22.231 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847212},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:13:22.382 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:13:22.382 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 18:13:22.382 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:13:22.382 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:13:22.382 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:13:22.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:13:22.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:13:22.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:13:22.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:13:22.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:13:22.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 18:13:22.979 [2010 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:13:35.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 18:13:35.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426438,ok=426438,error=0, records=41
[INFO ] 2026-06-01 18:13:37.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 18:13:37.043 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 18:13:37.983 [2025 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:13:50.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 18:13:50.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426439,ok=426439,error=0, records=41
[INFO ] 2026-06-01 18:13:52.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:13:52.990 [1737 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:14:05.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 18:14:05.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426440,ok=426440,error=0, records=41
[INFO ] 2026-06-01 18:14:07.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:14:07.995 [1941 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:14:20.663 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 18:14:20.663 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426441,ok=426441,error=0, records=41
[INFO ] 2026-06-01 18:14:22.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:14:23.000 [2025 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:14:35.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 18:14:35.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426442,ok=426442,error=0, records=41
[INFO ] 2026-06-01 18:14:37.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:14:38.005 [1941 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:14:50.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 18:14:50.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426443,ok=426443,error=0, records=41
[INFO ] 2026-06-01 18:14:52.046 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:14:53.010 [2095 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:15:01.295 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21337/300s
[INFO ] 2026-06-01 18:15:05.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 18:15:05.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426444,ok=426444,error=0, records=41
[INFO ] 2026-06-01 18:15:07.046 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:15:08.015 [2081 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:15:11.016 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21328/300s
[INFO ] 2026-06-01 18:15:20.688 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 18:15:20.688 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426445,ok=426445,error=0, records=41
[INFO ] 2026-06-01 18:15:22.047 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:15:23.020 [2052 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:15:35.693 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 18:15:35.693 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426446,ok=426446,error=0, records=41
[INFO ] 2026-06-01 18:15:35.693 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21324/300s
[INFO ] 2026-06-01 18:15:37.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:15:38.025 [2095 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:15:41.933 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21337/300s
[INFO ] 2026-06-01 18:15:50.701 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 18:15:50.702 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426447,ok=426447,error=0, records=41
[INFO ] 2026-06-01 18:15:52.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:15:53.030 [2052 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:16:05.708 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 18:16:05.708 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426448,ok=426448,error=0, records=41
[INFO ] 2026-06-01 18:16:07.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:16:08.035 [2136 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:16:20.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 18:16:20.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426449,ok=426449,error=0, records=41
[INFO ] 2026-06-01 18:16:22.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:16:22.382 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17764/300s
[INFO ] 2026-06-01 18:16:22.383 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847144},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:16:22.545 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:16:22.545 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 18:16:22.546 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:16:22.546 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:16:22.546 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:16:22.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:16:22.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:16:22.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:16:22.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:16:22.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:16:22.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 18:16:23.040 [2164 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:16:28.172 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21324/300s
[INFO ] 2026-06-01 18:16:35.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 18:16:35.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426450,ok=426450,error=0, records=41
[INFO ] 2026-06-01 18:16:37.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:16:38.046 [2199 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:16:40.992 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21333/300s
[INFO ] 2026-06-01 18:16:50.742 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 18:16:50.742 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426451,ok=426451,error=0, records=41
[INFO ] 2026-06-01 18:16:52.051 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:16:53.051 [2216 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:17:05.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 18:17:05.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426452,ok=426452,error=0, records=41
[INFO ] 2026-06-01 18:17:07.051 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:17:07.051 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21336/300s
[WARN ] 2026-06-01 18:17:07.556 [2227 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:17:20.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 18:17:20.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426453,ok=426453,error=0, records=41
[INFO ] 2026-06-01 18:17:22.052 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:17:22.560 [2249 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:17:35.759 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 18:17:35.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426454,ok=426454,error=0, records=41
[INFO ] 2026-06-01 18:17:37.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:17:37.564 [2238 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:17:47.194 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21334/300s
[INFO ] 2026-06-01 18:17:49.233 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21334/300s
[INFO ] 2026-06-01 18:17:50.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 18:17:50.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426455,ok=426455,error=0, records=41
[INFO ] 2026-06-01 18:17:52.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:17:52.568 [2269 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:17:56.539 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21334/300s
[INFO ] 2026-06-01 18:18:05.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 18:18:05.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426456,ok=426456,error=0, records=41
[INFO ] 2026-06-01 18:18:07.054 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:18:07.573 [2269 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:18:20.779 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 18:18:20.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426457,ok=426457,error=0, records=41
[INFO ] 2026-06-01 18:18:22.054 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:18:22.579 [2315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:18:35.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-01 18:18:35.789 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426458,ok=426458,error=0, records=41
[INFO ] 2026-06-01 18:18:37.055 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:18:37.583 [2340 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:18:50.795 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 18:18:50.795 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426459,ok=426459,error=0, records=41
[INFO ] 2026-06-01 18:18:52.056 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:18:52.589 [2339 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:19:05.801 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 18:19:05.801 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426460,ok=426460,error=0, records=41
[INFO ] 2026-06-01 18:19:07.056 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:19:07.593 [2356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:19:20.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 18:19:20.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426461,ok=426461,error=0, records=41
[INFO ] 2026-06-01 18:19:22.057 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:19:22.547 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847072},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-01 18:19:22.598 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:19:22.707 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:19:22.707 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 18:19:22.707 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:19:22.707 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:19:22.707 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:19:22.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:19:22.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:19:22.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:19:22.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:19:22.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:19:22.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:19:35.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 18:19:35.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426462,ok=426462,error=0, records=41
[INFO ] 2026-06-01 18:19:37.057 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:19:37.604 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:19:50.862 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 18:19:50.862 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426463,ok=426463,error=0, records=41
[INFO ] 2026-06-01 18:19:52.058 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:19:52.609 [2389 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:20:01.299 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21338/300s
[INFO ] 2026-06-01 18:20:05.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 18:20:05.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426464,ok=426464,error=0, records=41
[INFO ] 2026-06-01 18:20:07.059 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:20:07.614 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:20:11.115 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21329/300s
[INFO ] 2026-06-01 18:20:20.873 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 18:20:20.874 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426465,ok=426465,error=0, records=41
[INFO ] 2026-06-01 18:20:22.059 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:20:22.619 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:20:35.879 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 18:20:35.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426466,ok=426466,error=0, records=41
[INFO ] 2026-06-01 18:20:35.880 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21325/300s
[INFO ] 2026-06-01 18:20:37.060 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:20:37.625 [2356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:20:41.939 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21338/300s
[INFO ] 2026-06-01 18:20:50.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-01 18:20:50.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426467,ok=426467,error=0, records=41
[INFO ] 2026-06-01 18:20:52.060 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:20:52.630 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:21:05.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 18:21:05.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426468,ok=426468,error=0, records=41
[INFO ] 2026-06-01 18:21:07.061 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:21:07.635 [2356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:21:20.897 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-01 18:21:20.897 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426469,ok=426469,error=0, records=41
[INFO ] 2026-06-01 18:21:22.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:21:22.640 [2389 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:21:28.354 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21325/300s
[INFO ] 2026-06-01 18:21:35.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-01 18:21:35.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426470,ok=426470,error=0, records=41
[INFO ] 2026-06-01 18:21:37.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:21:37.645 [2389 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:21:41.047 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21334/300s
[INFO ] 2026-06-01 18:21:50.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 18:21:50.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426471,ok=426471,error=0, records=41
[INFO ] 2026-06-01 18:21:52.063 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:21:52.650 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:22:05.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 18:22:05.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426472,ok=426472,error=0, records=41
[INFO ] 2026-06-01 18:22:07.063 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:22:07.063 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21337/300s
[WARN ] 2026-06-01 18:22:07.656 [2374 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:22:20.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 18:22:20.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426473,ok=426473,error=0, records=41
[INFO ] 2026-06-01 18:22:22.064 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:22:22.660 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:22:22.707 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17765/300s
[INFO ] 2026-06-01 18:22:22.709 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847004},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:22:22.857 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:22:22.857 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 18:22:22.857 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:22:22.857 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:22:22.857 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:22:22.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:22:22.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:22:22.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:22:22.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:22:22.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:22:22.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:22:35.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 18:22:35.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426474,ok=426474,error=0, records=41
[INFO ] 2026-06-01 18:22:37.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:22:37.666 [2389 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:22:47.263 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21335/300s
[INFO ] 2026-06-01 18:22:49.276 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21335/300s
[INFO ] 2026-06-01 18:22:50.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 18:22:50.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426475,ok=426475,error=0, records=41
[INFO ] 2026-06-01 18:22:52.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:22:52.671 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:22:56.581 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21335/300s
[INFO ] 2026-06-01 18:23:05.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 18:23:05.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426476,ok=426476,error=0, records=41
[INFO ] 2026-06-01 18:23:07.066 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:23:07.675 [2389 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:23:20.945 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 18:23:20.945 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426477,ok=426477,error=0, records=41
[INFO ] 2026-06-01 18:23:22.066 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:23:22.680 [2389 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:23:35.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 18:23:35.950 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426478,ok=426478,error=0, records=41
[INFO ] 2026-06-01 18:23:37.067 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 18:23:37.067 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 18:23:37.685 [2356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:23:50.959 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 18:23:50.959 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426479,ok=426479,error=0, records=41
[INFO ] 2026-06-01 18:23:52.068 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:23:52.068 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 18:23:52.690 [2374 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:24:05.974 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 18:24:05.974 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426480,ok=426480,error=0, records=41
[INFO ] 2026-06-01 18:24:07.069 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:24:07.695 [2356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:24:20.980 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 18:24:20.980 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426481,ok=426481,error=0, records=41
[INFO ] 2026-06-01 18:24:22.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:24:22.699 [2396 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:24:35.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 18:24:35.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426482,ok=426482,error=0, records=41
[INFO ] 2026-06-01 18:24:37.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:24:37.705 [2374 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:24:50.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 18:24:50.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426483,ok=426483,error=0, records=41
[INFO ] 2026-06-01 18:24:52.071 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:24:52.711 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:25:01.302 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21339/300s
[INFO ] 2026-06-01 18:25:05.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-06-01 18:25:05.996 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426484,ok=426484,error=0, records=41
[INFO ] 2026-06-01 18:25:07.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:25:07.716 [2396 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:25:11.217 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21330/300s
[INFO ] 2026-06-01 18:25:21.004 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 18:25:21.004 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426485,ok=426485,error=0, records=41
[INFO ] 2026-06-01 18:25:22.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:25:22.722 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:25:22.859 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846940},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:25:23.021 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:25:23.021 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 18:25:23.021 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:25:23.021 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:25:23.021 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:25:23.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:25:23.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:25:23.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:25:23.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:25:23.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:25:23.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:25:36.008 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 18:25:36.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426486,ok=426486,error=0, records=41
[INFO ] 2026-06-01 18:25:36.009 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21326/300s
[INFO ] 2026-06-01 18:25:37.073 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:25:37.727 [2389 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:25:41.946 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21339/300s
[INFO ] 2026-06-01 18:25:51.014 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 18:25:51.014 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426487,ok=426487,error=0, records=41
[INFO ] 2026-06-01 18:25:52.073 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:25:52.732 [2374 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:26:06.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 18:26:06.020 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426488,ok=426488,error=0, records=41
[INFO ] 2026-06-01 18:26:07.074 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:26:07.737 [2356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:26:21.025 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 18:26:21.026 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426489,ok=426489,error=0, records=41
[INFO ] 2026-06-01 18:26:22.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:26:22.742 [2374 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:26:28.540 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21326/300s
[INFO ] 2026-06-01 18:26:36.033 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 18:26:36.033 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426490,ok=426490,error=0, records=41
[INFO ] 2026-06-01 18:26:37.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:26:37.747 [2374 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:26:41.101 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21335/300s
[INFO ] 2026-06-01 18:26:51.037 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 18:26:51.037 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426491,ok=426491,error=0, records=41
[INFO ] 2026-06-01 18:26:52.076 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:26:52.752 [2356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:27:06.043 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 18:27:06.043 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426492,ok=426492,error=0, records=41
[INFO ] 2026-06-01 18:27:07.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:27:07.077 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21338/300s
[WARN ] 2026-06-01 18:27:07.757 [2356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:27:21.048 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10149, records=41
[INFO ] 2026-06-01 18:27:21.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426493,ok=426493,error=0, records=41
[INFO ] 2026-06-01 18:27:22.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:27:22.763 [2356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:27:36.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 18:27:36.055 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426494,ok=426494,error=0, records=41
[INFO ] 2026-06-01 18:27:37.078 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:27:37.768 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:27:47.327 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21336/300s
[INFO ] 2026-06-01 18:27:49.328 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21336/300s
[INFO ] 2026-06-01 18:27:51.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 18:27:51.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426495,ok=426495,error=0, records=41
[INFO ] 2026-06-01 18:27:52.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:27:52.774 [2396 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:27:56.635 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21336/300s
[INFO ] 2026-06-01 18:28:06.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10381, records=41
[INFO ] 2026-06-01 18:28:06.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426496,ok=426496,error=0, records=41
[INFO ] 2026-06-01 18:28:07.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:28:07.779 [2396 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:28:21.074 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 18:28:21.074 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426497,ok=426497,error=0, records=41
[INFO ] 2026-06-01 18:28:22.080 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:28:22.785 [2389 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:28:23.021 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17766/300s
[INFO ] 2026-06-01 18:28:23.022 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846872},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:28:23.168 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:28:23.168 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 18:28:23.169 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:28:23.169 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:28:23.169 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:28:23.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:28:23.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:28:23.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:28:23.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:28:23.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:28:23.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:28:36.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 18:28:36.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426498,ok=426498,error=0, records=41
[INFO ] 2026-06-01 18:28:37.080 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:28:37.790 [2364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:28:51.086 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10389, records=41
[INFO ] 2026-06-01 18:28:51.086 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426499,ok=426499,error=0, records=41
[INFO ] 2026-06-01 18:28:52.081 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:28:52.795 [2356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:29:06.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 18:29:06.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426500,ok=426500,error=0, records=41
[INFO ] 2026-06-01 18:29:07.081 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:29:07.801 [2374 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:29:21.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 18:29:21.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426501,ok=426501,error=0, records=41
[INFO ] 2026-06-01 18:29:22.082 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:29:22.807 [2356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 18:29:32.811 [2389 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1078/stat), No such file or directory
[WARN ] 2026-06-01 18:29:32.811 [2389 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23419/stat), No such file or directory
[WARN ] 2026-06-01 18:29:32.811 [2389 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1082/stat), No such file or directory
[WARN ] 2026-06-01 18:29:32.812 [2389 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/2179/stat), No such file or directory
[INFO ] 2026-06-01 18:29:36.168 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 18:29:36.168 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426502,ok=426502,error=0, records=41
[INFO ] 2026-06-01 18:29:37.082 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:29:37.812 [2953 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 18:29:47.817 [2959 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1078/stat), No such file or directory
[WARN ] 2026-06-01 18:29:47.817 [2959 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/23419/stat), No such file or directory
[WARN ] 2026-06-01 18:29:47.817 [2959 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1082/stat), No such file or directory
[WARN ] 2026-06-01 18:29:47.817 [2959 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/2182/stat), No such file or directory
[WARN ] 2026-06-01 18:29:47.817 [2959 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/2179/stat), No such file or directory
[WARN ] 2026-06-01 18:29:47.818 [2959 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1044/stat), No such file or directory
[WARN ] 2026-06-01 18:29:47.818 [2959 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1080/stat), No such file or directory
[INFO ] 2026-06-01 18:29:51.174 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 18:29:51.174 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426503,ok=426503,error=0, records=41
[INFO ] 2026-06-01 18:29:52.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:29:52.819 [2953 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:30:01.306 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21340/300s
[INFO ] 2026-06-01 18:30:06.179 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 18:30:06.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426504,ok=426504,error=0, records=41
[INFO ] 2026-06-01 18:30:07.084 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:30:07.824 [2983 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:30:11.325 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21331/300s
[INFO ] 2026-06-01 18:30:21.206 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 18:30:21.206 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426505,ok=426505,error=0, records=41
[INFO ] 2026-06-01 18:30:22.084 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:30:22.830 [3022 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:30:36.212 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 18:30:36.212 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426506,ok=426506,error=0, records=41
[INFO ] 2026-06-01 18:30:36.212 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21327/300s
[INFO ] 2026-06-01 18:30:37.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:30:37.835 [2953 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:30:41.952 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21340/300s
[INFO ] 2026-06-01 18:30:51.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 18:30:51.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426507,ok=426507,error=0, records=41
[INFO ] 2026-06-01 18:30:52.086 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:30:52.840 [3022 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:31:06.225 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 18:31:06.225 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426508,ok=426508,error=0, records=41
[INFO ] 2026-06-01 18:31:07.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:31:07.845 [2989 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:31:21.230 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 18:31:21.230 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426509,ok=426509,error=0, records=41
[INFO ] 2026-06-01 18:31:22.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:31:22.850 [2953 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:31:23.171 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846768},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:31:23.351 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:31:23.351 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 18:31:23.351 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:31:23.351 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:31:23.351 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:31:23.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:31:23.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:31:23.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:31:23.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:31:23.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:31:23.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:31:28.719 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21327/300s
[INFO ] 2026-06-01 18:31:36.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 18:31:36.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426510,ok=426510,error=0, records=41
[INFO ] 2026-06-01 18:31:37.088 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:31:37.855 [2959 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:31:41.154 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21336/300s
[INFO ] 2026-06-01 18:31:51.242 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 18:31:51.242 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426511,ok=426511,error=0, records=41
[INFO ] 2026-06-01 18:31:52.089 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:31:52.859 [3099 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:32:06.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 18:32:06.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426512,ok=426512,error=0, records=41
[INFO ] 2026-06-01 18:32:07.089 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:32:07.089 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21339/300s
[WARN ] 2026-06-01 18:32:07.864 [3099 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:32:21.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 18:32:21.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426513,ok=426513,error=0, records=41
[INFO ] 2026-06-01 18:32:22.090 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:32:22.870 [2389 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:32:36.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 18:32:36.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426514,ok=426514,error=0, records=41
[INFO ] 2026-06-01 18:32:37.091 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:32:37.875 [3142 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:32:47.399 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21337/300s
[INFO ] 2026-06-01 18:32:49.401 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21337/300s
[INFO ] 2026-06-01 18:32:51.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 18:32:51.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426515,ok=426515,error=0, records=41
[INFO ] 2026-06-01 18:32:52.091 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:32:52.881 [3142 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:32:56.708 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21337/300s
[INFO ] 2026-06-01 18:33:06.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 18:33:06.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426516,ok=426516,error=0, records=41
[INFO ] 2026-06-01 18:33:07.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:33:07.887 [3162 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:33:21.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 18:33:21.368 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426517,ok=426517,error=0, records=41
[INFO ] 2026-06-01 18:33:22.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:33:22.892 [3189 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:33:36.374 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-01 18:33:36.374 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426518,ok=426518,error=0, records=41
[INFO ] 2026-06-01 18:33:37.093 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 18:33:37.093 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 18:33:37.897 [3206 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:33:51.379 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 18:33:51.379 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426519,ok=426519,error=0, records=41
[INFO ] 2026-06-01 18:33:52.094 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:33:52.907 [3225 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:34:06.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 18:34:06.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426520,ok=426520,error=0, records=41
[INFO ] 2026-06-01 18:34:07.094 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:34:07.913 [3232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:34:21.391 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-01 18:34:21.391 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426521,ok=426521,error=0, records=41
[INFO ] 2026-06-01 18:34:22.095 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:34:22.919 [3248 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:34:23.352 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17767/300s
[INFO ] 2026-06-01 18:34:23.353 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846684},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:34:23.519 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:34:23.519 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 18:34:23.519 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:34:23.519 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:34:23.519 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:34:23.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:34:23.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:34:23.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:34:23.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:34:23.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:34:23.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:34:36.396 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-01 18:34:36.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426522,ok=426522,error=0, records=41
[INFO ] 2026-06-01 18:34:37.095 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:34:37.925 [3217 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:34:51.401 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 18:34:51.401 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426523,ok=426523,error=0, records=41
[INFO ] 2026-06-01 18:34:52.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:34:52.930 [3298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:35:01.310 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21341/300s
[INFO ] 2026-06-01 18:35:06.407 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 18:35:06.407 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426524,ok=426524,error=0, records=41
[INFO ] 2026-06-01 18:35:07.097 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:35:07.935 [3309 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:35:11.436 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21332/300s
[INFO ] 2026-06-01 18:35:21.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 18:35:21.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426525,ok=426525,error=0, records=41
[INFO ] 2026-06-01 18:35:22.097 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:35:22.940 [3324 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:35:36.418 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 18:35:36.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426526,ok=426526,error=0, records=41
[INFO ] 2026-06-01 18:35:36.418 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21328/300s
[INFO ] 2026-06-01 18:35:37.098 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:35:37.945 [3308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:35:41.959 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21341/300s
[INFO ] 2026-06-01 18:35:51.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 18:35:51.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426527,ok=426527,error=0, records=41
[INFO ] 2026-06-01 18:35:52.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:35:52.951 [3341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:36:06.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 18:36:06.429 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426528,ok=426528,error=0, records=41
[INFO ] 2026-06-01 18:36:07.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:36:07.956 [3308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:36:21.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 18:36:21.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426529,ok=426529,error=0, records=41
[INFO ] 2026-06-01 18:36:22.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:36:22.962 [3335 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:36:28.897 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21328/300s
[INFO ] 2026-06-01 18:36:36.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 18:36:36.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426530,ok=426530,error=0, records=41
[INFO ] 2026-06-01 18:36:37.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:36:37.966 [3308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:36:41.207 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21337/300s
[INFO ] 2026-06-01 18:36:51.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 18:36:51.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426531,ok=426531,error=0, records=41
[INFO ] 2026-06-01 18:36:52.101 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:36:52.970 [3412 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:37:06.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 18:37:06.448 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426532,ok=426532,error=0, records=41
[INFO ] 2026-06-01 18:37:07.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:37:07.102 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21340/300s
[WARN ] 2026-06-01 18:37:07.976 [3315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:37:21.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 18:37:21.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426533,ok=426533,error=0, records=41
[INFO ] 2026-06-01 18:37:22.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:37:22.982 [3341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:37:23.521 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846612},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:37:23.675 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:37:23.675 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 18:37:36.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 18:37:36.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426534,ok=426534,error=0, records=41
[INFO ] 2026-06-01 18:37:37.103 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:37:37.988 [3439 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:37:47.471 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21338/300s
[INFO ] 2026-06-01 18:37:49.473 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21338/300s
[INFO ] 2026-06-01 18:37:51.467 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 18:37:51.467 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426535,ok=426535,error=0, records=41
[INFO ] 2026-06-01 18:37:52.104 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:37:52.992 [3315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:37:56.779 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21338/300s
[INFO ] 2026-06-01 18:38:06.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 18:38:06.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426536,ok=426536,error=0, records=41
[INFO ] 2026-06-01 18:38:07.104 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:38:07.997 [3352 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:38:21.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 18:38:21.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426537,ok=426537,error=0, records=41
[INFO ] 2026-06-01 18:38:22.105 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:38:23.003 [3315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:38:36.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 18:38:36.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426538,ok=426538,error=0, records=41
[INFO ] 2026-06-01 18:38:37.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:38:38.008 [3352 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:38:51.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 18:38:51.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426539,ok=426539,error=0, records=41
[INFO ] 2026-06-01 18:38:52.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:38:52.106 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 18:38:53.013 [3352 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:39:06.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 18:39:06.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426540,ok=426540,error=0, records=41
[INFO ] 2026-06-01 18:39:07.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:39:08.017 [3341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:39:21.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 18:39:21.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426541,ok=426541,error=0, records=41
[INFO ] 2026-06-01 18:39:22.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:39:23.022 [3536 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:39:36.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 18:39:36.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426542,ok=426542,error=0, records=41
[INFO ] 2026-06-01 18:39:37.109 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:39:38.027 [3341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:39:51.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 18:39:51.521 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426543,ok=426543,error=0, records=41
[INFO ] 2026-06-01 18:39:52.109 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:39:53.032 [3341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:40:01.313 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21342/300s
[INFO ] 2026-06-01 18:40:06.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 18:40:06.526 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426544,ok=426544,error=0, records=41
[INFO ] 2026-06-01 18:40:07.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:40:08.037 [3615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:40:11.538 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21333/300s
[INFO ] 2026-06-01 18:40:21.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 18:40:21.532 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426545,ok=426545,error=0, records=41
[INFO ] 2026-06-01 18:40:22.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:40:23.042 [3685 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:40:23.675 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17768/300s
[INFO ] 2026-06-01 18:40:23.676 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846520},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:40:23.850 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:40:23.850 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 18:40:23.850 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:40:23.850 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:40:23.850 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:40:23.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:40:23.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:40:23.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:40:23.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:40:23.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:40:23.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:40:36.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 18:40:36.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426546,ok=426546,error=0, records=41
[INFO ] 2026-06-01 18:40:36.537 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21329/300s
[INFO ] 2026-06-01 18:40:37.111 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:40:38.046 [3685 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:40:41.965 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21342/300s
[WARN ] 2026-06-01 18:40:47.549 [3696 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/1079/stat), No such file or directory
[WARN ] 2026-06-01 18:40:47.550 [3696 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/2931/stat), No such file or directory
[INFO ] 2026-06-01 18:40:51.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 18:40:51.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426547,ok=426547,error=0, records=41
[INFO ] 2026-06-01 18:40:52.112 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:40:53.051 [3719 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:41:06.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 18:41:06.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426548,ok=426548,error=0, records=41
[INFO ] 2026-06-01 18:41:07.112 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:41:07.557 [3729 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:41:21.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 18:41:21.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426549,ok=426549,error=0, records=41
[INFO ] 2026-06-01 18:41:22.113 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:41:22.561 [3733 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:41:29.079 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21329/300s
[INFO ] 2026-06-01 18:41:36.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 18:41:36.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426550,ok=426550,error=0, records=41
[INFO ] 2026-06-01 18:41:37.113 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:41:37.566 [3737 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:41:41.263 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21338/300s
[INFO ] 2026-06-01 18:41:51.565 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-01 18:41:51.565 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426551,ok=426551,error=0, records=41
[INFO ] 2026-06-01 18:41:52.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:41:52.572 [3772 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:42:06.570 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 18:42:06.570 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426552,ok=426552,error=0, records=41
[INFO ] 2026-06-01 18:42:07.115 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:42:07.115 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21341/300s
[WARN ] 2026-06-01 18:42:07.577 [3790 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:42:21.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-01 18:42:21.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426553,ok=426553,error=0, records=41
[INFO ] 2026-06-01 18:42:22.115 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:42:22.584 [3814 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:42:36.582 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 18:42:36.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426554,ok=426554,error=0, records=41
[INFO ] 2026-06-01 18:42:37.116 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:42:37.589 [3844 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:42:47.531 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21339/300s
[INFO ] 2026-06-01 18:42:49.533 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21339/300s
[INFO ] 2026-06-01 18:42:51.587 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-01 18:42:51.588 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426555,ok=426555,error=0, records=41
[INFO ] 2026-06-01 18:42:52.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:42:52.594 [3836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:42:56.839 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21339/300s
[INFO ] 2026-06-01 18:43:06.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11379, records=45
[INFO ] 2026-06-01 18:43:06.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426556,ok=426556,error=0, records=45
[INFO ] 2026-06-01 18:43:07.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:43:07.599 [3837 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:43:21.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 18:43:21.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426557,ok=426557,error=0, records=41
[INFO ] 2026-06-01 18:43:22.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:43:22.604 [3840 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:43:23.852 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846444},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:43:24.022 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:43:24.022 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 18:43:24.022 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:43:24.022 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:43:24.022 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:43:24.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:43:24.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:43:24.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:43:24.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:43:24.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:43:24.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:43:36.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 18:43:36.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426558,ok=426558,error=0, records=41
[INFO ] 2026-06-01 18:43:37.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 18:43:37.118 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 18:43:37.609 [3837 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:43:51.616 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 18:43:51.616 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426559,ok=426559,error=0, records=41
[INFO ] 2026-06-01 18:43:52.119 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:43:52.614 [3840 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:44:06.622 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 18:44:06.622 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426560,ok=426560,error=0, records=41
[INFO ] 2026-06-01 18:44:07.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:44:07.619 [3840 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:44:21.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 18:44:21.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426561,ok=426561,error=0, records=41
[INFO ] 2026-06-01 18:44:22.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:44:22.623 [3855 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:44:36.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 18:44:36.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426562,ok=426562,error=0, records=41
[INFO ] 2026-06-01 18:44:37.122 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:44:37.628 [3855 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:44:51.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-01 18:44:51.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426563,ok=426563,error=0, records=41
[INFO ] 2026-06-01 18:44:52.122 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:44:52.632 [3840 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:45:01.317 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21343/300s
[INFO ] 2026-06-01 18:45:06.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 18:45:06.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426564,ok=426564,error=0, records=41
[INFO ] 2026-06-01 18:45:07.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:45:07.637 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:45:11.638 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21334/300s
[INFO ] 2026-06-01 18:45:21.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 18:45:21.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426565,ok=426565,error=0, records=41
[INFO ] 2026-06-01 18:45:22.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:45:22.642 [3836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:45:36.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 18:45:36.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426566,ok=426566,error=0, records=41
[INFO ] 2026-06-01 18:45:36.690 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21330/300s
[INFO ] 2026-06-01 18:45:37.124 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:45:37.646 [3837 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:45:41.972 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21343/300s
[INFO ] 2026-06-01 18:45:51.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 18:45:51.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426567,ok=426567,error=0, records=41
[INFO ] 2026-06-01 18:45:52.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:45:52.651 [3836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:46:06.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 18:46:06.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426568,ok=426568,error=0, records=41
[INFO ] 2026-06-01 18:46:07.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:46:07.657 [3837 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:46:21.705 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 18:46:21.705 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426569,ok=426569,error=0, records=41
[INFO ] 2026-06-01 18:46:22.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:46:22.662 [3840 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:46:24.022 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17769/300s
[INFO ] 2026-06-01 18:46:24.024 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846364},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:46:24.200 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:46:24.200 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 18:46:24.200 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:46:24.200 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:46:24.200 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:46:24.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:46:24.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:46:24.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:46:24.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:46:24.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:46:24.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:46:29.263 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21330/300s
[INFO ] 2026-06-01 18:46:36.710 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 18:46:36.710 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426570,ok=426570,error=0, records=41
[INFO ] 2026-06-01 18:46:37.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:46:37.666 [3837 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:46:41.316 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21339/300s
[INFO ] 2026-06-01 18:46:51.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 18:46:51.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426571,ok=426571,error=0, records=41
[INFO ] 2026-06-01 18:46:52.127 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:46:52.670 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:47:06.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 18:47:06.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426572,ok=426572,error=0, records=41
[INFO ] 2026-06-01 18:47:07.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:47:07.128 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21342/300s
[WARN ] 2026-06-01 18:47:07.675 [3836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:47:21.728 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 18:47:21.728 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426573,ok=426573,error=0, records=41
[INFO ] 2026-06-01 18:47:22.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:47:22.679 [3855 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:47:36.734 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 18:47:36.734 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426574,ok=426574,error=0, records=41
[INFO ] 2026-06-01 18:47:37.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:47:37.684 [3837 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:47:47.588 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21340/300s
[INFO ] 2026-06-01 18:47:49.590 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21340/300s
[INFO ] 2026-06-01 18:47:51.740 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 18:47:51.741 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426575,ok=426575,error=0, records=41
[INFO ] 2026-06-01 18:47:52.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:47:52.689 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:47:56.896 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21340/300s
[INFO ] 2026-06-01 18:48:06.746 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 18:48:06.746 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426576,ok=426576,error=0, records=41
[INFO ] 2026-06-01 18:48:07.130 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:48:07.694 [3840 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:48:21.751 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 18:48:21.751 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426577,ok=426577,error=0, records=41
[INFO ] 2026-06-01 18:48:22.131 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:48:22.699 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:48:36.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-01 18:48:36.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426578,ok=426578,error=0, records=41
[INFO ] 2026-06-01 18:48:37.131 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:48:37.703 [3837 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:48:51.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 18:48:51.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426579,ok=426579,error=0, records=41
[INFO ] 2026-06-01 18:48:52.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:48:52.708 [3836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:49:06.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 18:49:06.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426580,ok=426580,error=0, records=41
[INFO ] 2026-06-01 18:49:07.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:49:07.713 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:49:21.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 18:49:21.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426581,ok=426581,error=0, records=41
[INFO ] 2026-06-01 18:49:22.133 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:49:22.718 [3836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:49:24.202 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846292},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:49:24.374 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:49:24.374 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 18:49:24.374 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:49:24.374 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:49:24.374 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:49:24.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:49:24.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:49:24.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:49:24.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:49:24.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:49:24.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:49:36.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 18:49:36.783 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426582,ok=426582,error=0, records=41
[INFO ] 2026-06-01 18:49:37.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:49:37.723 [3840 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:49:51.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 18:49:51.789 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426583,ok=426583,error=0, records=41
[INFO ] 2026-06-01 18:49:52.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:49:52.728 [3840 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:50:01.320 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21344/300s
[INFO ] 2026-06-01 18:50:06.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 18:50:06.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426584,ok=426584,error=0, records=41
[INFO ] 2026-06-01 18:50:07.135 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:50:07.734 [3836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:50:11.735 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21335/300s
[INFO ] 2026-06-01 18:50:21.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 18:50:21.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426585,ok=426585,error=0, records=41
[INFO ] 2026-06-01 18:50:22.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:50:22.739 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:50:36.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-01 18:50:36.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426586,ok=426586,error=0, records=41
[INFO ] 2026-06-01 18:50:36.808 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21331/300s
[INFO ] 2026-06-01 18:50:37.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:50:37.745 [3855 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:50:41.978 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21344/300s
[INFO ] 2026-06-01 18:50:51.823 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-01 18:50:51.823 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426587,ok=426587,error=0, records=41
[INFO ] 2026-06-01 18:50:52.137 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:50:52.749 [3855 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:51:06.830 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 18:51:06.830 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426588,ok=426588,error=0, records=41
[INFO ] 2026-06-01 18:51:07.138 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:51:07.755 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:51:21.836 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 18:51:21.836 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426589,ok=426589,error=0, records=41
[INFO ] 2026-06-01 18:51:22.138 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:51:22.760 [3836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:51:29.447 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21331/300s
[INFO ] 2026-06-01 18:51:36.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 18:51:36.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426590,ok=426590,error=0, records=41
[INFO ] 2026-06-01 18:51:37.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:51:37.765 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:51:41.372 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21340/300s
[INFO ] 2026-06-01 18:51:51.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 18:51:51.936 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426591,ok=426591,error=0, records=41
[INFO ] 2026-06-01 18:51:52.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:51:52.769 [3836 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:52:06.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-01 18:52:06.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426592,ok=426592,error=0, records=41
[INFO ] 2026-06-01 18:52:07.140 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:52:07.140 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21343/300s
[WARN ] 2026-06-01 18:52:07.775 [3837 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:52:21.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-01 18:52:21.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426593,ok=426593,error=0, records=41
[INFO ] 2026-06-01 18:52:22.141 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:52:22.779 [3855 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:52:24.375 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17770/300s
[INFO ] 2026-06-01 18:52:24.376 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846212},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:52:24.533 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:52:24.534 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 18:52:24.534 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:52:24.534 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:52:24.534 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:52:24.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:52:24.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:52:24.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:52:24.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:52:24.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:52:24.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:52:36.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 18:52:36.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426594,ok=426594,error=0, records=41
[INFO ] 2026-06-01 18:52:37.141 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:52:37.784 [3855 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:52:47.659 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21341/300s
[INFO ] 2026-06-01 18:52:49.661 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21341/300s
[INFO ] 2026-06-01 18:52:51.962 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 18:52:51.962 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426595,ok=426595,error=0, records=41
[INFO ] 2026-06-01 18:52:52.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:52:52.788 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:52:56.967 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21341/300s
[INFO ] 2026-06-01 18:53:06.967 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 18:53:06.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426596,ok=426596,error=0, records=41
[INFO ] 2026-06-01 18:53:07.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:53:07.793 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:53:21.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 18:53:21.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426597,ok=426597,error=0, records=41
[INFO ] 2026-06-01 18:53:22.143 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:53:22.798 [3855 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:53:36.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 18:53:36.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426598,ok=426598,error=0, records=41
[INFO ] 2026-06-01 18:53:37.144 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 18:53:37.144 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 18:53:37.803 [3855 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:53:51.983 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 18:53:51.983 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426599,ok=426599,error=0, records=41
[INFO ] 2026-06-01 18:53:52.144 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:53:52.144 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 18:53:52.808 [4439 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:54:06.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 18:54:06.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426600,ok=426600,error=0, records=41
[INFO ] 2026-06-01 18:54:07.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:54:07.814 [4439 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:54:22.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 18:54:22.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426601,ok=426601,error=0, records=41
[INFO ] 2026-06-01 18:54:22.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:54:22.819 [4475 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:54:37.004 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 18:54:37.004 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426602,ok=426602,error=0, records=41
[INFO ] 2026-06-01 18:54:37.147 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:54:37.824 [4469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:54:52.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 18:54:52.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426603,ok=426603,error=0, records=41
[INFO ] 2026-06-01 18:54:52.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:54:52.828 [4489 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:55:01.323 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21345/300s
[INFO ] 2026-06-01 18:55:07.115 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 18:55:07.115 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426604,ok=426604,error=0, records=41
[INFO ] 2026-06-01 18:55:07.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:55:07.834 [4439 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:55:11.835 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21336/300s
[INFO ] 2026-06-01 18:55:22.120 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-01 18:55:22.120 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426605,ok=426605,error=0, records=41
[INFO ] 2026-06-01 18:55:22.149 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:55:22.840 [4439 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:55:24.536 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846132},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:55:24.701 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:55:24.701 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 18:55:24.701 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:55:24.701 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:55:24.701 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:55:24.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:55:24.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:55:24.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:55:24.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:55:24.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:55:24.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:55:37.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 18:55:37.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426606,ok=426606,error=0, records=41
[INFO ] 2026-06-01 18:55:37.125 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21332/300s
[INFO ] 2026-06-01 18:55:37.149 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:55:37.858 [4469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:55:41.984 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21345/300s
[INFO ] 2026-06-01 18:55:52.133 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-01 18:55:52.133 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426607,ok=426607,error=0, records=41
[INFO ] 2026-06-01 18:55:52.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:55:52.863 [4489 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:56:07.138 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 18:56:07.138 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426608,ok=426608,error=0, records=41
[INFO ] 2026-06-01 18:56:07.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:56:07.869 [4430 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:56:22.145 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 18:56:22.145 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426609,ok=426609,error=0, records=41
[INFO ] 2026-06-01 18:56:22.151 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:56:22.875 [4430 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:56:29.639 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21332/300s
[WARN ] 2026-06-01 18:56:32.379 [4469 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/3628/stat), No such file or directory
[WARN ] 2026-06-01 18:56:32.379 [4469 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/3625/stat), No such file or directory
[INFO ] 2026-06-01 18:56:37.150 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 18:56:37.150 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426610,ok=426610,error=0, records=41
[INFO ] 2026-06-01 18:56:37.151 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 18:56:37.880 [4636 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:56:41.438 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21341/300s
[WARN ] 2026-06-01 18:56:47.384 [4672 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/3628/stat), No such file or directory
[WARN ] 2026-06-01 18:56:47.384 [4672 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/3625/stat), No such file or directory
[WARN ] 2026-06-01 18:56:47.385 [4672 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/3629/stat), No such file or directory
[INFO ] 2026-06-01 18:56:52.152 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.49MB[>=200.00MB 0/4], openFiles=14[>=300 0/4]
[INFO ] 2026-06-01 18:56:52.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 18:56:52.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426611,ok=426611,error=0, records=41
[WARN ] 2026-06-01 18:56:52.885 [4469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:57:07.152 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:57:07.152 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21344/300s
[INFO ] 2026-06-01 18:57:07.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 18:57:07.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426612,ok=426612,error=0, records=41
[WARN ] 2026-06-01 18:57:07.891 [4469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:57:22.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:57:22.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 18:57:22.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426613,ok=426613,error=0, records=41
[WARN ] 2026-06-01 18:57:22.897 [4711 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:57:37.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:57:37.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 18:57:37.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426614,ok=426614,error=0, records=41
[WARN ] 2026-06-01 18:57:37.902 [4723 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:57:47.752 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21342/300s
[INFO ] 2026-06-01 18:57:49.753 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21342/300s
[INFO ] 2026-06-01 18:57:52.154 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:57:52.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 18:57:52.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426615,ok=426615,error=0, records=41
[WARN ] 2026-06-01 18:57:52.908 [4672 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:57:57.029 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21342/300s
[INFO ] 2026-06-01 18:58:07.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:58:07.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-01 18:58:07.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426616,ok=426616,error=0, records=41
[WARN ] 2026-06-01 18:58:07.914 [4757 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:58:22.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:58:22.209 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 18:58:22.209 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426617,ok=426617,error=0, records=41
[WARN ] 2026-06-01 18:58:22.920 [4723 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:58:24.701 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17771/300s
[INFO ] 2026-06-01 18:58:24.703 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846028},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 18:58:24.875 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 18:58:24.875 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 18:58:24.875 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 18:58:24.875 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 18:58:24.875 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 18:58:24.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:58:24.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 18:58:24.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:58:24.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 18:58:24.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:58:24.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 18:58:37.156 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:58:37.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 18:58:37.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426618,ok=426618,error=0, records=41
[WARN ] 2026-06-01 18:58:37.927 [4775 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:58:52.156 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:58:52.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 18:58:52.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426619,ok=426619,error=0, records=41
[WARN ] 2026-06-01 18:58:52.933 [4745 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:59:07.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:59:07.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 18:59:07.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426620,ok=426620,error=0, records=41
[WARN ] 2026-06-01 18:59:07.939 [4835 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:59:22.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:59:22.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 18:59:22.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426621,ok=426621,error=0, records=41
[WARN ] 2026-06-01 18:59:22.943 [4829 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:59:37.158 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:59:37.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 18:59:37.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426622,ok=426622,error=0, records=41
[WARN ] 2026-06-01 18:59:37.948 [4846 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 18:59:52.158 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 18:59:52.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 18:59:52.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426623,ok=426623,error=0, records=41
[WARN ] 2026-06-01 18:59:52.953 [4879 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:00:01.326 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21346/300s
[INFO ] 2026-06-01 19:00:07.159 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:00:07.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 19:00:07.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426624,ok=426624,error=0, records=41
[WARN ] 2026-06-01 19:00:07.958 [4898 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:00:11.959 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21337/300s
[INFO ] 2026-06-01 19:00:22.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:00:22.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 19:00:22.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426625,ok=426625,error=0, records=41
[WARN ] 2026-06-01 19:00:22.963 [4775 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:00:37.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:00:37.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 19:00:37.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426626,ok=426626,error=0, records=41
[INFO ] 2026-06-01 19:00:37.259 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21333/300s
[WARN ] 2026-06-01 19:00:37.967 [4898 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:00:41.990 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21346/300s
[INFO ] 2026-06-01 19:00:52.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:00:52.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 19:00:52.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426627,ok=426627,error=0, records=41
[WARN ] 2026-06-01 19:00:52.972 [4928 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:01:07.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:01:07.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 19:01:07.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426628,ok=426628,error=0, records=41
[WARN ] 2026-06-01 19:01:07.977 [4870 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:01:22.162 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:01:22.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 19:01:22.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426629,ok=426629,error=0, records=41
[WARN ] 2026-06-01 19:01:22.981 [4982 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:01:24.877 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845936},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:01:25.041 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:01:25.041 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 19:01:25.041 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:01:25.041 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:01:25.041 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:01:25.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:01:25.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:01:25.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:01:25.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:01:25.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:01:25.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:01:29.813 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21333/300s
[INFO ] 2026-06-01 19:01:37.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:01:37.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 19:01:37.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426630,ok=426630,error=0, records=41
[WARN ] 2026-06-01 19:01:37.987 [4870 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:01:41.486 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21342/300s
[INFO ] 2026-06-01 19:01:52.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:01:52.284 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-01 19:01:52.284 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426631,ok=426631,error=0, records=41
[WARN ] 2026-06-01 19:01:52.993 [4928 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:02:07.164 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:02:07.164 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21345/300s
[INFO ] 2026-06-01 19:02:07.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 19:02:07.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426632,ok=426632,error=0, records=41
[WARN ] 2026-06-01 19:02:07.998 [4982 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:02:22.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:02:22.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 19:02:22.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426633,ok=426633,error=0, records=41
[WARN ] 2026-06-01 19:02:23.002 [5024 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:02:37.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:02:37.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 19:02:37.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426634,ok=426634,error=0, records=41
[WARN ] 2026-06-01 19:02:38.006 [4982 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:02:47.793 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21343/300s
[INFO ] 2026-06-01 19:02:49.795 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21343/300s
[INFO ] 2026-06-01 19:02:52.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:02:52.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 19:02:52.309 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426635,ok=426635,error=0, records=41
[WARN ] 2026-06-01 19:02:53.012 [5067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:02:57.061 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21343/300s
[INFO ] 2026-06-01 19:03:07.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:03:07.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 19:03:07.314 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426636,ok=426636,error=0, records=41
[WARN ] 2026-06-01 19:03:08.016 [5053 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:03:22.167 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:03:22.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-01 19:03:22.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426637,ok=426637,error=0, records=41
[WARN ] 2026-06-01 19:03:23.022 [5067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:03:37.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 19:03:37.168 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 19:03:37.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 19:03:37.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426638,ok=426638,error=0, records=41
[WARN ] 2026-06-01 19:03:38.027 [5110 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[WARN ] 2026-06-01 19:03:47.531 [5095 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/3664/stat), No such file or directory
[INFO ] 2026-06-01 19:03:52.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:03:52.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 19:03:52.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426639,ok=426639,error=0, records=41
[WARN ] 2026-06-01 19:03:53.032 [5124 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:04:07.169 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:04:07.421 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 19:04:07.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426640,ok=426640,error=0, records=41
[WARN ] 2026-06-01 19:04:08.038 [5139 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:04:22.170 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:04:22.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 19:04:22.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426641,ok=426641,error=0, records=41
[WARN ] 2026-06-01 19:04:23.045 [5156 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:04:25.041 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17772/300s
[INFO ] 2026-06-01 19:04:25.043 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845812},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:04:25.187 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:04:25.187 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 19:04:25.187 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:04:25.187 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:04:25.187 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:04:25.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:04:25.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:04:25.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:04:25.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:04:25.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:04:25.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:04:37.170 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:04:37.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 19:04:37.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426642,ok=426642,error=0, records=41
[WARN ] 2026-06-01 19:04:38.049 [5172 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:04:52.171 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:04:52.439 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 19:04:52.439 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426643,ok=426643,error=0, records=41
[WARN ] 2026-06-01 19:04:53.053 [5110 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:05:01.330 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21347/300s
[INFO ] 2026-06-01 19:05:07.171 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:05:07.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 19:05:07.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426644,ok=426644,error=0, records=41
[WARN ] 2026-06-01 19:05:07.557 [5195 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:05:12.058 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21338/300s
[INFO ] 2026-06-01 19:05:22.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:05:22.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 19:05:22.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426645,ok=426645,error=0, records=41
[WARN ] 2026-06-01 19:05:22.562 [5217 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:05:37.173 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:05:37.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 19:05:37.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426646,ok=426646,error=0, records=41
[INFO ] 2026-06-01 19:05:37.476 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21334/300s
[WARN ] 2026-06-01 19:05:37.568 [5231 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:05:41.996 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21347/300s
[INFO ] 2026-06-01 19:05:52.173 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:05:52.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-01 19:05:52.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426647,ok=426647,error=0, records=41
[WARN ] 2026-06-01 19:05:52.575 [5230 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:06:07.174 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:06:07.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 19:06:07.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426648,ok=426648,error=0, records=41
[WARN ] 2026-06-01 19:06:07.580 [5258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:06:22.174 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:06:22.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 19:06:22.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426649,ok=426649,error=0, records=41
[WARN ] 2026-06-01 19:06:22.584 [5274 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:06:29.997 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21334/300s
[INFO ] 2026-06-01 19:06:37.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:06:37.502 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 19:06:37.502 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426650,ok=426650,error=0, records=41
[WARN ] 2026-06-01 19:06:37.589 [5311 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:06:41.542 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21343/300s
[INFO ] 2026-06-01 19:06:52.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:06:52.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 19:06:52.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426651,ok=426651,error=0, records=41
[WARN ] 2026-06-01 19:06:52.593 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:07:07.176 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:07:07.176 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21346/300s
[INFO ] 2026-06-01 19:07:07.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 19:07:07.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426652,ok=426652,error=0, records=41
[WARN ] 2026-06-01 19:07:07.598 [5258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:07:22.177 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:07:22.519 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-01 19:07:22.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426653,ok=426653,error=0, records=41
[WARN ] 2026-06-01 19:07:22.604 [5347 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:07:25.189 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845728},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:07:25.358 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:07:25.358 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-01 19:07:25.358 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:07:25.358 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:07:25.358 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:07:25.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:07:25.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:07:25.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:07:25.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:07:25.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:07:25.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:07:37.177 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:07:37.525 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-01 19:07:37.525 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426654,ok=426654,error=0, records=41
[WARN ] 2026-06-01 19:07:37.609 [5347 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:07:47.857 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21344/300s
[INFO ] 2026-06-01 19:07:49.859 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21344/300s
[INFO ] 2026-06-01 19:07:52.178 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:07:52.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 19:07:52.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426655,ok=426655,error=0, records=41
[WARN ] 2026-06-01 19:07:52.614 [5341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:07:57.102 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21344/300s
[INFO ] 2026-06-01 19:08:07.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:08:07.541 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 19:08:07.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426656,ok=426656,error=0, records=41
[WARN ] 2026-06-01 19:08:07.620 [5292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:08:22.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:08:22.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 19:08:22.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426657,ok=426657,error=0, records=41
[WARN ] 2026-06-01 19:08:22.625 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:08:37.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:08:37.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 19:08:37.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426658,ok=426658,error=0, records=41
[WARN ] 2026-06-01 19:08:37.630 [5341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:08:52.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:08:52.181 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 19:08:52.565 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 19:08:52.565 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426659,ok=426659,error=0, records=41
[WARN ] 2026-06-01 19:08:52.636 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:09:07.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:09:07.571 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 19:09:07.571 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426660,ok=426660,error=0, records=41
[WARN ] 2026-06-01 19:09:07.641 [5341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:09:22.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:09:22.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 19:09:22.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426661,ok=426661,error=0, records=41
[WARN ] 2026-06-01 19:09:22.647 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:09:37.183 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:09:37.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 19:09:37.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426662,ok=426662,error=0, records=41
[WARN ] 2026-06-01 19:09:37.652 [5341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:09:52.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:09:52.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 19:09:52.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426663,ok=426663,error=0, records=41
[WARN ] 2026-06-01 19:09:52.657 [5347 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:10:01.333 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21348/300s
[INFO ] 2026-06-01 19:10:07.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:10:07.654 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 19:10:07.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426664,ok=426664,error=0, records=41
[WARN ] 2026-06-01 19:10:07.662 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:10:12.163 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21339/300s
[INFO ] 2026-06-01 19:10:22.185 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:10:22.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 19:10:22.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426665,ok=426665,error=0, records=41
[WARN ] 2026-06-01 19:10:22.666 [5292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:10:25.358 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17773/300s
[INFO ] 2026-06-01 19:10:25.360 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845648},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:10:25.497 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:10:25.497 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 19:10:25.498 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:10:25.498 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:10:25.498 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:10:25.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:10:25.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:10:25.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:10:25.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:10:25.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:10:25.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:10:37.186 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:10:37.671 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 19:10:37.671 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426666,ok=426666,error=0, records=41
[INFO ] 2026-06-01 19:10:37.671 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21335/300s
[WARN ] 2026-06-01 19:10:37.671 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:10:42.003 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21348/300s
[INFO ] 2026-06-01 19:10:52.186 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:10:52.677 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:10:52.678 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 19:10:52.678 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426667,ok=426667,error=0, records=41
[INFO ] 2026-06-01 19:11:07.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:11:07.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-01 19:11:07.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426668,ok=426668,error=0, records=41
[WARN ] 2026-06-01 19:11:07.683 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:11:22.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:11:22.687 [5292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:11:22.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-01 19:11:22.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426669,ok=426669,error=0, records=41
[INFO ] 2026-06-01 19:11:30.158 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21335/300s
[INFO ] 2026-06-01 19:11:37.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:11:37.692 [5347 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:11:37.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-01 19:11:37.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426670,ok=426670,error=0, records=41
[INFO ] 2026-06-01 19:11:41.596 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21344/300s
[INFO ] 2026-06-01 19:11:52.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:11:52.698 [5258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:11:52.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 19:11:52.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426671,ok=426671,error=0, records=41
[INFO ] 2026-06-01 19:12:07.189 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:12:07.189 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21347/300s
[WARN ] 2026-06-01 19:12:07.703 [5292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:12:07.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 19:12:07.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426672,ok=426672,error=0, records=41
[INFO ] 2026-06-01 19:12:22.190 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:12:22.708 [5347 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:12:22.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 19:12:22.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426673,ok=426673,error=0, records=41
[INFO ] 2026-06-01 19:12:37.190 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:12:37.713 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:12:37.800 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 19:12:37.800 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426674,ok=426674,error=0, records=41
[INFO ] 2026-06-01 19:12:47.927 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21345/300s
[INFO ] 2026-06-01 19:12:49.929 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21345/300s
[INFO ] 2026-06-01 19:12:52.191 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:12:52.718 [5341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:12:52.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 19:12:52.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426675,ok=426675,error=0, records=41
[INFO ] 2026-06-01 19:12:57.146 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21345/300s
[INFO ] 2026-06-01 19:13:07.191 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:13:07.723 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:13:07.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 19:13:07.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426676,ok=426676,error=0, records=41
[INFO ] 2026-06-01 19:13:22.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:13:22.729 [5341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:13:22.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-01 19:13:22.816 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426677,ok=426677,error=0, records=41
[INFO ] 2026-06-01 19:13:25.499 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845576},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:13:25.665 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:13:25.665 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 19:13:25.665 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:13:25.665 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:13:25.665 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:13:25.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:13:25.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:13:25.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:13:25.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:13:25.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:13:25.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:13:37.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 19:13:37.193 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 19:13:37.734 [5258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:13:37.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 19:13:37.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426678,ok=426678,error=0, records=41
[INFO ] 2026-06-01 19:13:52.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:13:52.738 [5258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:13:52.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 19:13:52.836 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426679,ok=426679,error=0, records=41
[INFO ] 2026-06-01 19:14:07.194 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:14:07.744 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:14:07.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 19:14:07.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426680,ok=426680,error=0, records=41
[INFO ] 2026-06-01 19:14:22.194 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:14:22.750 [5341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:14:22.846 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 19:14:22.846 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426681,ok=426681,error=0, records=41
[INFO ] 2026-06-01 19:14:37.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:14:37.755 [5292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:14:37.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 19:14:37.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426682,ok=426682,error=0, records=41
[INFO ] 2026-06-01 19:14:52.196 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:14:52.760 [5347 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:14:52.858 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 19:14:52.858 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426683,ok=426683,error=0, records=41
[INFO ] 2026-06-01 19:15:01.337 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21349/300s
[INFO ] 2026-06-01 19:15:07.196 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:15:07.767 [5341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:15:07.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 19:15:07.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426684,ok=426684,error=0, records=41
[INFO ] 2026-06-01 19:15:12.268 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21340/300s
[INFO ] 2026-06-01 19:15:22.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:15:22.773 [5341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:15:22.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 19:15:22.877 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426685,ok=426685,error=0, records=41
[INFO ] 2026-06-01 19:15:37.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:15:37.782 [5258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:15:37.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 19:15:37.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426686,ok=426686,error=0, records=41
[INFO ] 2026-06-01 19:15:37.892 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21336/300s
[INFO ] 2026-06-01 19:15:42.009 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21349/300s
[INFO ] 2026-06-01 19:15:52.198 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:15:52.788 [5347 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:15:52.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 19:15:52.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426687,ok=426687,error=0, records=41
[INFO ] 2026-06-01 19:16:07.199 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:16:07.793 [5258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:16:07.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 19:16:07.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426688,ok=426688,error=0, records=41
[INFO ] 2026-06-01 19:16:22.199 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:16:22.798 [5292 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:16:22.910 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 19:16:22.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426689,ok=426689,error=0, records=41
[INFO ] 2026-06-01 19:16:25.665 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17774/300s
[INFO ] 2026-06-01 19:16:25.667 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845492},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:16:25.824 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:16:25.824 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 19:16:25.824 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:16:25.824 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:16:25.824 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:16:25.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:16:25.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:16:25.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:16:25.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:16:25.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:16:25.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:16:30.338 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21336/300s
[INFO ] 2026-06-01 19:16:37.200 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:16:37.803 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:16:37.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 19:16:37.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426690,ok=426690,error=0, records=41
[INFO ] 2026-06-01 19:16:41.657 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21345/300s
[INFO ] 2026-06-01 19:16:52.200 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:16:52.807 [5867 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:16:52.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 19:16:52.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426691,ok=426691,error=0, records=41
[INFO ] 2026-06-01 19:17:07.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:17:07.201 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21348/300s
[WARN ] 2026-06-01 19:17:07.812 [5857 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:17:07.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 19:17:07.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426692,ok=426692,error=0, records=41
[INFO ] 2026-06-01 19:17:22.202 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:17:22.817 [5305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:17:22.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 19:17:22.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426693,ok=426693,error=0, records=41
[INFO ] 2026-06-01 19:17:37.202 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:17:37.823 [5341 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:17:37.941 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 19:17:37.941 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426694,ok=426694,error=0, records=41
[INFO ] 2026-06-01 19:17:48.021 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21346/300s
[INFO ] 2026-06-01 19:17:49.931 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21346/300s
[INFO ] 2026-06-01 19:17:52.203 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:17:52.827 [5916 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:17:52.946 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 19:17:52.946 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426695,ok=426695,error=0, records=41
[INFO ] 2026-06-01 19:17:57.229 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21346/300s
[INFO ] 2026-06-01 19:18:07.204 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:18:07.832 [5916 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:18:07.953 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-01 19:18:07.953 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426696,ok=426696,error=0, records=41
[INFO ] 2026-06-01 19:18:22.204 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:18:22.837 [5930 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:18:22.959 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11290, records=50
[INFO ] 2026-06-01 19:18:22.959 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426697,ok=426697,error=0, records=50
[INFO ] 2026-06-01 19:18:37.205 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:18:37.842 [5902 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:18:37.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 19:18:37.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426698,ok=426698,error=0, records=41
[INFO ] 2026-06-01 19:18:52.205 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:18:52.847 [5982 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:18:52.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 19:18:52.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426699,ok=426699,error=0, records=41
[INFO ] 2026-06-01 19:19:07.206 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:19:07.852 [5982 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:19:07.978 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 19:19:07.978 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426700,ok=426700,error=0, records=41
[INFO ] 2026-06-01 19:19:22.207 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:19:22.857 [5982 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:19:22.984 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 19:19:22.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426701,ok=426701,error=0, records=41
[INFO ] 2026-06-01 19:19:25.826 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845416},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:19:26.004 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:19:26.004 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 19:19:26.004 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:19:26.004 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:19:26.004 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:19:26.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:19:26.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:19:26.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:19:26.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:19:26.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:19:26.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:19:37.207 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:19:37.862 [6010 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:19:37.990 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 19:19:37.990 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426702,ok=426702,error=0, records=41
[INFO ] 2026-06-01 19:19:52.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:19:52.867 [5982 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:19:52.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 19:19:52.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426703,ok=426703,error=0, records=41
[INFO ] 2026-06-01 19:20:01.340 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21350/300s
[INFO ] 2026-06-01 19:20:07.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:20:07.872 [6010 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:20:08.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 19:20:08.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426704,ok=426704,error=0, records=41
[INFO ] 2026-06-01 19:20:12.374 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21341/300s
[INFO ] 2026-06-01 19:20:22.209 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:20:22.879 [6081 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:20:23.004 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 19:20:23.004 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426705,ok=426705,error=0, records=41
[INFO ] 2026-06-01 19:20:37.210 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:20:37.885 [6062 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:20:38.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-01 19:20:38.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426706,ok=426706,error=0, records=41
[INFO ] 2026-06-01 19:20:38.010 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21337/300s
[INFO ] 2026-06-01 19:20:42.016 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21350/300s
[INFO ] 2026-06-01 19:20:52.210 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:20:52.890 [6093 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:20:53.014 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 19:20:53.014 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426707,ok=426707,error=0, records=41
[INFO ] 2026-06-01 19:21:07.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:21:07.897 [6126 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:21:08.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 19:21:08.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426708,ok=426708,error=0, records=41
[INFO ] 2026-06-01 19:21:22.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:21:22.902 [6152 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:21:23.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 19:21:23.025 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426709,ok=426709,error=0, records=41
[INFO ] 2026-06-01 19:21:30.516 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21337/300s
[INFO ] 2026-06-01 19:21:37.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:21:37.907 [6169 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:21:38.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 19:21:38.033 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426710,ok=426710,error=0, records=41
[INFO ] 2026-06-01 19:21:41.707 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21346/300s
[INFO ] 2026-06-01 19:21:52.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:21:52.913 [6186 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:21:53.038 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 19:21:53.038 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426711,ok=426711,error=0, records=41
[INFO ] 2026-06-01 19:22:07.213 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:22:07.213 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21349/300s
[WARN ] 2026-06-01 19:22:07.919 [6198 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:22:08.043 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-01 19:22:08.043 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426712,ok=426712,error=0, records=41
[INFO ] 2026-06-01 19:22:22.214 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:22:22.926 [6209 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:22:23.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 19:22:23.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426713,ok=426713,error=0, records=41
[INFO ] 2026-06-01 19:22:26.004 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17775/300s
[INFO ] 2026-06-01 19:22:26.006 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845320},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:22:26.178 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:22:26.178 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 19:22:26.178 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:22:26.178 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:22:26.178 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:22:26.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:22:26.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:22:26.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:22:26.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:22:26.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:22:26.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:22:37.214 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:22:37.931 [6181 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:22:38.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 19:22:38.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426714,ok=426714,error=0, records=41
[INFO ] 2026-06-01 19:22:48.064 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21347/300s
[INFO ] 2026-06-01 19:22:49.966 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21347/300s
[INFO ] 2026-06-01 19:22:52.215 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:22:52.937 [6232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:22:53.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-01 19:22:53.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426715,ok=426715,error=0, records=41
[INFO ] 2026-06-01 19:22:57.271 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21347/300s
[INFO ] 2026-06-01 19:23:07.216 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:23:07.946 [6248 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:23:08.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-01 19:23:08.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426716,ok=426716,error=0, records=41
[INFO ] 2026-06-01 19:23:22.216 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:23:22.951 [6260 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:23:23.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-01 19:23:23.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426717,ok=426717,error=0, records=41
[INFO ] 2026-06-01 19:23:37.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 19:23:37.217 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 19:23:37.956 [6269 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:23:38.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 19:23:38.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426718,ok=426718,error=0, records=41
[INFO ] 2026-06-01 19:23:52.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:23:52.217 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 19:23:52.961 [6260 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:23:53.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-01 19:23:53.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426719,ok=426719,error=0, records=41
[INFO ] 2026-06-01 19:24:07.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:24:07.967 [6309 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:24:08.090 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 19:24:08.090 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426720,ok=426720,error=0, records=41
[INFO ] 2026-06-01 19:24:22.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:24:22.972 [6269 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:24:23.097 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 19:24:23.097 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426721,ok=426721,error=0, records=41
[INFO ] 2026-06-01 19:24:37.220 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:24:37.978 [6309 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:24:38.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 19:24:38.103 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426722,ok=426722,error=0, records=41
[INFO ] 2026-06-01 19:24:52.221 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:24:52.984 [6336 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:24:53.110 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 19:24:53.110 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426723,ok=426723,error=0, records=41
[INFO ] 2026-06-01 19:25:01.343 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21351/300s
[INFO ] 2026-06-01 19:25:07.221 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:25:07.988 [6254 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:25:08.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 19:25:08.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426724,ok=426724,error=0, records=41
[INFO ] 2026-06-01 19:25:12.489 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21342/300s
[INFO ] 2026-06-01 19:25:22.222 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:25:22.992 [6336 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:25:23.122 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 19:25:23.122 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426725,ok=426725,error=0, records=41
[INFO ] 2026-06-01 19:25:26.180 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845244},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:25:26.345 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:25:26.345 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 19:25:26.346 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:25:26.346 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:25:26.346 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:25:26.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:25:26.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:25:26.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:25:26.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:25:26.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:25:26.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:25:37.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:25:37.997 [6254 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:25:38.127 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 19:25:38.127 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426726,ok=426726,error=0, records=41
[INFO ] 2026-06-01 19:25:38.127 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21338/300s
[INFO ] 2026-06-01 19:25:42.022 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21351/300s
[INFO ] 2026-06-01 19:25:52.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:25:53.002 [6254 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:25:53.133 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 19:25:53.133 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426727,ok=426727,error=0, records=41
[INFO ] 2026-06-01 19:26:07.224 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:26:08.008 [6260 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:26:08.138 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 19:26:08.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426728,ok=426728,error=0, records=41
[INFO ] 2026-06-01 19:26:22.224 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:26:23.013 [6260 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:26:23.146 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 19:26:23.146 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426729,ok=426729,error=0, records=41
[INFO ] 2026-06-01 19:26:30.700 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21338/300s
[INFO ] 2026-06-01 19:26:37.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:26:38.019 [6336 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:26:38.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 19:26:38.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426730,ok=426730,error=0, records=41
[INFO ] 2026-06-01 19:26:41.764 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21347/300s
[INFO ] 2026-06-01 19:26:52.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:26:53.025 [6463 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:26:53.161 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 19:26:53.161 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426731,ok=426731,error=0, records=41
[INFO ] 2026-06-01 19:27:07.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:27:07.226 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21350/300s
[WARN ] 2026-06-01 19:27:08.030 [6463 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:27:08.166 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 19:27:08.166 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426732,ok=426732,error=0, records=41
[INFO ] 2026-06-01 19:27:22.227 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:27:23.037 [6336 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:27:23.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 19:27:23.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426733,ok=426733,error=0, records=41
[INFO ] 2026-06-01 19:27:37.228 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:27:38.043 [6336 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:27:38.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 19:27:38.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426734,ok=426734,error=0, records=41
[INFO ] 2026-06-01 19:27:48.125 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21348/300s
[INFO ] 2026-06-01 19:27:50.027 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21348/300s
[INFO ] 2026-06-01 19:27:52.228 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:27:53.049 [6509 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:27:53.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 19:27:53.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426735,ok=426735,error=0, records=41
[INFO ] 2026-06-01 19:27:57.334 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21348/300s
[INFO ] 2026-06-01 19:28:07.229 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:28:07.555 [6554 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:28:08.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 19:28:08.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426736,ok=426736,error=0, records=41
[INFO ] 2026-06-01 19:28:22.229 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:28:22.560 [6571 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:28:23.267 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 19:28:23.267 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426737,ok=426737,error=0, records=41
[INFO ] 2026-06-01 19:28:26.346 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17776/300s
[INFO ] 2026-06-01 19:28:26.347 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845164},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:28:26.531 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:28:26.531 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 19:28:26.531 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:28:26.531 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:28:26.531 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:28:26.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:28:26.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:28:26.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:28:26.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:28:26.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:28:26.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:28:37.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:28:37.565 [6554 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:28:38.272 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 19:28:38.272 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426738,ok=426738,error=0, records=41
[INFO ] 2026-06-01 19:28:52.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:28:52.570 [6591 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:28:53.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 19:28:53.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426739,ok=426739,error=0, records=41
[INFO ] 2026-06-01 19:29:07.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:29:07.576 [6594 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:29:08.344 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 19:29:08.344 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426740,ok=426740,error=0, records=41
[INFO ] 2026-06-01 19:29:22.232 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:29:22.582 [6639 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:29:23.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 19:29:23.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426741,ok=426741,error=0, records=41
[INFO ] 2026-06-01 19:29:37.233 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:29:37.586 [6657 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:29:38.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 19:29:38.356 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426742,ok=426742,error=0, records=41
[INFO ] 2026-06-01 19:29:52.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:29:52.592 [6658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:29:53.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10891, records=42
[INFO ] 2026-06-01 19:29:53.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426743,ok=426743,error=0, records=42
[INFO ] 2026-06-01 19:30:01.347 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21352/300s
[INFO ] 2026-06-01 19:30:07.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:30:07.597 [6679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:30:08.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 19:30:08.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426744,ok=426744,error=0, records=41
[INFO ] 2026-06-01 19:30:12.599 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21343/300s
[INFO ] 2026-06-01 19:30:22.235 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:30:22.602 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:30:23.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 19:30:23.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426745,ok=426745,error=0, records=41
[INFO ] 2026-06-01 19:30:37.235 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:30:37.608 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:30:38.451 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 19:30:38.451 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426746,ok=426746,error=0, records=41
[INFO ] 2026-06-01 19:30:38.452 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21339/300s
[INFO ] 2026-06-01 19:30:42.028 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21352/300s
[INFO ] 2026-06-01 19:30:52.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:30:52.612 [6679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:30:53.457 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 19:30:53.457 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426747,ok=426747,error=0, records=41
[INFO ] 2026-06-01 19:31:07.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:31:07.617 [6700 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:31:08.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-01 19:31:08.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426748,ok=426748,error=0, records=41
[INFO ] 2026-06-01 19:31:22.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:31:22.622 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:31:23.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 19:31:23.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426749,ok=426749,error=0, records=41
[INFO ] 2026-06-01 19:31:26.533 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845084},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:31:26.687 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:31:26.687 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 19:31:26.688 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:31:26.688 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:31:26.688 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:31:26.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:31:26.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:31:26.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:31:26.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:31:26.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:31:26.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:31:30.882 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21339/300s
[INFO ] 2026-06-01 19:31:37.238 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:31:37.626 [6658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:31:38.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 19:31:38.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426750,ok=426750,error=0, records=41
[INFO ] 2026-06-01 19:31:41.819 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21348/300s
[INFO ] 2026-06-01 19:31:52.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:31:52.632 [6658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:31:53.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 19:31:53.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426751,ok=426751,error=0, records=41
[INFO ] 2026-06-01 19:32:07.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:32:07.239 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21351/300s
[WARN ] 2026-06-01 19:32:07.637 [6658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:32:08.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 19:32:08.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426752,ok=426752,error=0, records=41
[INFO ] 2026-06-01 19:32:22.240 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:32:22.642 [6700 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:32:23.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 19:32:23.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426753,ok=426753,error=0, records=41
[INFO ] 2026-06-01 19:32:37.241 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:32:37.648 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:32:38.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 19:32:38.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426754,ok=426754,error=0, records=41
[INFO ] 2026-06-01 19:32:48.190 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21349/300s
[INFO ] 2026-06-01 19:32:50.092 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21349/300s
[INFO ] 2026-06-01 19:32:52.241 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:32:52.653 [6658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:32:53.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 19:32:53.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426755,ok=426755,error=0, records=41
[INFO ] 2026-06-01 19:32:57.398 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21349/300s
[INFO ] 2026-06-01 19:33:07.242 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:33:07.657 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:33:08.521 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 19:33:08.521 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426756,ok=426756,error=0, records=41
[INFO ] 2026-06-01 19:33:22.243 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:33:22.662 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:33:23.528 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 19:33:23.528 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426757,ok=426757,error=0, records=41
[INFO ] 2026-06-01 19:33:37.243 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 19:33:37.243 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 19:33:37.666 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:33:38.533 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 19:33:38.533 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426758,ok=426758,error=0, records=41
[INFO ] 2026-06-01 19:33:52.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:33:52.670 [6658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:33:53.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 19:33:53.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426759,ok=426759,error=0, records=41
[INFO ] 2026-06-01 19:34:07.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:34:07.675 [6647 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:34:08.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 19:34:08.545 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426760,ok=426760,error=0, records=41
[INFO ] 2026-06-01 19:34:22.245 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:34:22.681 [6679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:34:23.551 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10151, records=41
[INFO ] 2026-06-01 19:34:23.551 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426761,ok=426761,error=0, records=41
[INFO ] 2026-06-01 19:34:26.688 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17777/300s
[INFO ] 2026-06-01 19:34:26.689 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845004},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:34:26.844 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:34:26.844 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 19:34:26.844 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:34:26.844 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:34:26.844 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:34:26.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:34:26.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:34:26.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:34:26.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:34:26.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:34:26.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:34:37.246 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:34:37.687 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:34:38.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 19:34:38.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426762,ok=426762,error=0, records=41
[INFO ] 2026-06-01 19:34:52.246 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:34:52.692 [6679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:34:53.562 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 19:34:53.562 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426763,ok=426763,error=0, records=41
[INFO ] 2026-06-01 19:35:01.350 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21353/300s
[INFO ] 2026-06-01 19:35:07.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:35:07.698 [6647 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:35:08.567 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 19:35:08.567 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426764,ok=426764,error=0, records=41
[INFO ] 2026-06-01 19:35:12.699 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21344/300s
[INFO ] 2026-06-01 19:35:22.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:35:22.703 [6700 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:35:23.573 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 19:35:23.573 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426765,ok=426765,error=0, records=41
[INFO ] 2026-06-01 19:35:37.248 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:35:37.708 [6700 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:35:38.579 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 19:35:38.579 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426766,ok=426766,error=0, records=41
[INFO ] 2026-06-01 19:35:38.579 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21340/300s
[INFO ] 2026-06-01 19:35:42.039 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21353/300s
[INFO ] 2026-06-01 19:35:52.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:35:52.713 [6679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:35:53.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 19:35:53.586 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426767,ok=426767,error=0, records=41
[INFO ] 2026-06-01 19:36:07.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:36:07.719 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:36:08.593 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 19:36:08.593 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426768,ok=426768,error=0, records=41
[INFO ] 2026-06-01 19:36:22.250 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:36:22.724 [6700 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:36:23.598 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 19:36:23.598 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426769,ok=426769,error=0, records=41
[INFO ] 2026-06-01 19:36:31.071 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21340/300s
[INFO ] 2026-06-01 19:36:37.250 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:36:37.729 [6700 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:36:38.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-01 19:36:38.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426770,ok=426770,error=0, records=41
[INFO ] 2026-06-01 19:36:41.874 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21349/300s
[WARN ] 2026-06-01 19:36:47.733 [6658 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/4562/stat), No such file or directory
[INFO ] 2026-06-01 19:36:52.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:36:52.734 [6658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:36:53.607 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 19:36:53.607 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426771,ok=426771,error=0, records=41
[INFO ] 2026-06-01 19:37:07.252 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:37:07.252 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21352/300s
[WARN ] 2026-06-01 19:37:07.739 [6679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:37:08.612 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 19:37:08.612 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426772,ok=426772,error=0, records=41
[INFO ] 2026-06-01 19:37:22.252 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:37:22.744 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:37:23.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 19:37:23.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426773,ok=426773,error=0, records=41
[INFO ] 2026-06-01 19:37:26.846 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844908},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:37:27.005 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:37:27.005 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 19:37:27.005 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:37:27.005 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:37:27.005 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:37:27.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:37:27.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:37:27.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:37:27.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:37:27.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:37:27.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:37:37.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:37:37.749 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:37:38.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 19:37:38.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426774,ok=426774,error=0, records=41
[INFO ] 2026-06-01 19:37:48.246 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21350/300s
[INFO ] 2026-06-01 19:37:50.147 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21350/300s
[INFO ] 2026-06-01 19:37:52.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:37:52.754 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:37:53.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 19:37:53.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426775,ok=426775,error=0, records=41
[INFO ] 2026-06-01 19:37:57.454 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21350/300s
[INFO ] 2026-06-01 19:38:07.254 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:38:07.759 [6679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:38:08.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-01 19:38:08.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426776,ok=426776,error=0, records=41
[INFO ] 2026-06-01 19:38:22.255 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:38:22.763 [6679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:38:23.651 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-01 19:38:23.651 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426777,ok=426777,error=0, records=41
[INFO ] 2026-06-01 19:38:37.255 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:38:37.768 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:38:38.657 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 19:38:38.657 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426778,ok=426778,error=0, records=41
[INFO ] 2026-06-01 19:38:52.256 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:38:52.256 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 19:38:52.774 [6679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:38:53.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10146, records=41
[INFO ] 2026-06-01 19:38:53.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426779,ok=426779,error=0, records=41
[INFO ] 2026-06-01 19:39:07.257 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:39:07.780 [6700 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:39:08.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 19:39:08.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426780,ok=426780,error=0, records=41
[INFO ] 2026-06-01 19:39:22.258 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:39:22.785 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:39:23.676 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 19:39:23.676 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426781,ok=426781,error=0, records=41
[INFO ] 2026-06-01 19:39:37.258 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:39:37.790 [6647 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:39:38.681 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 19:39:38.681 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426782,ok=426782,error=0, records=41
[INFO ] 2026-06-01 19:39:52.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:39:52.796 [6658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:39:53.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 19:39:53.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426783,ok=426783,error=0, records=41
[INFO ] 2026-06-01 19:40:01.354 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21354/300s
[INFO ] 2026-06-01 19:40:07.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:40:07.802 [6679 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:40:08.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 19:40:08.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426784,ok=426784,error=0, records=41
[INFO ] 2026-06-01 19:40:12.804 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21345/300s
[INFO ] 2026-06-01 19:40:22.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:40:22.809 [6658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:40:23.782 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 19:40:23.782 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426785,ok=426785,error=0, records=41
[INFO ] 2026-06-01 19:40:27.005 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17778/300s
[INFO ] 2026-06-01 19:40:27.007 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844828},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:40:27.190 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:40:27.190 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-01 19:40:27.190 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:40:27.190 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:40:27.190 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:40:27.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:40:27.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:40:27.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:40:27.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:40:27.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:40:27.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:40:37.261 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:40:37.813 [7273 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:40:38.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 19:40:38.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426786,ok=426786,error=0, records=41
[INFO ] 2026-06-01 19:40:38.788 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21341/300s
[INFO ] 2026-06-01 19:40:42.046 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21354/300s
[INFO ] 2026-06-01 19:40:52.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:40:52.818 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:40:53.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 19:40:53.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426787,ok=426787,error=0, records=41
[INFO ] 2026-06-01 19:41:07.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:41:07.823 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:41:08.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 19:41:08.836 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426788,ok=426788,error=0, records=41
[INFO ] 2026-06-01 19:41:22.263 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:41:22.828 [6663 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:41:23.844 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-01 19:41:23.844 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426789,ok=426789,error=0, records=41
[INFO ] 2026-06-01 19:41:31.256 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21341/300s
[INFO ] 2026-06-01 19:41:37.263 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:41:37.833 [7273 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:41:38.849 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 19:41:38.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426790,ok=426790,error=0, records=41
[INFO ] 2026-06-01 19:41:41.931 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21350/300s
[INFO ] 2026-06-01 19:41:52.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:41:52.840 [7287 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:41:53.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 19:41:53.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426791,ok=426791,error=0, records=41
[INFO ] 2026-06-01 19:42:07.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:42:07.265 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21353/300s
[WARN ] 2026-06-01 19:42:07.845 [7273 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:42:08.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 19:42:08.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426792,ok=426792,error=0, records=41
[INFO ] 2026-06-01 19:42:22.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:42:22.850 [7365 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:42:23.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 19:42:23.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426793,ok=426793,error=0, records=41
[INFO ] 2026-06-01 19:42:37.266 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:42:37.855 [7273 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:42:38.871 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 19:42:38.871 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426794,ok=426794,error=0, records=41
[INFO ] 2026-06-01 19:42:48.304 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21351/300s
[INFO ] 2026-06-01 19:42:50.206 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21351/300s
[INFO ] 2026-06-01 19:42:52.266 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:42:52.860 [7315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:42:53.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 19:42:53.877 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426795,ok=426795,error=0, records=41
[INFO ] 2026-06-01 19:42:57.512 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21351/300s
[INFO ] 2026-06-01 19:43:07.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:43:07.873 [7273 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:43:08.884 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 19:43:08.884 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426796,ok=426796,error=0, records=41
[WARN ] 2026-06-01 19:43:17.374 [7406 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6680/stat), No such file or directory
[INFO ] 2026-06-01 19:43:22.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:43:22.875 [7392 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:43:23.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-01 19:43:23.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426797,ok=426797,error=0, records=41
[INFO ] 2026-06-01 19:43:27.191 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844744},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:43:27.357 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:43:27.357 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 19:43:27.357 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:43:27.357 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:43:27.357 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:43:27.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:43:27.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:43:27.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:43:27.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:43:27.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:43:27.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 19:43:32.380 [7273 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6680/stat), No such file or directory
[INFO ] 2026-06-01 19:43:37.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 19:43:37.268 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 19:43:37.881 [7315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:43:38.897 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 19:43:38.897 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426798,ok=426798,error=0, records=41
[WARN ] 2026-06-01 19:43:47.385 [7459 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6680/stat), No such file or directory
[INFO ] 2026-06-01 19:43:52.269 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:43:52.887 [7470 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:43:53.903 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 19:43:53.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426799,ok=426799,error=0, records=41
[INFO ] 2026-06-01 19:44:07.269 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:44:07.892 [7459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:44:08.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 19:44:08.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426800,ok=426800,error=0, records=41
[INFO ] 2026-06-01 19:44:22.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:44:22.897 [7498 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:44:23.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 19:44:23.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426801,ok=426801,error=0, records=41
[INFO ] 2026-06-01 19:44:37.271 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:44:37.903 [7515 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:44:38.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-01 19:44:38.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426802,ok=426802,error=0, records=41
[INFO ] 2026-06-01 19:44:52.271 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:44:52.908 [7532 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:44:53.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 19:44:53.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426803,ok=426803,error=0, records=41
[INFO ] 2026-06-01 19:45:01.358 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21355/300s
[INFO ] 2026-06-01 19:45:07.272 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:45:07.914 [7532 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:45:08.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 19:45:08.936 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426804,ok=426804,error=0, records=41
[INFO ] 2026-06-01 19:45:12.915 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21346/300s
[INFO ] 2026-06-01 19:45:22.273 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:45:22.919 [7571 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:45:23.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 19:45:23.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426805,ok=426805,error=0, records=41
[INFO ] 2026-06-01 19:45:37.273 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:45:37.929 [7571 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:45:38.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 19:45:38.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426806,ok=426806,error=0, records=41
[INFO ] 2026-06-01 19:45:38.948 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21342/300s
[INFO ] 2026-06-01 19:45:42.052 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21355/300s
[INFO ] 2026-06-01 19:45:52.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:45:52.935 [7571 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:45:53.952 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 19:45:53.952 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426807,ok=426807,error=0, records=41
[INFO ] 2026-06-01 19:46:07.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:46:07.941 [7608 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:46:08.958 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 19:46:08.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426808,ok=426808,error=0, records=41
[INFO ] 2026-06-01 19:46:22.275 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:46:22.947 [7641 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:46:23.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-01 19:46:23.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426809,ok=426809,error=0, records=41
[INFO ] 2026-06-01 19:46:27.358 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17779/300s
[INFO ] 2026-06-01 19:46:27.359 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844668},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:46:27.527 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:46:27.527 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 19:46:27.527 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:46:27.527 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:46:27.527 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:46:27.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:46:27.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:46:27.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:46:27.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:46:27.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:46:27.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:46:31.437 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21342/300s
[INFO ] 2026-06-01 19:46:37.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:46:37.952 [7657 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:46:38.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 19:46:38.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426810,ok=426810,error=0, records=41
[INFO ] 2026-06-01 19:46:41.998 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21351/300s
[INFO ] 2026-06-01 19:46:52.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:46:52.957 [7608 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:46:53.976 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-01 19:46:53.976 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426811,ok=426811,error=0, records=41
[INFO ] 2026-06-01 19:47:07.277 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:47:07.277 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21354/300s
[WARN ] 2026-06-01 19:47:07.961 [7641 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:47:08.982 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 19:47:08.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426812,ok=426812,error=0, records=41
[INFO ] 2026-06-01 19:47:22.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:47:22.966 [7640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:47:23.988 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 19:47:23.988 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426813,ok=426813,error=0, records=41
[INFO ] 2026-06-01 19:47:37.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:47:37.971 [7699 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:47:39.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 19:47:39.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426814,ok=426814,error=0, records=41
[INFO ] 2026-06-01 19:47:48.393 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21352/300s
[INFO ] 2026-06-01 19:47:50.294 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21352/300s
[INFO ] 2026-06-01 19:47:52.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:47:52.976 [7640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:47:54.075 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 19:47:54.075 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426815,ok=426815,error=0, records=41
[INFO ] 2026-06-01 19:47:57.601 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21352/300s
[INFO ] 2026-06-01 19:48:07.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:48:07.982 [7699 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:48:09.080 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 19:48:09.080 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426816,ok=426816,error=0, records=41
[INFO ] 2026-06-01 19:48:22.280 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:48:22.986 [7657 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:48:24.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 19:48:24.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426817,ok=426817,error=0, records=41
[INFO ] 2026-06-01 19:48:37.281 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:48:37.991 [7775 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:48:39.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 19:48:39.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426818,ok=426818,error=0, records=41
[INFO ] 2026-06-01 19:48:52.281 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:48:52.995 [7640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:48:54.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 19:48:54.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426819,ok=426819,error=0, records=41
[INFO ] 2026-06-01 19:49:07.282 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:49:08.000 [7715 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:49:09.104 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 19:49:09.104 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426820,ok=426820,error=0, records=41
[INFO ] 2026-06-01 19:49:22.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:49:23.005 [7775 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:49:24.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 19:49:24.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426821,ok=426821,error=0, records=41
[INFO ] 2026-06-01 19:49:27.529 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844588},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:49:27.692 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:49:27.692 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 19:49:27.692 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:49:27.692 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:49:27.692 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:49:27.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:49:27.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:49:27.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:49:27.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:49:27.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:49:27.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:49:37.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:49:38.010 [7789 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:49:39.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 19:49:39.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426822,ok=426822,error=0, records=41
[INFO ] 2026-06-01 19:49:52.284 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:49:53.016 [7775 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:49:54.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 19:49:54.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426823,ok=426823,error=0, records=41
[INFO ] 2026-06-01 19:50:01.361 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21356/300s
[INFO ] 2026-06-01 19:50:07.285 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:50:08.020 [7775 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:50:09.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-01 19:50:09.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426824,ok=426824,error=0, records=41
[INFO ] 2026-06-01 19:50:13.021 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21347/300s
[INFO ] 2026-06-01 19:50:22.285 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:50:23.025 [7789 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:50:24.304 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10136, records=41
[INFO ] 2026-06-01 19:50:24.304 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426825,ok=426825,error=0, records=41
[INFO ] 2026-06-01 19:50:37.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:50:38.031 [7817 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:50:39.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 19:50:39.309 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426826,ok=426826,error=0, records=41
[INFO ] 2026-06-01 19:50:39.310 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21343/300s
[INFO ] 2026-06-01 19:50:42.059 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21356/300s
[INFO ] 2026-06-01 19:50:52.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:50:53.036 [7803 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:50:54.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-01 19:50:54.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426827,ok=426827,error=0, records=41
[INFO ] 2026-06-01 19:51:07.287 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:51:08.042 [7803 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:51:09.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 19:51:09.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426828,ok=426828,error=0, records=41
[INFO ] 2026-06-01 19:51:22.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:51:23.048 [7915 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:51:24.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 19:51:24.337 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426829,ok=426829,error=0, records=41
[INFO ] 2026-06-01 19:51:31.610 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21343/300s
[INFO ] 2026-06-01 19:51:37.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:51:38.053 [7966 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:51:39.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 19:51:39.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426830,ok=426830,error=0, records=41
[INFO ] 2026-06-01 19:51:42.055 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21352/300s
[INFO ] 2026-06-01 19:51:52.289 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:51:52.558 [7976 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:51:54.354 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 19:51:54.354 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426831,ok=426831,error=0, records=41
[INFO ] 2026-06-01 19:52:07.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:52:07.290 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21355/300s
[WARN ] 2026-06-01 19:52:07.563 [7999 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:52:09.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 19:52:09.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426832,ok=426832,error=0, records=41
[INFO ] 2026-06-01 19:52:22.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:52:22.568 [7982 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:52:24.374 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 19:52:24.374 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426833,ok=426833,error=0, records=41
[INFO ] 2026-06-01 19:52:27.692 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17780/300s
[INFO ] 2026-06-01 19:52:27.694 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844512},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:52:27.861 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:52:27.862 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 19:52:27.862 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:52:27.862 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:52:27.862 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:52:27.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:52:27.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:52:27.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:52:27.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:52:27.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:52:27.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:52:37.291 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:52:37.573 [8035 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:52:39.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 19:52:39.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426834,ok=426834,error=0, records=41
[INFO ] 2026-06-01 19:52:48.458 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21353/300s
[INFO ] 2026-06-01 19:52:50.359 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21353/300s
[INFO ] 2026-06-01 19:52:52.292 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:52:52.577 [8054 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:52:54.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 19:52:54.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426835,ok=426835,error=0, records=41
[INFO ] 2026-06-01 19:52:57.665 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21353/300s
[INFO ] 2026-06-01 19:53:07.292 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:53:07.582 [8067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:53:09.395 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 19:53:09.395 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426836,ok=426836,error=0, records=41
[INFO ] 2026-06-01 19:53:22.293 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:53:22.586 [8055 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:53:24.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 19:53:24.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426837,ok=426837,error=0, records=41
[INFO ] 2026-06-01 19:53:37.294 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 19:53:37.294 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 19:53:37.592 [8101 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:53:39.405 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 19:53:39.405 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426838,ok=426838,error=0, records=41
[INFO ] 2026-06-01 19:53:52.294 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:53:52.294 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 19:53:52.596 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:53:54.411 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 19:53:54.411 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426839,ok=426839,error=0, records=41
[INFO ] 2026-06-01 19:54:07.296 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:54:07.601 [8116 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:54:09.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 19:54:09.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426840,ok=426840,error=0, records=41
[INFO ] 2026-06-01 19:54:22.296 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:54:22.606 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:54:24.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 19:54:24.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426841,ok=426841,error=0, records=41
[INFO ] 2026-06-01 19:54:37.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:54:37.610 [8116 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:54:39.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 19:54:39.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426842,ok=426842,error=0, records=41
[INFO ] 2026-06-01 19:54:52.298 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:54:52.614 [8100 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:54:54.436 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 19:54:54.436 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426843,ok=426843,error=0, records=41
[INFO ] 2026-06-01 19:55:01.364 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21357/300s
[INFO ] 2026-06-01 19:55:07.298 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:55:07.619 [8100 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:55:09.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10372, records=41
[INFO ] 2026-06-01 19:55:09.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426844,ok=426844,error=0, records=41
[INFO ] 2026-06-01 19:55:13.120 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21348/300s
[INFO ] 2026-06-01 19:55:22.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:55:22.624 [8055 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:55:24.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-01 19:55:24.448 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426845,ok=426845,error=0, records=41
[INFO ] 2026-06-01 19:55:27.863 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844432},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:55:28.014 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:55:28.014 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 19:55:28.014 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:55:28.014 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:55:28.014 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:55:28.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:55:28.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:55:28.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:55:28.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:55:28.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:55:28.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:55:37.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:55:37.630 [8055 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:55:39.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-01 19:55:39.465 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426846,ok=426846,error=0, records=41
[INFO ] 2026-06-01 19:55:39.465 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21344/300s
[INFO ] 2026-06-01 19:55:42.065 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21357/300s
[INFO ] 2026-06-01 19:55:52.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:55:52.635 [8083 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:55:54.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 19:55:54.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426847,ok=426847,error=0, records=41
[INFO ] 2026-06-01 19:56:07.301 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:56:07.641 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:56:09.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 19:56:09.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426848,ok=426848,error=0, records=41
[INFO ] 2026-06-01 19:56:22.301 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:56:22.645 [8083 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:56:24.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10146, records=41
[INFO ] 2026-06-01 19:56:24.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426849,ok=426849,error=0, records=41
[INFO ] 2026-06-01 19:56:31.795 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21344/300s
[INFO ] 2026-06-01 19:56:37.302 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:56:37.651 [8055 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:56:39.489 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-01 19:56:39.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426850,ok=426850,error=0, records=41
[INFO ] 2026-06-01 19:56:42.110 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21353/300s
[INFO ] 2026-06-01 19:56:52.303 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:56:52.657 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:56:54.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 19:56:54.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426851,ok=426851,error=0, records=41
[INFO ] 2026-06-01 19:57:07.303 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 19:57:07.303 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21356/300s
[WARN ] 2026-06-01 19:57:07.662 [8100 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:57:09.511 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 19:57:09.511 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426852,ok=426852,error=0, records=41
[INFO ] 2026-06-01 19:57:22.304 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:57:22.667 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:57:24.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 19:57:24.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426853,ok=426853,error=0, records=41
[INFO ] 2026-06-01 19:57:37.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:57:37.672 [8116 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:57:39.599 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 19:57:39.599 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426854,ok=426854,error=0, records=41
[INFO ] 2026-06-01 19:57:48.518 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21354/300s
[INFO ] 2026-06-01 19:57:50.420 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21354/300s
[INFO ] 2026-06-01 19:57:52.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:57:52.677 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:57:54.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 19:57:54.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426855,ok=426855,error=0, records=41
[INFO ] 2026-06-01 19:57:57.727 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21354/300s
[INFO ] 2026-06-01 19:58:07.306 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:58:07.681 [8100 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:58:09.610 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 19:58:09.610 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426856,ok=426856,error=0, records=41
[INFO ] 2026-06-01 19:58:22.306 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:58:22.685 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:58:24.616 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 19:58:24.616 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426857,ok=426857,error=0, records=41
[INFO ] 2026-06-01 19:58:28.014 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17781/300s
[INFO ] 2026-06-01 19:58:28.016 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844356},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 19:58:28.169 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 19:58:28.169 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 19:58:28.170 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 19:58:28.170 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 19:58:28.170 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 19:58:28.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:58:28.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 19:58:28.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:58:28.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 19:58:28.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:58:28.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 19:58:37.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:58:37.690 [8116 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:58:39.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 19:58:39.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426858,ok=426858,error=0, records=41
[INFO ] 2026-06-01 19:58:52.308 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:58:52.694 [8083 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:58:54.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 19:58:54.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426859,ok=426859,error=0, records=41
[INFO ] 2026-06-01 19:59:07.308 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:59:07.699 [8055 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:59:09.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 19:59:09.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426860,ok=426860,error=0, records=41
[INFO ] 2026-06-01 19:59:22.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:59:22.704 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:59:24.641 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 19:59:24.641 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426861,ok=426861,error=0, records=41
[INFO ] 2026-06-01 19:59:37.310 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:59:37.708 [8100 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:59:39.646 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 19:59:39.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426862,ok=426862,error=0, records=41
[INFO ] 2026-06-01 19:59:52.310 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 19:59:52.713 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 19:59:54.652 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 19:59:54.652 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426863,ok=426863,error=0, records=41
[INFO ] 2026-06-01 20:00:01.367 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21358/300s
[INFO ] 2026-06-01 20:00:07.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:00:07.719 [8116 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:00:09.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 20:00:09.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426864,ok=426864,error=0, records=41
[INFO ] 2026-06-01 20:00:13.221 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21349/300s
[INFO ] 2026-06-01 20:00:22.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:00:22.725 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:00:24.664 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 20:00:24.664 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426865,ok=426865,error=0, records=41
[INFO ] 2026-06-01 20:00:37.312 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:00:37.730 [8083 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:00:39.671 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 20:00:39.671 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426866,ok=426866,error=0, records=41
[INFO ] 2026-06-01 20:00:39.671 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21345/300s
[INFO ] 2026-06-01 20:00:42.071 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21358/300s
[INFO ] 2026-06-01 20:00:52.313 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:00:52.736 [8055 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:00:54.678 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 20:00:54.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426867,ok=426867,error=0, records=41
[INFO ] 2026-06-01 20:01:07.313 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:01:07.741 [8083 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:01:09.684 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 20:01:09.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426868,ok=426868,error=0, records=41
[INFO ] 2026-06-01 20:01:22.314 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:01:22.747 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:01:24.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 20:01:24.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426869,ok=426869,error=0, records=41
[INFO ] 2026-06-01 20:01:28.171 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844280},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:01:28.341 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:01:28.341 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 20:01:28.341 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:01:28.341 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:01:28.341 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:01:28.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:01:28.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:01:28.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:01:28.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:01:28.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:01:28.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:01:31.978 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21345/300s
[INFO ] 2026-06-01 20:01:37.315 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:01:37.751 [8116 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:01:39.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 20:01:39.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426870,ok=426870,error=0, records=41
[INFO ] 2026-06-01 20:01:42.165 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21354/300s
[INFO ] 2026-06-01 20:01:52.315 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:01:52.758 [8100 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:01:54.702 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 20:01:54.702 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426871,ok=426871,error=0, records=41
[INFO ] 2026-06-01 20:02:07.316 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:02:07.316 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21357/300s
[WARN ] 2026-06-01 20:02:07.763 [8083 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:02:09.708 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 20:02:09.708 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426872,ok=426872,error=0, records=41
[WARN ] 2026-06-01 20:02:17.771 [8083 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/799/stat), No such file or directory
[INFO ] 2026-06-01 20:02:22.316 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:02:22.773 [8100 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:02:24.713 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 20:02:24.713 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426873,ok=426873,error=0, records=41
[WARN ] 2026-06-01 20:02:32.777 [8083 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/799/stat), No such file or directory
[INFO ] 2026-06-01 20:02:37.317 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:02:37.778 [8116 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:02:39.718 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 20:02:39.718 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426874,ok=426874,error=0, records=41
[WARN ] 2026-06-01 20:02:47.782 [8116 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/799/stat), No such file or directory
[INFO ] 2026-06-01 20:02:48.582 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21355/300s
[INFO ] 2026-06-01 20:02:50.484 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21355/300s
[INFO ] 2026-06-01 20:02:52.317 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:02:52.783 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:02:54.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 20:02:54.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426875,ok=426875,error=0, records=41
[INFO ] 2026-06-01 20:02:57.783 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21355/300s
[INFO ] 2026-06-01 20:03:07.318 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:03:07.788 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:03:09.728 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-01 20:03:09.728 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426876,ok=426876,error=0, records=41
[INFO ] 2026-06-01 20:03:22.318 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:03:22.793 [8055 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:03:24.735 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 20:03:24.735 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426877,ok=426877,error=0, records=41
[INFO ] 2026-06-01 20:03:37.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 20:03:37.319 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 20:03:37.799 [8116 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:03:39.741 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 20:03:39.741 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426878,ok=426878,error=0, records=41
[INFO ] 2026-06-01 20:03:52.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:03:52.804 [8121 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:03:54.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 20:03:54.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426879,ok=426879,error=0, records=41
[INFO ] 2026-06-01 20:04:07.320 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:04:07.809 [8774 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:04:09.757 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-01 20:04:09.757 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426880,ok=426880,error=0, records=41
[INFO ] 2026-06-01 20:04:22.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:04:22.815 [8779 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:04:24.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 20:04:24.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426881,ok=426881,error=0, records=41
[INFO ] 2026-06-01 20:04:28.341 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17782/300s
[INFO ] 2026-06-01 20:04:28.343 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867932},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:04:28.512 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:04:28.512 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 20:04:28.512 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:04:28.512 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:04:28.512 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:04:28.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:04:28.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:04:28.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:04:28.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:04:28.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:04:28.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:04:37.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:04:37.820 [8809 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:04:39.775 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 20:04:39.775 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426882,ok=426882,error=0, records=41
[INFO ] 2026-06-01 20:04:52.322 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:04:52.825 [8779 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:04:54.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 20:04:54.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426883,ok=426883,error=0, records=41
[INFO ] 2026-06-01 20:05:01.370 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21359/300s
[INFO ] 2026-06-01 20:05:07.322 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:05:07.831 [8794 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:05:09.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 20:05:09.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426884,ok=426884,error=0, records=41
[INFO ] 2026-06-01 20:05:13.333 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21350/300s
[INFO ] 2026-06-01 20:05:22.323 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:05:22.836 [8823 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:05:24.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 20:05:24.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426885,ok=426885,error=0, records=41
[INFO ] 2026-06-01 20:05:37.323 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:05:37.842 [8794 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:05:39.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 20:05:39.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426886,ok=426886,error=0, records=41
[INFO ] 2026-06-01 20:05:39.797 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21346/300s
[INFO ] 2026-06-01 20:05:42.077 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21359/300s
[INFO ] 2026-06-01 20:05:52.324 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:05:52.848 [8055 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:05:54.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 20:05:54.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426887,ok=426887,error=0, records=41
[INFO ] 2026-06-01 20:06:07.324 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:06:07.854 [8055 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:06:09.882 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 20:06:09.882 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426888,ok=426888,error=0, records=41
[INFO ] 2026-06-01 20:06:22.325 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:06:22.860 [8873 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:06:24.886 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 20:06:24.886 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426889,ok=426889,error=0, records=41
[INFO ] 2026-06-01 20:06:32.154 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21346/300s
[INFO ] 2026-06-01 20:06:37.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:06:37.865 [8901 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:06:39.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-01 20:06:39.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426890,ok=426890,error=0, records=41
[INFO ] 2026-06-01 20:06:42.215 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21355/300s
[INFO ] 2026-06-01 20:06:52.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:06:52.870 [8850 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:06:54.897 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 20:06:54.897 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426891,ok=426891,error=0, records=41
[INFO ] 2026-06-01 20:07:07.327 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:07:07.327 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21358/300s
[WARN ] 2026-06-01 20:07:07.875 [8929 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:07:09.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 20:07:09.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426892,ok=426892,error=0, records=41
[INFO ] 2026-06-01 20:07:22.328 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:07:22.879 [8965 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:07:24.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-01 20:07:24.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426893,ok=426893,error=0, records=41
[INFO ] 2026-06-01 20:07:28.514 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867856},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:07:28.685 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:07:28.685 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 20:07:28.685 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:07:28.685 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:07:28.685 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:07:28.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:07:28.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:07:28.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:07:28.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:07:28.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:07:28.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:07:37.328 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:07:37.885 [8954 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:07:39.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 20:07:39.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426894,ok=426894,error=0, records=41
[INFO ] 2026-06-01 20:07:48.608 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21356/300s
[INFO ] 2026-06-01 20:07:50.509 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21356/300s
[INFO ] 2026-06-01 20:07:52.329 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:07:52.890 [8954 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:07:54.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-01 20:07:54.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426895,ok=426895,error=0, records=41
[INFO ] 2026-06-01 20:07:57.811 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21356/300s
[INFO ] 2026-06-01 20:08:07.330 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:08:07.896 [8954 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:08:09.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 20:08:09.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426896,ok=426896,error=0, records=41
[INFO ] 2026-06-01 20:08:22.330 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:08:22.900 [8989 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:08:24.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 20:08:24.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426897,ok=426897,error=0, records=41
[INFO ] 2026-06-01 20:08:37.331 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:08:37.905 [9039 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:08:40.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 20:08:40.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426898,ok=426898,error=0, records=41
[INFO ] 2026-06-01 20:08:52.331 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:08:52.332 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 20:08:52.911 [9061 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:08:55.040 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 20:08:55.040 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426899,ok=426899,error=0, records=41
[INFO ] 2026-06-01 20:09:07.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:09:07.917 [9061 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:09:10.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 20:09:10.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426900,ok=426900,error=0, records=41
[INFO ] 2026-06-01 20:09:22.334 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:09:22.922 [9093 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:09:25.150 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 20:09:25.150 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426901,ok=426901,error=0, records=41
[INFO ] 2026-06-01 20:09:37.334 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:09:37.926 [9077 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:09:40.156 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 20:09:40.156 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426902,ok=426902,error=0, records=41
[INFO ] 2026-06-01 20:09:52.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:09:52.932 [9126 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:09:55.161 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 20:09:55.161 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426903,ok=426903,error=0, records=41
[INFO ] 2026-06-01 20:10:01.373 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21360/300s
[INFO ] 2026-06-01 20:10:07.336 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:10:07.938 [9148 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:10:10.167 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 20:10:10.167 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426904,ok=426904,error=0, records=41
[INFO ] 2026-06-01 20:10:13.439 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21351/300s
[INFO ] 2026-06-01 20:10:22.336 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:10:22.942 [9171 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:10:25.173 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 20:10:25.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426905,ok=426905,error=0, records=41
[INFO ] 2026-06-01 20:10:28.685 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17783/300s
[INFO ] 2026-06-01 20:10:28.687 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867768},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:10:28.852 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:10:28.852 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 20:10:28.852 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:10:28.852 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:10:28.852 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:10:28.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:10:28.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:10:28.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:10:28.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:10:28.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:10:28.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:10:37.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:10:37.948 [9189 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:10:40.178 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 20:10:40.178 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426906,ok=426906,error=0, records=41
[INFO ] 2026-06-01 20:10:40.178 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21347/300s
[INFO ] 2026-06-01 20:10:42.084 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21360/300s
[INFO ] 2026-06-01 20:10:52.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:10:52.952 [9199 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:10:55.183 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 20:10:55.183 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426907,ok=426907,error=0, records=41
[INFO ] 2026-06-01 20:11:07.338 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:11:07.958 [9148 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:11:10.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 20:11:10.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426908,ok=426908,error=0, records=41
[INFO ] 2026-06-01 20:11:22.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:11:22.963 [9189 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:11:25.209 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 20:11:25.209 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426909,ok=426909,error=0, records=41
[INFO ] 2026-06-01 20:11:32.338 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21347/300s
[INFO ] 2026-06-01 20:11:37.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:11:37.969 [9199 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:11:40.216 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 20:11:40.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426910,ok=426910,error=0, records=41
[INFO ] 2026-06-01 20:11:42.271 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21356/300s
[INFO ] 2026-06-01 20:11:52.340 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:11:52.973 [9227 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:11:55.221 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 20:11:55.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426911,ok=426911,error=0, records=41
[INFO ] 2026-06-01 20:12:07.341 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:12:07.341 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21359/300s
[WARN ] 2026-06-01 20:12:07.980 [9254 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:12:10.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 20:12:10.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426912,ok=426912,error=0, records=41
[INFO ] 2026-06-01 20:12:22.341 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:12:22.985 [9227 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:12:25.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-06-01 20:12:25.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426913,ok=426913,error=0, records=41
[INFO ] 2026-06-01 20:12:37.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:12:37.989 [9296 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:12:40.241 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 20:12:40.241 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426914,ok=426914,error=0, records=41
[INFO ] 2026-06-01 20:12:48.667 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21357/300s
[INFO ] 2026-06-01 20:12:50.569 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21357/300s
[INFO ] 2026-06-01 20:12:52.343 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:12:52.994 [9148 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:12:55.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 20:12:55.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426915,ok=426915,error=0, records=41
[INFO ] 2026-06-01 20:12:57.874 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21357/300s
[INFO ] 2026-06-01 20:13:07.343 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:13:07.999 [9227 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:13:10.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 20:13:10.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426916,ok=426916,error=0, records=41
[INFO ] 2026-06-01 20:13:22.344 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:13:23.003 [9324 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:13:25.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 20:13:25.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426917,ok=426917,error=0, records=41
[INFO ] 2026-06-01 20:13:28.854 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867696},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:13:29.038 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:13:29.038 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 20:13:29.038 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:13:29.038 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:13:29.038 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:13:29.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:13:29.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:13:29.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:13:29.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:13:29.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:13:29.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:13:37.345 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 20:13:37.345 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 20:13:38.007 [9227 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:13:40.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 20:13:40.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426918,ok=426918,error=0, records=41
[INFO ] 2026-06-01 20:13:52.345 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:13:53.013 [9324 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:13:55.284 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 20:13:55.284 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426919,ok=426919,error=0, records=41
[INFO ] 2026-06-01 20:14:07.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:14:08.019 [9310 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:14:10.289 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 20:14:10.289 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426920,ok=426920,error=0, records=41
[INFO ] 2026-06-01 20:14:22.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:14:23.024 [9296 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:14:25.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 20:14:25.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426921,ok=426921,error=0, records=41
[INFO ] 2026-06-01 20:14:37.347 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:14:38.029 [9310 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:14:40.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 20:14:40.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426922,ok=426922,error=0, records=41
[INFO ] 2026-06-01 20:14:52.347 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:14:53.033 [9421 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:14:55.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 20:14:55.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426923,ok=426923,error=0, records=41
[INFO ] 2026-06-01 20:15:01.377 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21361/300s
[INFO ] 2026-06-01 20:15:07.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:15:08.038 [9227 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:15:10.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10365, records=41
[INFO ] 2026-06-01 20:15:10.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426924,ok=426924,error=0, records=41
[INFO ] 2026-06-01 20:15:13.539 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21352/300s
[INFO ] 2026-06-01 20:15:22.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:15:23.043 [9459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:15:25.317 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 20:15:25.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426925,ok=426925,error=0, records=41
[INFO ] 2026-06-01 20:15:37.349 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:15:38.048 [9459 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:15:40.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-01 20:15:40.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426926,ok=426926,error=0, records=41
[INFO ] 2026-06-01 20:15:40.323 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21348/300s
[INFO ] 2026-06-01 20:15:42.090 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21361/300s
[INFO ] 2026-06-01 20:15:52.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:15:53.053 [9488 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:15:55.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-01 20:15:55.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426927,ok=426927,error=0, records=41
[INFO ] 2026-06-01 20:16:07.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:16:07.559 [9497 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:16:10.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 20:16:10.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426928,ok=426928,error=0, records=41
[INFO ] 2026-06-01 20:16:22.351 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:16:22.563 [9525 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:16:25.340 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 20:16:25.340 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426929,ok=426929,error=0, records=41
[INFO ] 2026-06-01 20:16:29.038 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17784/300s
[INFO ] 2026-06-01 20:16:29.040 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867616},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:16:29.183 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:16:29.183 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 20:16:29.184 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:16:29.184 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:16:29.184 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:16:29.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:16:29.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:16:29.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:16:29.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:16:29.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:16:29.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:16:32.519 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21348/300s
[INFO ] 2026-06-01 20:16:37.351 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:16:37.568 [9538 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:16:40.344 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 20:16:40.345 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426930,ok=426930,error=0, records=41
[INFO ] 2026-06-01 20:16:42.323 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21357/300s
[INFO ] 2026-06-01 20:16:52.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:16:52.573 [9546 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:16:55.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 20:16:55.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426931,ok=426931,error=0, records=41
[INFO ] 2026-06-01 20:17:07.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:17:07.353 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21360/300s
[WARN ] 2026-06-01 20:17:07.577 [9575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:17:10.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 20:17:10.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426932,ok=426932,error=0, records=41
[INFO ] 2026-06-01 20:17:22.353 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:17:22.581 [9575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:17:25.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-01 20:17:25.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426933,ok=426933,error=0, records=41
[INFO ] 2026-06-01 20:17:37.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:17:37.586 [9575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:17:40.392 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-01 20:17:40.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426934,ok=426934,error=0, records=41
[INFO ] 2026-06-01 20:17:48.696 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21358/300s
[INFO ] 2026-06-01 20:17:50.598 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21358/300s
[INFO ] 2026-06-01 20:17:52.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:17:52.592 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:17:55.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 20:17:55.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426935,ok=426935,error=0, records=41
[INFO ] 2026-06-01 20:17:57.905 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21358/300s
[INFO ] 2026-06-01 20:18:07.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:18:07.598 [9640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:18:10.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 20:18:10.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426936,ok=426936,error=0, records=41
[INFO ] 2026-06-01 20:18:22.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:18:22.603 [9640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:18:25.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 20:18:25.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426937,ok=426937,error=0, records=41
[INFO ] 2026-06-01 20:18:37.356 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:18:37.608 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:18:40.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 20:18:40.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426938,ok=426938,error=0, records=41
[INFO ] 2026-06-01 20:18:52.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:18:52.614 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:18:55.521 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 20:18:55.521 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426939,ok=426939,error=0, records=41
[INFO ] 2026-06-01 20:19:07.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:19:07.620 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:19:10.529 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 20:19:10.529 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426940,ok=426940,error=0, records=41
[INFO ] 2026-06-01 20:19:22.358 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:19:22.625 [9575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:19:25.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-01 20:19:25.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426941,ok=426941,error=0, records=41
[INFO ] 2026-06-01 20:19:29.185 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867536},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:19:29.360 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:19:29.360 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 20:19:29.360 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:19:29.360 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:19:29.360 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:19:29.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:19:29.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:19:29.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:19:29.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:19:29.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:19:29.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:19:37.358 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:19:37.630 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:19:40.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-01 20:19:40.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426942,ok=426942,error=0, records=41
[INFO ] 2026-06-01 20:19:52.359 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:19:52.636 [9640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:19:55.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-01 20:19:55.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426943,ok=426943,error=0, records=41
[INFO ] 2026-06-01 20:20:01.380 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21362/300s
[INFO ] 2026-06-01 20:20:07.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:20:07.642 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:20:10.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 20:20:10.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426944,ok=426944,error=0, records=41
[INFO ] 2026-06-01 20:20:13.643 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21353/300s
[INFO ] 2026-06-01 20:20:22.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:20:22.647 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:20:25.566 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 20:20:25.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426945,ok=426945,error=0, records=41
[INFO ] 2026-06-01 20:20:37.361 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:20:37.652 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:20:40.572 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 20:20:40.572 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426946,ok=426946,error=0, records=41
[INFO ] 2026-06-01 20:20:40.573 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21349/300s
[INFO ] 2026-06-01 20:20:42.096 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21362/300s
[INFO ] 2026-06-01 20:20:52.361 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:20:52.657 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:20:55.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 20:20:55.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426947,ok=426947,error=0, records=41
[INFO ] 2026-06-01 20:21:07.362 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:21:07.665 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:21:10.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 20:21:10.586 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426948,ok=426948,error=0, records=41
[INFO ] 2026-06-01 20:21:22.362 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:21:22.669 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:21:25.593 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 20:21:25.593 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426949,ok=426949,error=0, records=41
[WARN ] 2026-06-01 20:21:32.674 [9645 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7822/stat), No such file or directory
[INFO ] 2026-06-01 20:21:32.699 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21349/300s
[INFO ] 2026-06-01 20:21:37.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:21:37.674 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:21:40.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 20:21:40.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426950,ok=426950,error=0, records=41
[INFO ] 2026-06-01 20:21:42.377 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21358/300s
[WARN ] 2026-06-01 20:21:47.679 [9640 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7822/stat), No such file or directory
[INFO ] 2026-06-01 20:21:52.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:21:52.680 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:21:55.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-01 20:21:55.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426951,ok=426951,error=0, records=41
[INFO ] 2026-06-01 20:22:07.364 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:22:07.364 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21361/300s
[WARN ] 2026-06-01 20:22:07.684 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:22:10.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 20:22:10.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426952,ok=426952,error=0, records=41
[INFO ] 2026-06-01 20:22:22.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:22:22.689 [9640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:22:25.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 20:22:25.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426953,ok=426953,error=0, records=41
[INFO ] 2026-06-01 20:22:29.361 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17785/300s
[INFO ] 2026-06-01 20:22:29.362 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867424},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:22:29.533 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:22:29.533 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 20:22:29.533 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:22:29.534 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:22:29.534 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:22:29.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:22:29.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:22:29.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:22:29.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:22:29.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:22:29.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:22:37.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:22:37.694 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:22:40.623 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 20:22:40.623 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426954,ok=426954,error=0, records=41
[INFO ] 2026-06-01 20:22:48.762 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21359/300s
[INFO ] 2026-06-01 20:22:50.664 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21359/300s
[INFO ] 2026-06-01 20:22:52.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:22:52.699 [9640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:22:55.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-01 20:22:55.631 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426955,ok=426955,error=0, records=41
[INFO ] 2026-06-01 20:22:57.970 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21359/300s
[INFO ] 2026-06-01 20:23:07.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:23:07.705 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:23:10.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 20:23:10.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426956,ok=426956,error=0, records=41
[INFO ] 2026-06-01 20:23:22.367 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:23:22.711 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:23:25.641 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 20:23:25.641 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426957,ok=426957,error=0, records=41
[INFO ] 2026-06-01 20:23:37.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 20:23:37.368 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 20:23:37.716 [9575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:23:40.646 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 20:23:40.646 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426958,ok=426958,error=0, records=41
[INFO ] 2026-06-01 20:23:52.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:23:52.369 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 20:23:52.721 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:23:55.652 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 20:23:55.652 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426959,ok=426959,error=0, records=41
[INFO ] 2026-06-01 20:24:07.370 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:24:07.725 [9640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:24:10.658 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 20:24:10.658 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426960,ok=426960,error=0, records=41
[INFO ] 2026-06-01 20:24:22.370 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:24:22.731 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:24:25.663 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 20:24:25.663 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426961,ok=426961,error=0, records=41
[INFO ] 2026-06-01 20:24:37.371 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:24:37.736 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:24:40.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 20:24:40.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426962,ok=426962,error=0, records=41
[INFO ] 2026-06-01 20:24:52.372 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:24:52.741 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:24:55.673 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 20:24:55.673 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426963,ok=426963,error=0, records=41
[INFO ] 2026-06-01 20:25:01.383 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21363/300s
[INFO ] 2026-06-01 20:25:07.372 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:25:07.746 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:25:10.677 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 20:25:10.677 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426964,ok=426964,error=0, records=41
[INFO ] 2026-06-01 20:25:13.747 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21354/300s
[INFO ] 2026-06-01 20:25:22.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:25:22.751 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:25:25.684 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 20:25:25.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426965,ok=426965,error=0, records=41
[INFO ] 2026-06-01 20:25:29.535 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867336},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:25:29.690 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:25:29.690 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 20:25:29.690 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:25:29.690 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:25:29.690 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:25:29.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:25:29.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:25:29.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:25:29.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:25:29.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:25:29.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:25:37.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:25:37.757 [9575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:25:40.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 20:25:40.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426966,ok=426966,error=0, records=41
[INFO ] 2026-06-01 20:25:40.689 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21350/300s
[INFO ] 2026-06-01 20:25:42.102 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21363/300s
[INFO ] 2026-06-01 20:25:52.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:25:52.762 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:25:55.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 20:25:55.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426967,ok=426967,error=0, records=41
[INFO ] 2026-06-01 20:26:07.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:26:07.767 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:26:10.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 20:26:10.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426968,ok=426968,error=0, records=41
[INFO ] 2026-06-01 20:26:22.375 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:26:22.771 [9615 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:26:25.707 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-01 20:26:25.707 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426969,ok=426969,error=0, records=41
[INFO ] 2026-06-01 20:26:32.881 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21350/300s
[INFO ] 2026-06-01 20:26:37.376 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:26:37.777 [9575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:26:40.713 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 20:26:40.713 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426970,ok=426970,error=0, records=41
[INFO ] 2026-06-01 20:26:42.440 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21359/300s
[INFO ] 2026-06-01 20:26:52.376 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:26:52.782 [9640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:26:55.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10151, records=41
[INFO ] 2026-06-01 20:26:55.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426971,ok=426971,error=0, records=41
[INFO ] 2026-06-01 20:27:07.377 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:27:07.377 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21362/300s
[WARN ] 2026-06-01 20:27:07.786 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:27:10.729 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-01 20:27:10.729 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426972,ok=426972,error=0, records=41
[INFO ] 2026-06-01 20:27:22.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:27:22.791 [9640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:27:25.735 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 20:27:25.735 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426973,ok=426973,error=0, records=41
[INFO ] 2026-06-01 20:27:37.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:27:37.797 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:27:40.775 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-01 20:27:40.775 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426974,ok=426974,error=0, records=41
[INFO ] 2026-06-01 20:27:48.817 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21360/300s
[INFO ] 2026-06-01 20:27:50.719 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21360/300s
[INFO ] 2026-06-01 20:27:52.379 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:27:52.802 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:27:55.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 20:27:55.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426975,ok=426975,error=0, records=41
[INFO ] 2026-06-01 20:27:58.025 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21360/300s
[INFO ] 2026-06-01 20:28:07.379 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:28:07.806 [10249] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:28:10.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 20:28:10.787 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426976,ok=426976,error=0, records=41
[INFO ] 2026-06-01 20:28:22.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:28:22.812 [10244] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:28:25.793 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 20:28:25.793 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426977,ok=426977,error=0, records=41
[INFO ] 2026-06-01 20:28:29.691 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17786/300s
[INFO ] 2026-06-01 20:28:29.692 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867260},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:28:29.873 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:28:29.873 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 20:28:29.873 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:28:29.873 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:28:29.873 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:28:29.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:28:29.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:28:29.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:28:29.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:28:29.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:28:29.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:28:37.381 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:28:37.819 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:28:40.798 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 20:28:40.798 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426978,ok=426978,error=0, records=41
[INFO ] 2026-06-01 20:28:52.381 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:28:52.824 [10244] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:28:55.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 20:28:55.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426979,ok=426979,error=0, records=41
[INFO ] 2026-06-01 20:29:07.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:29:07.829 [10259] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:29:10.816 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 20:29:10.816 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426980,ok=426980,error=0, records=41
[INFO ] 2026-06-01 20:29:22.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:29:22.835 [10321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:29:25.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 20:29:25.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426981,ok=426981,error=0, records=41
[INFO ] 2026-06-01 20:29:37.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:29:37.840 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:29:40.890 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 20:29:40.890 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426982,ok=426982,error=0, records=41
[INFO ] 2026-06-01 20:29:52.384 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:29:52.845 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:29:55.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 20:29:55.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426983,ok=426983,error=0, records=41
[INFO ] 2026-06-01 20:30:01.386 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21364/300s
[INFO ] 2026-06-01 20:30:07.384 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:30:07.849 [9645 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:30:10.906 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 20:30:10.906 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426984,ok=426984,error=0, records=41
[INFO ] 2026-06-01 20:30:13.851 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21355/300s
[INFO ] 2026-06-01 20:30:22.385 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:30:22.854 [10377] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:30:25.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 20:30:25.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426985,ok=426985,error=0, records=41
[INFO ] 2026-06-01 20:30:37.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:30:37.860 [10321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:30:40.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 20:30:40.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426986,ok=426986,error=0, records=41
[INFO ] 2026-06-01 20:30:40.919 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21351/300s
[INFO ] 2026-06-01 20:30:42.109 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21364/300s
[INFO ] 2026-06-01 20:30:52.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:30:52.864 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:30:55.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 20:30:55.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426987,ok=426987,error=0, records=41
[INFO ] 2026-06-01 20:31:07.387 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:31:07.869 [10259] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:31:10.928 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 20:31:10.928 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426988,ok=426988,error=0, records=41
[INFO ] 2026-06-01 20:31:22.387 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:31:22.873 [10259] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:31:25.934 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 20:31:25.934 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426989,ok=426989,error=0, records=41
[INFO ] 2026-06-01 20:31:29.874 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867176},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:31:30.035 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:31:30.035 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 20:31:30.035 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:31:30.035 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:31:30.035 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:31:30.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:31:30.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:31:30.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:31:30.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:31:30.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:31:30.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:31:33.064 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21351/300s
[INFO ] 2026-06-01 20:31:37.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:31:37.879 [9626 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:31:40.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 20:31:40.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426990,ok=426990,error=0, records=41
[INFO ] 2026-06-01 20:31:42.495 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21360/300s
[INFO ] 2026-06-01 20:31:52.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:31:52.884 [10468] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:31:55.944 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 20:31:55.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426991,ok=426991,error=0, records=41
[INFO ] 2026-06-01 20:32:07.389 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:32:07.389 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21363/300s
[WARN ] 2026-06-01 20:32:07.890 [10484] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:32:10.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 20:32:10.950 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426992,ok=426992,error=0, records=41
[INFO ] 2026-06-01 20:32:22.390 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:32:22.896 [10462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:32:25.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 20:32:25.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426993,ok=426993,error=0, records=41
[INFO ] 2026-06-01 20:32:37.390 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:32:37.902 [10501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:32:40.962 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-01 20:32:40.962 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426994,ok=426994,error=0, records=41
[INFO ] 2026-06-01 20:32:48.871 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21361/300s
[INFO ] 2026-06-01 20:32:50.773 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21361/300s
[INFO ] 2026-06-01 20:32:52.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:32:52.907 [10506] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:32:55.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 20:32:55.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426995,ok=426995,error=0, records=41
[INFO ] 2026-06-01 20:32:58.076 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21361/300s
[INFO ] 2026-06-01 20:33:07.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:33:07.914 [10556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:33:11.051 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 20:33:11.051 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426996,ok=426996,error=0, records=41
[INFO ] 2026-06-01 20:33:22.392 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:33:22.921 [10577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:33:26.056 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 20:33:26.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426997,ok=426997,error=0, records=41
[INFO ] 2026-06-01 20:33:37.392 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 20:33:37.393 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 20:33:37.926 [10551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:33:41.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 20:33:41.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426998,ok=426998,error=0, records=41
[INFO ] 2026-06-01 20:33:52.393 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:33:52.932 [10519] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:33:56.066 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 20:33:56.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=426999,ok=426999,error=0, records=41
[INFO ] 2026-06-01 20:34:07.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:34:07.936 [10551] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:34:11.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 20:34:11.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427000,ok=427000,error=0, records=41
[INFO ] 2026-06-01 20:34:22.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:34:22.942 [10609] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:34:26.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 20:34:26.077 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427001,ok=427001,error=0, records=41
[INFO ] 2026-06-01 20:34:30.035 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17787/300s
[INFO ] 2026-06-01 20:34:30.037 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867088},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:34:30.196 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:34:30.196 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 20:34:30.196 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:34:30.196 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:34:30.196 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:34:30.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:34:30.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:34:30.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:34:30.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:34:30.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:34:30.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:34:37.395 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:34:37.948 [10653] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:34:41.081 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10139, records=41
[INFO ] 2026-06-01 20:34:41.081 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427002,ok=427002,error=0, records=41
[INFO ] 2026-06-01 20:34:52.395 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:34:52.953 [10658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:34:56.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10150, records=41
[INFO ] 2026-06-01 20:34:56.089 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427003,ok=427003,error=0, records=41
[INFO ] 2026-06-01 20:35:01.389 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21365/300s
[INFO ] 2026-06-01 20:35:07.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:35:07.958 [10653] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:35:11.094 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-06-01 20:35:11.094 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427004,ok=427004,error=0, records=41
[INFO ] 2026-06-01 20:35:13.960 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21356/300s
[INFO ] 2026-06-01 20:35:22.397 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:35:22.964 [10620] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:35:26.098 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-01 20:35:26.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427005,ok=427005,error=0, records=41
[INFO ] 2026-06-01 20:35:37.397 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:35:37.969 [10658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:35:41.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 20:35:41.105 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427006,ok=427006,error=0, records=41
[INFO ] 2026-06-01 20:35:41.105 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21352/300s
[INFO ] 2026-06-01 20:35:42.115 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21365/300s
[INFO ] 2026-06-01 20:35:52.398 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:35:52.974 [10681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:35:56.111 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-01 20:35:56.111 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427007,ok=427007,error=0, records=41
[INFO ] 2026-06-01 20:36:07.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:36:07.979 [10658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:36:11.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 20:36:11.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427008,ok=427008,error=0, records=41
[INFO ] 2026-06-01 20:36:22.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:36:22.985 [10681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:36:26.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 20:36:26.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427009,ok=427009,error=0, records=41
[INFO ] 2026-06-01 20:36:33.243 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21352/300s
[INFO ] 2026-06-01 20:36:37.400 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:36:37.990 [10681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:36:41.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 20:36:41.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427010,ok=427010,error=0, records=41
[INFO ] 2026-06-01 20:36:42.547 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21361/300s
[INFO ] 2026-06-01 20:36:52.400 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:36:52.995 [10751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:36:56.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 20:36:56.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427011,ok=427011,error=0, records=41
[INFO ] 2026-06-01 20:37:07.401 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:37:07.401 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21364/300s
[WARN ] 2026-06-01 20:37:08.000 [10658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:37:11.142 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 20:37:11.142 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427012,ok=427012,error=0, records=41
[INFO ] 2026-06-01 20:37:22.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:37:23.004 [10779] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:37:26.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 20:37:26.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427013,ok=427013,error=0, records=41
[INFO ] 2026-06-01 20:37:30.198 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20867012},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:37:30.368 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:37:30.368 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 20:37:30.368 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:37:30.368 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:37:30.368 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:37:30.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:37:30.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:37:30.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:37:30.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:37:30.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:37:30.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:37:37.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:37:38.009 [10681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:37:41.152 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 20:37:41.152 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427014,ok=427014,error=0, records=41
[INFO ] 2026-06-01 20:37:48.908 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21362/300s
[INFO ] 2026-06-01 20:37:50.809 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21362/300s
[INFO ] 2026-06-01 20:37:52.403 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:37:53.013 [10779] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:37:56.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 20:37:56.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427015,ok=427015,error=0, records=41
[INFO ] 2026-06-01 20:37:58.114 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21362/300s
[INFO ] 2026-06-01 20:38:07.403 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:38:08.019 [10793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:38:11.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 20:38:11.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427016,ok=427016,error=0, records=41
[INFO ] 2026-06-01 20:38:22.404 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:38:23.024 [10681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:38:26.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 20:38:26.170 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427017,ok=427017,error=0, records=41
[INFO ] 2026-06-01 20:38:37.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:38:38.030 [10779] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:38:41.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 20:38:41.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427018,ok=427018,error=0, records=41
[INFO ] 2026-06-01 20:38:52.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:38:52.405 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 20:38:53.036 [10837] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:38:56.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 20:38:56.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427019,ok=427019,error=0, records=41
[INFO ] 2026-06-01 20:39:07.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:39:08.041 [10909] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:39:11.188 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 20:39:11.188 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427020,ok=427020,error=0, records=41
[INFO ] 2026-06-01 20:39:22.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:39:23.048 [10851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:39:26.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 20:39:26.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427021,ok=427021,error=0, records=41
[INFO ] 2026-06-01 20:39:37.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:39:37.554 [10937] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:39:41.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 20:39:41.312 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427022,ok=427022,error=0, records=41
[INFO ] 2026-06-01 20:39:52.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:39:52.560 [10962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:39:56.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 20:39:56.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427023,ok=427023,error=0, records=41
[INFO ] 2026-06-01 20:40:01.393 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21366/300s
[INFO ] 2026-06-01 20:40:07.409 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:40:07.565 [10950] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:40:11.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 20:40:11.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427024,ok=427024,error=0, records=41
[INFO ] 2026-06-01 20:40:14.066 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21357/300s
[INFO ] 2026-06-01 20:40:22.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:40:22.570 [10996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:40:26.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 20:40:26.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427025,ok=427025,error=0, records=41
[INFO ] 2026-06-01 20:40:30.368 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17788/300s
[INFO ] 2026-06-01 20:40:30.370 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866928},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:40:30.572 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:40:30.572 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 20:40:30.572 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:40:30.572 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:40:30.572 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:40:30.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:40:30.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:40:30.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:40:30.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:40:30.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:40:30.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:40:37.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:40:37.576 [10984] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:40:41.341 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 20:40:41.341 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427026,ok=427026,error=0, records=41
[INFO ] 2026-06-01 20:40:41.341 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21353/300s
[INFO ] 2026-06-01 20:40:42.121 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21366/300s
[INFO ] 2026-06-01 20:40:52.411 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:40:52.580 [11031] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:40:56.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 20:40:56.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427027,ok=427027,error=0, records=41
[INFO ] 2026-06-01 20:41:07.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:41:07.585 [11000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:41:11.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 20:41:11.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427028,ok=427028,error=0, records=41
[INFO ] 2026-06-01 20:41:22.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:41:22.592 [11067] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:41:26.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 20:41:26.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427029,ok=427029,error=0, records=41
[INFO ] 2026-06-01 20:41:33.425 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21353/300s
[INFO ] 2026-06-01 20:41:37.413 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:41:37.596 [11065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:41:41.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 20:41:41.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427030,ok=427030,error=0, records=41
[INFO ] 2026-06-01 20:41:42.598 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21362/300s
[INFO ] 2026-06-01 20:41:52.413 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:41:52.601 [11067] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:41:56.369 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 20:41:56.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427031,ok=427031,error=0, records=41
[INFO ] 2026-06-01 20:42:07.414 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:42:07.414 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21365/300s
[WARN ] 2026-06-01 20:42:07.605 [11065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:42:11.374 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 20:42:11.374 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427032,ok=427032,error=0, records=41
[INFO ] 2026-06-01 20:42:22.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:42:22.611 [11065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:42:26.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 20:42:26.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427033,ok=427033,error=0, records=41
[INFO ] 2026-06-01 20:42:37.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:42:37.615 [11065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:42:41.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 20:42:41.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427034,ok=427034,error=0, records=41
[INFO ] 2026-06-01 20:42:48.967 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21363/300s
[INFO ] 2026-06-01 20:42:50.868 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21363/300s
[INFO ] 2026-06-01 20:42:52.416 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:42:52.620 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:42:56.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 20:42:56.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427035,ok=427035,error=0, records=41
[INFO ] 2026-06-01 20:42:58.173 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21363/300s
[INFO ] 2026-06-01 20:43:07.417 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:43:07.625 [11067] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:43:11.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 20:43:11.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427036,ok=427036,error=0, records=41
[INFO ] 2026-06-01 20:43:22.417 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:43:22.630 [11067] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:43:26.405 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 20:43:26.405 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427037,ok=427037,error=0, records=41
[INFO ] 2026-06-01 20:43:30.574 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866852},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:43:30.729 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:43:30.729 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 20:43:30.729 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:43:30.729 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:43:30.729 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:43:30.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:43:30.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:43:30.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:43:30.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:43:30.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:43:30.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:43:37.418 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 20:43:37.418 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 20:43:37.636 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:43:41.410 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 20:43:41.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427038,ok=427038,error=0, records=41
[INFO ] 2026-06-01 20:43:52.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:43:52.640 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:43:56.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 20:43:56.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427039,ok=427039,error=0, records=41
[INFO ] 2026-06-01 20:44:07.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:44:07.645 [11067] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:44:11.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 20:44:11.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427040,ok=427040,error=0, records=41
[INFO ] 2026-06-01 20:44:22.420 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:44:22.649 [11065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:44:26.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 20:44:26.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427041,ok=427041,error=0, records=41
[INFO ] 2026-06-01 20:44:37.420 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:44:37.655 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:44:41.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 20:44:41.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427042,ok=427042,error=0, records=41
[INFO ] 2026-06-01 20:44:52.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:44:52.660 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:44:56.435 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 20:44:56.436 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427043,ok=427043,error=0, records=41
[INFO ] 2026-06-01 20:45:01.396 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21367/300s
[INFO ] 2026-06-01 20:45:07.422 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:45:07.665 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:45:11.441 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 20:45:11.441 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427044,ok=427044,error=0, records=41
[INFO ] 2026-06-01 20:45:14.167 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21358/300s
[INFO ] 2026-06-01 20:45:22.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:45:22.671 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:45:26.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-01 20:45:26.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427045,ok=427045,error=0, records=41
[INFO ] 2026-06-01 20:45:37.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:45:37.675 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:45:41.451 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-01 20:45:41.451 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427046,ok=427046,error=0, records=41
[INFO ] 2026-06-01 20:45:41.451 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21354/300s
[INFO ] 2026-06-01 20:45:42.127 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21367/300s
[INFO ] 2026-06-01 20:45:52.424 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:45:52.679 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:45:56.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 20:45:56.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427047,ok=427047,error=0, records=41
[INFO ] 2026-06-01 20:46:07.424 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:46:07.684 [11065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:46:11.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 20:46:11.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427048,ok=427048,error=0, records=41
[INFO ] 2026-06-01 20:46:22.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:46:22.690 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:46:26.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 20:46:26.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427049,ok=427049,error=0, records=41
[INFO ] 2026-06-01 20:46:30.729 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17789/300s
[INFO ] 2026-06-01 20:46:30.731 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866776},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:46:30.891 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:46:30.891 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 20:46:30.892 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:46:30.892 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:46:30.892 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:46:30.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:46:30.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:46:30.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:46:30.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:46:30.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:46:30.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:46:33.605 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21354/300s
[INFO ] 2026-06-01 20:46:37.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:46:37.695 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:46:41.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 20:46:41.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427050,ok=427050,error=0, records=41
[INFO ] 2026-06-01 20:46:42.656 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21363/300s
[INFO ] 2026-06-01 20:46:52.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:46:52.700 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:46:56.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 20:46:56.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427051,ok=427051,error=0, records=41
[INFO ] 2026-06-01 20:47:07.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:47:07.427 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21366/300s
[WARN ] 2026-06-01 20:47:07.706 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:47:11.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 20:47:11.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427052,ok=427052,error=0, records=41
[INFO ] 2026-06-01 20:47:22.427 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:47:22.712 [11086] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:47:26.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10144, records=41
[INFO ] 2026-06-01 20:47:26.609 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427053,ok=427053,error=0, records=41
[INFO ] 2026-06-01 20:47:37.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:47:37.716 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:47:41.614 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-01 20:47:41.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427054,ok=427054,error=0, records=41
[INFO ] 2026-06-01 20:47:49.022 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21364/300s
[INFO ] 2026-06-01 20:47:50.924 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21364/300s
[INFO ] 2026-06-01 20:47:52.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:47:52.720 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:47:56.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 20:47:56.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427055,ok=427055,error=0, records=41
[INFO ] 2026-06-01 20:47:58.230 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21364/300s
[INFO ] 2026-06-01 20:48:07.429 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:48:07.726 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:48:11.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 20:48:11.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427056,ok=427056,error=0, records=41
[INFO ] 2026-06-01 20:48:22.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:48:22.732 [11065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:48:26.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 20:48:26.631 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427057,ok=427057,error=0, records=41
[INFO ] 2026-06-01 20:48:37.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:48:37.736 [11065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:48:41.636 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 20:48:41.636 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427058,ok=427058,error=0, records=41
[INFO ] 2026-06-01 20:48:52.431 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:48:52.743 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:48:56.641 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 20:48:56.641 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427059,ok=427059,error=0, records=41
[INFO ] 2026-06-01 20:49:07.431 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:49:07.749 [11065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:49:11.646 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 20:49:11.646 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427060,ok=427060,error=0, records=41
[INFO ] 2026-06-01 20:49:22.432 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:49:22.755 [11067] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:49:26.702 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 20:49:26.702 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427061,ok=427061,error=0, records=41
[INFO ] 2026-06-01 20:49:30.894 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866680},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:49:31.062 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:49:31.062 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 20:49:31.062 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:49:31.062 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:49:31.062 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:49:31.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:49:31.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:49:31.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:49:31.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:49:31.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:49:31.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:49:37.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:49:37.761 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:49:41.707 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 20:49:41.707 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427062,ok=427062,error=0, records=41
[INFO ] 2026-06-01 20:49:52.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:49:52.766 [11065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:49:56.712 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 20:49:56.712 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427063,ok=427063,error=0, records=41
[INFO ] 2026-06-01 20:50:01.400 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21368/300s
[INFO ] 2026-06-01 20:50:07.434 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:50:07.770 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:50:11.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 20:50:11.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427064,ok=427064,error=0, records=41
[INFO ] 2026-06-01 20:50:14.272 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21359/300s
[INFO ] 2026-06-01 20:50:22.434 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:50:22.775 [11086] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:50:26.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-01 20:50:26.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427065,ok=427065,error=0, records=41
[INFO ] 2026-06-01 20:50:37.435 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:50:37.780 [11086] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:50:41.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 20:50:41.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427066,ok=427066,error=0, records=41
[INFO ] 2026-06-01 20:50:41.727 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21355/300s
[INFO ] 2026-06-01 20:50:42.134 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21368/300s
[INFO ] 2026-06-01 20:50:52.435 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:50:52.783 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:50:56.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-01 20:50:56.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427067,ok=427067,error=0, records=41
[INFO ] 2026-06-01 20:51:07.436 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:51:07.788 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:51:11.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 20:51:11.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427068,ok=427068,error=0, records=41
[INFO ] 2026-06-01 20:51:22.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:51:22.792 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:51:26.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 20:51:26.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427069,ok=427069,error=0, records=41
[INFO ] 2026-06-01 20:51:33.785 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21355/300s
[INFO ] 2026-06-01 20:51:37.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:51:37.797 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:51:41.750 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 20:51:41.751 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427070,ok=427070,error=0, records=41
[INFO ] 2026-06-01 20:51:42.709 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21364/300s
[INFO ] 2026-06-01 20:51:52.438 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:51:52.802 [11054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:51:56.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 20:51:56.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427071,ok=427071,error=0, records=41
[INFO ] 2026-06-01 20:52:07.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:52:07.439 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21367/300s
[WARN ] 2026-06-01 20:52:07.806 [11067] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:52:11.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 20:52:11.761 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427072,ok=427072,error=0, records=41
[INFO ] 2026-06-01 20:52:22.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:52:22.812 [11071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:52:26.766 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 20:52:26.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427073,ok=427073,error=0, records=41
[INFO ] 2026-06-01 20:52:31.062 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17790/300s
[INFO ] 2026-06-01 20:52:31.064 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866608},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:52:31.252 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:52:31.252 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 20:52:31.252 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:52:31.252 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:52:31.252 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:52:31.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:52:31.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:52:31.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:52:31.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:52:31.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:52:31.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:52:37.440 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:52:37.816 [11640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:52:41.773 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 20:52:41.773 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427074,ok=427074,error=0, records=41
[INFO ] 2026-06-01 20:52:49.088 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21365/300s
[INFO ] 2026-06-01 20:52:50.990 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21365/300s
[INFO ] 2026-06-01 20:52:52.440 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:52:52.821 [11660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:52:56.778 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 20:52:56.778 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427075,ok=427075,error=0, records=41
[INFO ] 2026-06-01 20:52:58.296 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21365/300s
[INFO ] 2026-06-01 20:53:07.441 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:53:07.827 [11704] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:53:11.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-06-01 20:53:11.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427076,ok=427076,error=0, records=41
[INFO ] 2026-06-01 20:53:22.442 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:53:22.833 [11660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:53:26.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 20:53:26.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427077,ok=427077,error=0, records=41
[INFO ] 2026-06-01 20:53:37.442 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 20:53:37.442 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 20:53:37.837 [11731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:53:41.793 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 20:53:41.793 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427078,ok=427078,error=0, records=41
[INFO ] 2026-06-01 20:53:52.443 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:53:52.443 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 20:53:52.843 [11645] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:53:56.799 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-01 20:53:56.799 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427079,ok=427079,error=0, records=41
[INFO ] 2026-06-01 20:54:07.444 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:54:07.848 [11645] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:54:11.804 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 20:54:11.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427080,ok=427080,error=0, records=41
[INFO ] 2026-06-01 20:54:22.445 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:54:22.854 [11769] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:54:26.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 20:54:26.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427081,ok=427081,error=0, records=41
[INFO ] 2026-06-01 20:54:37.446 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:54:37.860 [11783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:54:41.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 20:54:41.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427082,ok=427082,error=0, records=41
[INFO ] 2026-06-01 20:54:52.446 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:54:52.864 [11769] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:54:56.822 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 20:54:56.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427083,ok=427083,error=0, records=41
[INFO ] 2026-06-01 20:55:01.403 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21369/300s
[INFO ] 2026-06-01 20:55:07.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:55:07.870 [11811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:55:11.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 20:55:11.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427084,ok=427084,error=0, records=41
[INFO ] 2026-06-01 20:55:14.372 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21360/300s
[INFO ] 2026-06-01 20:55:22.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:55:22.875 [11830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:55:26.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-01 20:55:26.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427085,ok=427085,error=0, records=41
[INFO ] 2026-06-01 20:55:31.254 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866524},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:55:31.418 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:55:31.418 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 20:55:31.419 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:55:31.419 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:55:31.419 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:55:31.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:55:31.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:55:31.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:55:31.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:55:31.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:55:31.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:55:37.448 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:55:37.881 [11797] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:55:41.894 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 20:55:41.894 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427086,ok=427086,error=0, records=41
[INFO ] 2026-06-01 20:55:41.894 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21356/300s
[INFO ] 2026-06-01 20:55:42.140 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21369/300s
[INFO ] 2026-06-01 20:55:52.448 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:55:52.887 [11843] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:55:56.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 20:55:56.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427087,ok=427087,error=0, records=41
[INFO ] 2026-06-01 20:56:07.449 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:56:07.892 [11825] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:56:11.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 20:56:11.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427088,ok=427088,error=0, records=41
[INFO ] 2026-06-01 20:56:22.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:56:22.897 [11897] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:56:26.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-01 20:56:26.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427089,ok=427089,error=0, records=41
[INFO ] 2026-06-01 20:56:33.964 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21356/300s
[INFO ] 2026-06-01 20:56:37.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:56:37.902 [11825] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:56:41.942 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-01 20:56:41.942 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427090,ok=427090,error=0, records=41
[INFO ] 2026-06-01 20:56:42.766 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21365/300s
[INFO ] 2026-06-01 20:56:52.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:56:52.907 [11908] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:56:56.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10143, records=41
[INFO ] 2026-06-01 20:56:56.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427091,ok=427091,error=0, records=41
[INFO ] 2026-06-01 20:57:07.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 20:57:07.451 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21368/300s
[WARN ] 2026-06-01 20:57:07.911 [11903] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:57:11.953 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 20:57:11.953 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427092,ok=427092,error=0, records=41
[INFO ] 2026-06-01 20:57:22.452 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:57:22.916 [11960] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:57:26.958 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 20:57:26.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427093,ok=427093,error=0, records=41
[INFO ] 2026-06-01 20:57:37.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:57:37.921 [11970] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:57:41.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 20:57:41.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427094,ok=427094,error=0, records=41
[INFO ] 2026-06-01 20:57:49.149 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21366/300s
[INFO ] 2026-06-01 20:57:51.051 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21366/300s
[INFO ] 2026-06-01 20:57:52.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:57:52.926 [11991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:57:56.970 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 20:57:56.970 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427095,ok=427095,error=0, records=41
[INFO ] 2026-06-01 20:57:58.357 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21366/300s
[INFO ] 2026-06-01 20:58:07.454 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:58:07.932 [12002] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:58:11.983 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 20:58:11.983 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427096,ok=427096,error=0, records=41
[INFO ] 2026-06-01 20:58:22.454 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:58:22.938 [12007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:58:26.989 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 20:58:26.989 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427097,ok=427097,error=0, records=41
[INFO ] 2026-06-01 20:58:31.419 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17791/300s
[INFO ] 2026-06-01 20:58:31.420 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866452},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 20:58:31.583 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 20:58:31.583 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 20:58:31.584 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 20:58:31.584 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 20:58:31.584 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 20:58:31.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:58:31.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 20:58:31.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:58:31.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 20:58:31.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:58:31.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 20:58:37.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:58:37.943 [12007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:58:41.994 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 20:58:41.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427098,ok=427098,error=0, records=41
[INFO ] 2026-06-01 20:58:52.456 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:58:52.949 [12007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:58:57.001 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 20:58:57.001 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427099,ok=427099,error=0, records=41
[INFO ] 2026-06-01 20:59:07.456 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:59:07.955 [12028] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:59:12.007 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 20:59:12.007 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427100,ok=427100,error=0, records=41
[INFO ] 2026-06-01 20:59:22.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:59:22.960 [12028] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:59:27.014 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 20:59:27.014 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427101,ok=427101,error=0, records=41
[INFO ] 2026-06-01 20:59:37.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:59:37.966 [12069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:59:42.021 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 20:59:42.021 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427102,ok=427102,error=0, records=41
[INFO ] 2026-06-01 20:59:52.458 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 20:59:52.973 [12007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 20:59:57.026 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 20:59:57.026 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427103,ok=427103,error=0, records=41
[INFO ] 2026-06-01 21:00:01.407 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21370/300s
[INFO ] 2026-06-01 21:00:07.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:00:07.977 [12129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:00:12.034 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 21:00:12.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427104,ok=427104,error=0, records=41
[INFO ] 2026-06-01 21:00:14.480 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21361/300s
[INFO ] 2026-06-01 21:00:22.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:00:22.983 [12028] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:00:27.040 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 21:00:27.040 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427105,ok=427105,error=0, records=41
[INFO ] 2026-06-01 21:00:37.460 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:00:37.988 [12157] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:00:42.046 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 21:00:42.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427106,ok=427106,error=0, records=41
[INFO ] 2026-06-01 21:00:42.046 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21357/300s
[INFO ] 2026-06-01 21:00:42.147 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21370/300s
[INFO ] 2026-06-01 21:00:52.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:00:52.993 [12157] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:00:57.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 21:00:57.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427107,ok=427107,error=0, records=41
[INFO ] 2026-06-01 21:01:07.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:01:07.998 [12196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:01:12.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-01 21:01:12.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427108,ok=427108,error=0, records=41
[INFO ] 2026-06-01 21:01:22.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:01:23.004 [12129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:01:27.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-01 21:01:27.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427109,ok=427109,error=0, records=41
[INFO ] 2026-06-01 21:01:31.585 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866360},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:01:31.746 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:01:31.746 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 21:01:31.747 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:01:31.747 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:01:31.747 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:01:31.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:01:31.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:01:31.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:01:31.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:01:31.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:01:31.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:01:34.149 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21357/300s
[INFO ] 2026-06-01 21:01:37.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:01:38.009 [12129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:01:42.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-01 21:01:42.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427110,ok=427110,error=0, records=41
[INFO ] 2026-06-01 21:01:42.825 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21366/300s
[INFO ] 2026-06-01 21:01:52.463 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:01:53.014 [12028] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:01:57.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-01 21:01:57.077 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427111,ok=427111,error=0, records=41
[INFO ] 2026-06-01 21:02:07.464 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:02:07.464 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21369/300s
[WARN ] 2026-06-01 21:02:08.020 [12028] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:02:12.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 21:02:12.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427112,ok=427112,error=0, records=41
[INFO ] 2026-06-01 21:02:22.464 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:02:23.024 [12266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:02:27.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 21:02:27.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427113,ok=427113,error=0, records=41
[INFO ] 2026-06-01 21:02:37.465 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:02:38.029 [12028] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:02:42.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 21:02:42.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427114,ok=427114,error=0, records=41
[INFO ] 2026-06-01 21:02:49.227 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21367/300s
[INFO ] 2026-06-01 21:02:51.129 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21367/300s
[INFO ] 2026-06-01 21:02:52.465 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:02:53.033 [12129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:02:57.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 21:02:57.100 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427115,ok=427115,error=0, records=41
[INFO ] 2026-06-01 21:02:58.436 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21367/300s
[INFO ] 2026-06-01 21:03:07.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:03:08.037 [12280] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:03:12.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 21:03:12.105 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427116,ok=427116,error=0, records=41
[INFO ] 2026-06-01 21:03:22.467 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:03:23.043 [12304] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:03:27.111 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 21:03:27.112 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427117,ok=427117,error=0, records=41
[INFO ] 2026-06-01 21:03:37.467 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 21:03:37.468 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 21:03:38.048 [12325] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:03:42.116 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 21:03:42.116 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427118,ok=427118,error=0, records=41
[INFO ] 2026-06-01 21:03:52.468 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:03:53.054 [12326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:03:57.122 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 21:03:57.122 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427119,ok=427119,error=0, records=41
[INFO ] 2026-06-01 21:04:07.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:04:07.559 [12381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:04:12.131 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 21:04:12.131 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427120,ok=427120,error=0, records=41
[INFO ] 2026-06-01 21:04:22.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:04:22.564 [12381] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:04:27.136 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 21:04:27.136 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427121,ok=427121,error=0, records=41
[INFO ] 2026-06-01 21:04:31.747 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17792/300s
[INFO ] 2026-06-01 21:04:31.748 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866276},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:04:31.927 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:04:31.927 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 21:04:31.927 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:04:31.927 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:04:31.927 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:04:31.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:04:31.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:04:31.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:04:31.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:04:31.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:04:31.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:04:37.470 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:04:37.571 [12399] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:04:42.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 21:04:42.144 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427122,ok=427122,error=0, records=41
[INFO ] 2026-06-01 21:04:52.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:04:52.575 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:04:57.149 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 21:04:57.149 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427123,ok=427123,error=0, records=41
[INFO ] 2026-06-01 21:05:01.410 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21371/300s
[INFO ] 2026-06-01 21:05:07.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:05:07.579 [12451] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:05:12.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 21:05:12.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427124,ok=427124,error=0, records=41
[INFO ] 2026-06-01 21:05:14.581 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21362/300s
[INFO ] 2026-06-01 21:05:22.472 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:05:22.584 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:05:27.161 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 21:05:27.161 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427125,ok=427125,error=0, records=41
[INFO ] 2026-06-01 21:05:37.472 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:05:37.588 [12463] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:05:42.153 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21371/300s
[INFO ] 2026-06-01 21:05:42.167 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 21:05:42.167 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427126,ok=427126,error=0, records=41
[INFO ] 2026-06-01 21:05:42.167 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21358/300s
[INFO ] 2026-06-01 21:05:52.473 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:05:52.593 [12496] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:05:57.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 21:05:57.172 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427127,ok=427127,error=0, records=41
[INFO ] 2026-06-01 21:06:07.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:06:07.598 [12486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:06:12.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-01 21:06:12.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427128,ok=427128,error=0, records=41
[INFO ] 2026-06-01 21:06:22.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:06:22.602 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:06:27.190 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 21:06:27.191 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427129,ok=427129,error=0, records=41
[INFO ] 2026-06-01 21:06:34.334 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21358/300s
[INFO ] 2026-06-01 21:06:37.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:06:37.606 [12479] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:06:42.196 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 21:06:42.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427130,ok=427130,error=0, records=41
[INFO ] 2026-06-01 21:06:42.909 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21367/300s
[INFO ] 2026-06-01 21:06:52.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:06:52.612 [12486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:06:57.203 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 21:06:57.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427131,ok=427131,error=0, records=41
[INFO ] 2026-06-01 21:07:07.476 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:07:07.476 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21370/300s
[WARN ] 2026-06-01 21:07:07.617 [12501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:07:12.209 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 21:07:12.209 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427132,ok=427132,error=0, records=41
[INFO ] 2026-06-01 21:07:22.477 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:07:22.623 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:07:27.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 21:07:27.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427133,ok=427133,error=0, records=41
[INFO ] 2026-06-01 21:07:31.929 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866200},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:07:32.088 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:07:32.088 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 21:07:32.088 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:07:32.088 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:07:32.088 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:07:32.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:07:32.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:07:32.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:07:32.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:07:32.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:07:32.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:07:37.477 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:07:37.628 [12486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:07:42.220 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 21:07:42.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427134,ok=427134,error=0, records=41
[INFO ] 2026-06-01 21:07:49.305 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21368/300s
[INFO ] 2026-06-01 21:07:51.207 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21368/300s
[INFO ] 2026-06-01 21:07:52.478 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:07:52.634 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:07:57.225 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 21:07:57.225 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427135,ok=427135,error=0, records=41
[INFO ] 2026-06-01 21:07:58.478 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21368/300s
[INFO ] 2026-06-01 21:08:07.479 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:08:07.641 [12511] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:08:12.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 21:08:12.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427136,ok=427136,error=0, records=41
[INFO ] 2026-06-01 21:08:22.479 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:08:22.645 [12511] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:08:27.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 21:08:27.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427137,ok=427137,error=0, records=41
[INFO ] 2026-06-01 21:08:37.480 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:08:37.651 [12501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:08:42.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 21:08:42.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427138,ok=427138,error=0, records=41
[INFO ] 2026-06-01 21:08:52.481 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:08:52.481 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 21:08:52.656 [12501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:08:57.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 21:08:57.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427139,ok=427139,error=0, records=41
[INFO ] 2026-06-01 21:09:07.482 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:09:07.661 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:09:12.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 21:09:12.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427140,ok=427140,error=0, records=41
[INFO ] 2026-06-01 21:09:22.483 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:09:22.665 [12486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:09:27.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 21:09:27.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427141,ok=427141,error=0, records=41
[INFO ] 2026-06-01 21:09:37.483 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:09:37.669 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:09:42.294 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 21:09:42.294 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427142,ok=427142,error=0, records=41
[INFO ] 2026-06-01 21:09:52.484 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:09:52.673 [12501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:09:57.301 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 21:09:57.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427143,ok=427143,error=0, records=41
[INFO ] 2026-06-01 21:10:01.414 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21372/300s
[INFO ] 2026-06-01 21:10:07.485 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:10:07.678 [12479] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:10:12.308 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-01 21:10:12.308 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427144,ok=427144,error=0, records=41
[INFO ] 2026-06-01 21:10:14.680 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21363/300s
[INFO ] 2026-06-01 21:10:22.485 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:10:22.683 [12486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:10:27.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 21:10:27.314 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427145,ok=427145,error=0, records=41
[INFO ] 2026-06-01 21:10:32.089 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17793/300s
[INFO ] 2026-06-01 21:10:32.090 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866116},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:10:32.259 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:10:32.259 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 21:10:32.259 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:10:32.259 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:10:32.259 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:10:32.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:10:32.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:10:32.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:10:32.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:10:32.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:10:32.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:10:37.486 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=28.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:10:37.690 [12501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:10:42.160 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21372/300s
[INFO ] 2026-06-01 21:10:42.320 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 21:10:42.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427146,ok=427146,error=0, records=41
[INFO ] 2026-06-01 21:10:42.320 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21359/300s
[INFO ] 2026-06-01 21:10:52.486 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:10:52.694 [12479] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:10:57.324 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 21:10:57.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427147,ok=427147,error=0, records=41
[INFO ] 2026-06-01 21:11:07.487 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:11:07.698 [12501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:11:12.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 21:11:12.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427148,ok=427148,error=0, records=41
[INFO ] 2026-06-01 21:11:22.488 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:11:22.704 [12501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:11:27.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 21:11:27.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427149,ok=427149,error=0, records=41
[INFO ] 2026-06-01 21:11:34.515 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21359/300s
[INFO ] 2026-06-01 21:11:37.488 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:11:37.709 [12479] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:11:42.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 21:11:42.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427150,ok=427150,error=0, records=41
[INFO ] 2026-06-01 21:11:42.965 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21368/300s
[INFO ] 2026-06-01 21:11:52.489 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:11:52.714 [12486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:11:57.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 21:11:57.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427151,ok=427151,error=0, records=41
[INFO ] 2026-06-01 21:12:07.489 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:12:07.490 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21371/300s
[WARN ] 2026-06-01 21:12:07.721 [12479] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:12:12.356 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 21:12:12.356 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427152,ok=427152,error=0, records=41
[INFO ] 2026-06-01 21:12:22.490 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:12:22.727 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:12:27.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-01 21:12:27.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427153,ok=427153,error=0, records=41
[INFO ] 2026-06-01 21:12:37.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:12:37.732 [12501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:12:42.366 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 21:12:42.366 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427154,ok=427154,error=0, records=41
[INFO ] 2026-06-01 21:12:49.393 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21369/300s
[INFO ] 2026-06-01 21:12:51.295 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21369/300s
[INFO ] 2026-06-01 21:12:52.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:12:52.737 [12501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:12:57.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 21:12:57.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427155,ok=427155,error=0, records=41
[INFO ] 2026-06-01 21:12:58.534 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21369/300s
[INFO ] 2026-06-01 21:13:07.492 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:13:07.741 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:13:12.377 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 21:13:12.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427156,ok=427156,error=0, records=41
[INFO ] 2026-06-01 21:13:22.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:13:22.746 [12486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:13:27.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 21:13:27.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427157,ok=427157,error=0, records=41
[INFO ] 2026-06-01 21:13:32.260 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20866044},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:13:32.419 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:13:32.419 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 21:13:32.419 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:13:32.419 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:13:32.419 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:13:32.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:13:32.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:13:32.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:13:32.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:13:32.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:13:32.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:13:37.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 21:13:37.493 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 21:13:37.752 [12511] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:13:42.391 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 21:13:42.391 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427158,ok=427158,error=0, records=41
[INFO ] 2026-06-01 21:13:52.494 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:13:52.757 [12486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:13:57.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 21:13:57.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427159,ok=427159,error=0, records=41
[INFO ] 2026-06-01 21:14:07.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:14:07.762 [12511] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:14:12.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 21:14:12.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427160,ok=427160,error=0, records=41
[INFO ] 2026-06-01 21:14:22.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:14:22.767 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:14:27.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 21:14:27.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427161,ok=427161,error=0, records=41
[INFO ] 2026-06-01 21:14:37.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:14:37.773 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:14:42.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 21:14:42.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427162,ok=427162,error=0, records=41
[INFO ] 2026-06-01 21:14:52.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:14:52.779 [12486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:14:57.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 21:14:57.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427163,ok=427163,error=0, records=41
[INFO ] 2026-06-01 21:15:01.418 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21373/300s
[INFO ] 2026-06-01 21:15:07.497 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:15:07.784 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:15:12.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 21:15:12.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427164,ok=427164,error=0, records=41
[INFO ] 2026-06-01 21:15:14.787 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21364/300s
[INFO ] 2026-06-01 21:15:22.497 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:15:22.790 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:15:27.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 21:15:27.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427165,ok=427165,error=0, records=41
[INFO ] 2026-06-01 21:15:37.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:15:37.794 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:15:42.166 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21373/300s
[INFO ] 2026-06-01 21:15:42.478 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 21:15:42.478 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427166,ok=427166,error=0, records=41
[INFO ] 2026-06-01 21:15:42.478 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21360/300s
[INFO ] 2026-06-01 21:15:52.499 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:15:52.800 [12511] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:15:57.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 21:15:57.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427167,ok=427167,error=0, records=41
[INFO ] 2026-06-01 21:16:07.499 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:16:07.805 [12486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:16:12.525 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 21:16:12.526 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427168,ok=427168,error=0, records=41
[INFO ] 2026-06-01 21:16:22.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:16:22.811 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:16:27.531 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 21:16:27.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427169,ok=427169,error=0, records=41
[INFO ] 2026-06-01 21:16:32.419 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17794/300s
[INFO ] 2026-06-01 21:16:32.421 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865964},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:16:32.592 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:16:32.592 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 21:16:32.592 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:16:32.592 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:16:32.592 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:16:32.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:16:32.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:16:32.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:16:32.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:16:32.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:16:32.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:16:34.696 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21360/300s
[INFO ] 2026-06-01 21:16:37.501 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:16:37.815 [13099] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:16:42.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 21:16:42.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427170,ok=427170,error=0, records=41
[INFO ] 2026-06-01 21:16:43.017 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21369/300s
[INFO ] 2026-06-01 21:16:52.501 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:16:52.820 [13088] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:16:57.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 21:16:57.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427171,ok=427171,error=0, records=41
[INFO ] 2026-06-01 21:17:07.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:17:07.502 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21372/300s
[WARN ] 2026-06-01 21:17:07.826 [13099] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:17:12.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10146, records=41
[INFO ] 2026-06-01 21:17:12.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427172,ok=427172,error=0, records=41
[INFO ] 2026-06-01 21:17:22.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:17:22.830 [13088] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:17:27.555 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-01 21:17:27.555 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427173,ok=427173,error=0, records=41
[INFO ] 2026-06-01 21:17:37.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:17:37.835 [13088] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:17:42.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-01 21:17:42.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427174,ok=427174,error=0, records=41
[INFO ] 2026-06-01 21:17:49.447 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21370/300s
[INFO ] 2026-06-01 21:17:51.348 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21370/300s
[INFO ] 2026-06-01 21:17:52.504 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:17:52.840 [13099] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:17:57.566 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-01 21:17:57.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427175,ok=427175,error=0, records=41
[INFO ] 2026-06-01 21:17:58.560 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21370/300s
[INFO ] 2026-06-01 21:18:07.504 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:18:07.845 [13183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:18:12.572 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-01 21:18:12.572 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427176,ok=427176,error=0, records=41
[INFO ] 2026-06-01 21:18:22.505 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:18:22.850 [13183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:18:27.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 21:18:27.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427177,ok=427177,error=0, records=41
[INFO ] 2026-06-01 21:18:37.506 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:18:37.854 [13211] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:18:42.582 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 21:18:42.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427178,ok=427178,error=0, records=41
[INFO ] 2026-06-01 21:18:52.506 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:18:52.860 [13146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:18:57.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 21:18:57.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427179,ok=427179,error=0, records=41
[INFO ] 2026-06-01 21:19:07.507 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:19:07.865 [13118] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:19:12.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 21:19:12.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427180,ok=427180,error=0, records=41
[INFO ] 2026-06-01 21:19:22.507 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:19:22.871 [13146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:19:27.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 21:19:27.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427181,ok=427181,error=0, records=41
[INFO ] 2026-06-01 21:19:32.594 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865880},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:19:32.754 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:19:32.754 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 21:19:32.754 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:19:32.754 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:19:32.754 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:19:32.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:19:32.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:19:32.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:19:32.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:19:32.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:19:32.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:19:37.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:19:37.875 [13272] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:19:42.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 21:19:42.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427182,ok=427182,error=0, records=41
[INFO ] 2026-06-01 21:19:52.509 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:19:52.880 [13288] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:19:57.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 21:19:57.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427183,ok=427183,error=0, records=41
[INFO ] 2026-06-01 21:20:01.422 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21374/300s
[INFO ] 2026-06-01 21:20:07.509 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:20:07.885 [13288] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:20:12.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 21:20:12.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427184,ok=427184,error=0, records=41
[INFO ] 2026-06-01 21:20:14.887 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21365/300s
[INFO ] 2026-06-01 21:20:22.510 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:20:22.890 [13304] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:20:27.623 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 21:20:27.623 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427185,ok=427185,error=0, records=41
[INFO ] 2026-06-01 21:20:37.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:20:37.894 [13338] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:20:42.172 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21374/300s
[INFO ] 2026-06-01 21:20:42.628 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 21:20:42.628 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427186,ok=427186,error=0, records=41
[INFO ] 2026-06-01 21:20:42.628 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21361/300s
[INFO ] 2026-06-01 21:20:52.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:20:52.900 [13359] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:20:57.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 21:20:57.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427187,ok=427187,error=0, records=41
[INFO ] 2026-06-01 21:21:07.512 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:21:07.906 [13347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:21:12.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-01 21:21:12.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427188,ok=427188,error=0, records=41
[INFO ] 2026-06-01 21:21:22.512 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:21:22.914 [13364] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:21:27.676 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 21:21:27.676 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427189,ok=427189,error=0, records=41
[INFO ] 2026-06-01 21:21:34.878 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21361/300s
[INFO ] 2026-06-01 21:21:37.513 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:21:37.920 [13386] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:21:42.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 21:21:42.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427190,ok=427190,error=0, records=41
[INFO ] 2026-06-01 21:21:43.071 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21370/300s
[INFO ] 2026-06-01 21:21:52.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:21:52.926 [13425] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:21:57.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-01 21:21:57.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427191,ok=427191,error=0, records=41
[INFO ] 2026-06-01 21:22:07.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:22:07.514 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21373/300s
[WARN ] 2026-06-01 21:22:07.933 [13419] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:22:12.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 21:22:12.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427192,ok=427192,error=0, records=41
[INFO ] 2026-06-01 21:22:22.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:22:22.940 [13449] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:22:27.701 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 21:22:27.701 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427193,ok=427193,error=0, records=41
[WARN ] 2026-06-01 21:22:32.445 [13438] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/9855/stat), No such file or directory
[INFO ] 2026-06-01 21:22:32.754 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17795/300s
[INFO ] 2026-06-01 21:22:32.755 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865792},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:22:32.918 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:22:32.918 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 21:22:32.918 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:22:32.919 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:22:32.919 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:22:32.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:22:32.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:22:32.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:22:32.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:22:32.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:22:32.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:22:37.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:22:37.946 [13467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:22:42.706 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-01 21:22:42.706 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427194,ok=427194,error=0, records=41
[WARN ] 2026-06-01 21:22:47.451 [13467] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/9855/stat), No such file or directory
[INFO ] 2026-06-01 21:22:49.496 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21371/300s
[INFO ] 2026-06-01 21:22:51.397 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21371/300s
[INFO ] 2026-06-01 21:22:52.516 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:22:52.951 [13467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:22:57.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 21:22:57.711 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427195,ok=427195,error=0, records=41
[INFO ] 2026-06-01 21:22:58.601 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21371/300s
[INFO ] 2026-06-01 21:23:07.516 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:23:07.956 [13483] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:23:12.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 21:23:12.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427196,ok=427196,error=0, records=41
[INFO ] 2026-06-01 21:23:22.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:23:22.960 [13489] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:23:27.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 21:23:27.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427197,ok=427197,error=0, records=41
[INFO ] 2026-06-01 21:23:37.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 21:23:37.517 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 21:23:37.965 [13473] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:23:42.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 21:23:42.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427198,ok=427198,error=0, records=41
[INFO ] 2026-06-01 21:23:52.518 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:23:52.518 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 21:23:52.970 [13483] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:23:57.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 21:23:57.737 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427199,ok=427199,error=0, records=41
[INFO ] 2026-06-01 21:24:07.519 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:24:07.975 [13489] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:24:12.742 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-01 21:24:12.742 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427200,ok=427200,error=0, records=41
[INFO ] 2026-06-01 21:24:22.519 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:24:22.980 [13544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:24:27.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 21:24:27.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427201,ok=427201,error=0, records=41
[INFO ] 2026-06-01 21:24:37.520 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:24:37.985 [13586] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:24:42.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 21:24:42.755 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427202,ok=427202,error=0, records=41
[INFO ] 2026-06-01 21:24:52.520 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:24:52.991 [13558] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:24:57.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-06-01 21:24:57.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427203,ok=427203,error=0, records=41
[INFO ] 2026-06-01 21:25:01.424 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21375/300s
[INFO ] 2026-06-01 21:25:07.521 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:25:07.996 [13558] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:25:12.768 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 21:25:12.768 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427204,ok=427204,error=0, records=41
[INFO ] 2026-06-01 21:25:14.997 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21366/300s
[INFO ] 2026-06-01 21:25:22.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:25:23.000 [13586] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:25:27.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 21:25:27.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427205,ok=427205,error=0, records=41
[INFO ] 2026-06-01 21:25:32.920 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865712},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:25:33.069 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:25:33.069 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-01 21:25:33.069 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:25:33.069 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:25:33.069 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:25:33.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:25:33.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:25:33.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:25:33.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:25:33.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:25:33.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:25:37.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:25:38.005 [13641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:25:42.178 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21375/300s
[INFO ] 2026-06-01 21:25:42.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 21:25:42.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427206,ok=427206,error=0, records=41
[INFO ] 2026-06-01 21:25:42.781 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21362/300s
[INFO ] 2026-06-01 21:25:52.523 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:25:53.010 [13586] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:25:57.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 21:25:57.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427207,ok=427207,error=0, records=41
[INFO ] 2026-06-01 21:26:07.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:26:08.015 [13670] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:26:12.795 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 21:26:12.795 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427208,ok=427208,error=0, records=41
[INFO ] 2026-06-01 21:26:22.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:26:23.021 [13656] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:26:27.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 21:26:27.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427209,ok=427209,error=0, records=41
[INFO ] 2026-06-01 21:26:35.052 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21362/300s
[INFO ] 2026-06-01 21:26:37.525 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:26:38.026 [13656] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:26:42.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 21:26:42.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427210,ok=427210,error=0, records=41
[INFO ] 2026-06-01 21:26:43.117 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21371/300s
[INFO ] 2026-06-01 21:26:52.525 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:26:53.030 [13670] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:26:57.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 21:26:57.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427211,ok=427211,error=0, records=41
[INFO ] 2026-06-01 21:27:07.526 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:27:07.526 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21374/300s
[WARN ] 2026-06-01 21:27:08.035 [13656] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:27:12.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 21:27:12.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427212,ok=427212,error=0, records=41
[INFO ] 2026-06-01 21:27:22.526 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:27:23.041 [13735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:27:27.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 21:27:27.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427213,ok=427213,error=0, records=41
[INFO ] 2026-06-01 21:27:37.527 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:27:38.047 [13747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:27:42.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-01 21:27:42.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427214,ok=427214,error=0, records=41
[INFO ] 2026-06-01 21:27:49.518 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21372/300s
[INFO ] 2026-06-01 21:27:51.419 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21372/300s
[INFO ] 2026-06-01 21:27:52.528 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:27:53.052 [13771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:27:57.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-01 21:27:57.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427215,ok=427215,error=0, records=41
[INFO ] 2026-06-01 21:27:58.626 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21372/300s
[INFO ] 2026-06-01 21:28:07.528 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:28:07.558 [13778] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:28:12.868 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 21:28:12.868 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427216,ok=427216,error=0, records=41
[INFO ] 2026-06-01 21:28:22.529 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:28:22.563 [13820] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:28:27.875 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 21:28:27.875 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427217,ok=427217,error=0, records=41
[INFO ] 2026-06-01 21:28:33.069 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17796/300s
[INFO ] 2026-06-01 21:28:33.071 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865632},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:28:33.229 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:28:33.229 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 21:28:33.229 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:28:33.229 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:28:33.229 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:28:33.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:28:33.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:28:33.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:28:33.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:28:33.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:28:33.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:28:37.530 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:28:37.567 [13831] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:28:42.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 21:28:42.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427218,ok=427218,error=0, records=41
[INFO ] 2026-06-01 21:28:52.530 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:28:52.571 [13831] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:28:57.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 21:28:57.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427219,ok=427219,error=0, records=41
[INFO ] 2026-06-01 21:29:07.531 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:29:07.578 [13845] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:29:12.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 21:29:12.896 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427220,ok=427220,error=0, records=41
[INFO ] 2026-06-01 21:29:22.532 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:29:22.583 [13884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:29:27.901 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 21:29:27.901 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427221,ok=427221,error=0, records=41
[INFO ] 2026-06-01 21:29:37.532 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:29:37.589 [13907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:29:42.906 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 21:29:42.906 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427222,ok=427222,error=0, records=41
[INFO ] 2026-06-01 21:29:52.533 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:29:52.595 [13907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:29:57.911 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 21:29:57.911 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427223,ok=427223,error=0, records=41
[INFO ] 2026-06-01 21:30:01.428 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21376/300s
[INFO ] 2026-06-01 21:30:07.533 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:30:07.600 [13890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:30:12.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 21:30:12.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427224,ok=427224,error=0, records=41
[INFO ] 2026-06-01 21:30:15.103 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21367/300s
[INFO ] 2026-06-01 21:30:22.534 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:30:22.605 [13907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:30:27.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 21:30:27.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427225,ok=427225,error=0, records=41
[INFO ] 2026-06-01 21:30:37.535 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:30:37.610 [13851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:30:42.184 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21376/300s
[INFO ] 2026-06-01 21:30:42.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 21:30:42.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427226,ok=427226,error=0, records=41
[INFO ] 2026-06-01 21:30:42.930 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21363/300s
[INFO ] 2026-06-01 21:30:52.535 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:30:52.616 [13921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:30:57.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 21:30:57.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427227,ok=427227,error=0, records=41
[INFO ] 2026-06-01 21:31:07.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:31:07.621 [13851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:31:12.944 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-01 21:31:12.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427228,ok=427228,error=0, records=41
[INFO ] 2026-06-01 21:31:22.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:31:22.626 [13884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:31:27.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 21:31:27.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427229,ok=427229,error=0, records=41
[INFO ] 2026-06-01 21:31:33.231 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865552},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:31:33.391 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:31:33.391 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 21:31:33.391 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:31:33.391 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:31:33.391 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:31:33.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:31:33.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:31:33.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:31:33.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:31:33.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:31:33.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:31:35.232 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21363/300s
[INFO ] 2026-06-01 21:31:37.537 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:31:37.631 [13884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:31:42.961 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-01 21:31:42.961 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427230,ok=427230,error=0, records=41
[INFO ] 2026-06-01 21:31:43.170 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21372/300s
[INFO ] 2026-06-01 21:31:52.538 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:31:52.635 [13851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:31:57.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 21:31:57.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427231,ok=427231,error=0, records=41
[INFO ] 2026-06-01 21:32:07.538 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:32:07.538 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21375/300s
[WARN ] 2026-06-01 21:32:07.640 [13890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:32:12.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 21:32:12.973 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427232,ok=427232,error=0, records=41
[INFO ] 2026-06-01 21:32:22.539 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:32:22.646 [13890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:32:27.978 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 21:32:27.978 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427233,ok=427233,error=0, records=41
[INFO ] 2026-06-01 21:32:37.539 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:32:37.651 [13884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:32:42.984 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 21:32:42.984 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427234,ok=427234,error=0, records=41
[INFO ] 2026-06-01 21:32:49.580 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21373/300s
[INFO ] 2026-06-01 21:32:51.482 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21373/300s
[INFO ] 2026-06-01 21:32:52.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:32:52.656 [13890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:32:57.988 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 21:32:57.988 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427235,ok=427235,error=0, records=41
[INFO ] 2026-06-01 21:32:58.685 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21373/300s
[INFO ] 2026-06-01 21:33:07.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:33:07.662 [13890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:33:12.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 21:33:12.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427236,ok=427236,error=0, records=41
[INFO ] 2026-06-01 21:33:22.541 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:33:22.667 [13921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:33:28.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-01 21:33:28.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427237,ok=427237,error=0, records=41
[INFO ] 2026-06-01 21:33:37.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 21:33:37.542 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 21:33:37.672 [13884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:33:43.008 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 21:33:43.008 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427238,ok=427238,error=0, records=41
[INFO ] 2026-06-01 21:33:52.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:33:52.679 [13890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:33:58.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 21:33:58.055 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427239,ok=427239,error=0, records=41
[INFO ] 2026-06-01 21:34:07.543 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:34:07.686 [13907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:34:13.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 21:34:13.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427240,ok=427240,error=0, records=41
[INFO ] 2026-06-01 21:34:22.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:34:22.690 [13890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:34:28.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 21:34:28.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427241,ok=427241,error=0, records=41
[INFO ] 2026-06-01 21:34:33.392 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17797/300s
[INFO ] 2026-06-01 21:34:33.393 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865464},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:34:33.570 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:34:33.570 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 21:34:33.570 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:34:33.570 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:34:33.570 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:34:33.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:34:33.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:34:33.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:34:33.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:34:33.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:34:33.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:34:37.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:34:37.695 [13884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:34:43.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 21:34:43.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427242,ok=427242,error=0, records=41
[INFO ] 2026-06-01 21:34:52.545 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:34:52.701 [13907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:34:58.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 21:34:58.089 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427243,ok=427243,error=0, records=41
[INFO ] 2026-06-01 21:35:01.432 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21377/300s
[INFO ] 2026-06-01 21:35:07.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:35:07.707 [13890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:35:13.100 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 21:35:13.100 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427244,ok=427244,error=0, records=41
[INFO ] 2026-06-01 21:35:15.208 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21368/300s
[INFO ] 2026-06-01 21:35:22.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:35:22.711 [13921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:35:28.106 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 21:35:28.106 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427245,ok=427245,error=0, records=41
[INFO ] 2026-06-01 21:35:37.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:35:37.717 [13851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:35:42.190 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21377/300s
[INFO ] 2026-06-01 21:35:43.112 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 21:35:43.112 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427246,ok=427246,error=0, records=41
[INFO ] 2026-06-01 21:35:43.112 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21364/300s
[INFO ] 2026-06-01 21:35:52.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:35:52.720 [13884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:35:58.140 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 21:35:58.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427247,ok=427247,error=0, records=41
[INFO ] 2026-06-01 21:36:07.548 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:36:07.725 [13851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:36:13.146 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 21:36:13.146 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427248,ok=427248,error=0, records=41
[INFO ] 2026-06-01 21:36:22.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:36:22.729 [13921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:36:28.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 21:36:28.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427249,ok=427249,error=0, records=41
[INFO ] 2026-06-01 21:36:35.414 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21364/300s
[INFO ] 2026-06-01 21:36:37.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:36:37.734 [13884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:36:43.162 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 21:36:43.162 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427250,ok=427250,error=0, records=41
[INFO ] 2026-06-01 21:36:43.229 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21373/300s
[INFO ] 2026-06-01 21:36:52.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:36:52.739 [13921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:36:58.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 21:36:58.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427251,ok=427251,error=0, records=41
[INFO ] 2026-06-01 21:37:07.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:37:07.551 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21376/300s
[WARN ] 2026-06-01 21:37:07.744 [13884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:37:13.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 21:37:13.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427252,ok=427252,error=0, records=41
[INFO ] 2026-06-01 21:37:22.551 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:37:22.748 [13851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:37:28.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 21:37:28.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427253,ok=427253,error=0, records=41
[INFO ] 2026-06-01 21:37:33.572 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865388},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:37:33.738 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:37:33.738 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 21:37:33.738 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:37:33.738 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:37:33.738 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:37:33.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:37:33.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:37:33.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:37:33.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:37:33.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:37:33.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:37:37.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:37:37.753 [13921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:37:43.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 21:37:43.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427254,ok=427254,error=0, records=41
[INFO ] 2026-06-01 21:37:49.637 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21374/300s
[INFO ] 2026-06-01 21:37:51.539 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21374/300s
[INFO ] 2026-06-01 21:37:52.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:37:52.758 [13921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:37:58.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 21:37:58.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427255,ok=427255,error=0, records=41
[INFO ] 2026-06-01 21:37:58.746 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21374/300s
[INFO ] 2026-06-01 21:38:07.553 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:38:07.762 [13851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:38:13.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 21:38:13.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427256,ok=427256,error=0, records=41
[INFO ] 2026-06-01 21:38:22.553 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:38:22.767 [13851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:38:28.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-01 21:38:28.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427257,ok=427257,error=0, records=41
[INFO ] 2026-06-01 21:38:37.554 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:38:37.771 [13890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:38:43.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 21:38:43.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427258,ok=427258,error=0, records=41
[INFO ] 2026-06-01 21:38:52.555 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:38:52.555 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 21:38:52.776 [13851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:38:58.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10354, records=41
[INFO ] 2026-06-01 21:38:58.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427259,ok=427259,error=0, records=41
[INFO ] 2026-06-01 21:39:07.556 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:39:07.780 [13851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:39:13.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 21:39:13.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427260,ok=427260,error=0, records=41
[INFO ] 2026-06-01 21:39:22.557 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:39:22.786 [13907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:39:28.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 21:39:28.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427261,ok=427261,error=0, records=41
[INFO ] 2026-06-01 21:39:37.557 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:39:37.791 [13890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:39:43.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 21:39:43.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427262,ok=427262,error=0, records=41
[INFO ] 2026-06-01 21:39:52.558 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:39:52.796 [13890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:39:58.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 21:39:58.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427263,ok=427263,error=0, records=41
[INFO ] 2026-06-01 21:40:01.435 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21378/300s
[INFO ] 2026-06-01 21:40:07.559 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:40:07.803 [13851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:40:13.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 21:40:13.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427264,ok=427264,error=0, records=41
[INFO ] 2026-06-01 21:40:15.306 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21369/300s
[INFO ] 2026-06-01 21:40:22.559 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:40:22.808 [14480] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:40:28.273 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-01 21:40:28.273 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427265,ok=427265,error=0, records=41
[INFO ] 2026-06-01 21:40:33.739 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17798/300s
[INFO ] 2026-06-01 21:40:33.740 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865308},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:40:33.895 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:40:33.895 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 21:40:33.895 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:40:33.895 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:40:33.895 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:40:33.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:40:33.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:40:33.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:40:33.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:40:33.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:40:33.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:40:37.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:40:37.813 [14470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:40:42.197 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21378/300s
[INFO ] 2026-06-01 21:40:43.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 21:40:43.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427266,ok=427266,error=0, records=41
[INFO ] 2026-06-01 21:40:43.315 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21365/300s
[INFO ] 2026-06-01 21:40:52.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:40:52.819 [14480] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:40:58.320 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10146, records=41
[INFO ] 2026-06-01 21:40:58.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427267,ok=427267,error=0, records=41
[INFO ] 2026-06-01 21:41:07.561 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:41:07.824 [14495] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:41:13.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-01 21:41:13.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427268,ok=427268,error=0, records=41
[INFO ] 2026-06-01 21:41:22.561 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:41:22.829 [14480] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:41:28.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 21:41:28.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427269,ok=427269,error=0, records=41
[INFO ] 2026-06-01 21:41:35.596 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21365/300s
[INFO ] 2026-06-01 21:41:37.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:41:37.834 [14480] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:41:43.283 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21374/300s
[INFO ] 2026-06-01 21:41:43.389 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 21:41:43.389 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427270,ok=427270,error=0, records=41
[INFO ] 2026-06-01 21:41:52.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:41:52.838 [14516] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:41:58.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 21:41:58.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427271,ok=427271,error=0, records=41
[INFO ] 2026-06-01 21:42:07.563 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:42:07.563 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21377/300s
[WARN ] 2026-06-01 21:42:07.844 [14495] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:42:13.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 21:42:13.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427272,ok=427272,error=0, records=41
[INFO ] 2026-06-01 21:42:22.564 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:42:22.849 [14530] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:42:28.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 21:42:28.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427273,ok=427273,error=0, records=41
[INFO ] 2026-06-01 21:42:37.564 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:42:37.853 [14516] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:42:43.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-01 21:42:43.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427274,ok=427274,error=0, records=41
[INFO ] 2026-06-01 21:42:49.687 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21375/300s
[INFO ] 2026-06-01 21:42:51.589 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21375/300s
[INFO ] 2026-06-01 21:42:52.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:42:52.858 [14608] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:42:58.461 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-01 21:42:58.461 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427275,ok=427275,error=0, records=41
[INFO ] 2026-06-01 21:42:58.796 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21375/300s
[INFO ] 2026-06-01 21:43:07.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:43:07.863 [14636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:43:13.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 21:43:13.465 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427276,ok=427276,error=0, records=41
[INFO ] 2026-06-01 21:43:22.566 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:43:22.867 [14516] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:43:28.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 21:43:28.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427277,ok=427277,error=0, records=41
[INFO ] 2026-06-01 21:43:33.897 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865232},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:43:34.074 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:43:34.074 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 21:43:34.074 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:43:34.074 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:43:34.074 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:43:34.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:43:34.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:43:34.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:43:34.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:43:34.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:43:34.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:43:37.567 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 21:43:37.567 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 21:43:37.873 [14650] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:43:43.485 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 21:43:43.485 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427278,ok=427278,error=0, records=41
[INFO ] 2026-06-01 21:43:52.567 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:43:52.877 [14516] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:43:58.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 21:43:58.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427279,ok=427279,error=0, records=41
[INFO ] 2026-06-01 21:44:07.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:44:07.882 [14691] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:44:13.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 21:44:13.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427280,ok=427280,error=0, records=41
[INFO ] 2026-06-01 21:44:22.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:44:22.888 [14697] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:44:28.499 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 21:44:28.499 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427281,ok=427281,error=0, records=41
[INFO ] 2026-06-01 21:44:37.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:44:37.892 [14729] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:44:43.508 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 21:44:43.508 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427282,ok=427282,error=0, records=41
[INFO ] 2026-06-01 21:44:52.570 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:44:52.897 [14729] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:44:58.513 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 21:44:58.513 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427283,ok=427283,error=0, records=41
[INFO ] 2026-06-01 21:45:01.439 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21379/300s
[INFO ] 2026-06-01 21:45:07.570 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:45:07.903 [14681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:45:13.519 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 21:45:13.519 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427284,ok=427284,error=0, records=41
[INFO ] 2026-06-01 21:45:15.405 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21370/300s
[INFO ] 2026-06-01 21:45:22.571 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:45:22.908 [14783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:45:28.524 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 21:45:28.524 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427285,ok=427285,error=0, records=41
[INFO ] 2026-06-01 21:45:37.572 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:45:37.913 [14778] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:45:42.203 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21379/300s
[INFO ] 2026-06-01 21:45:43.529 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 21:45:43.529 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427286,ok=427286,error=0, records=41
[INFO ] 2026-06-01 21:45:43.529 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21366/300s
[INFO ] 2026-06-01 21:45:52.572 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:45:52.918 [14783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:45:58.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 21:45:58.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427287,ok=427287,error=0, records=41
[INFO ] 2026-06-01 21:46:07.573 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:46:07.923 [14828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:46:13.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 21:46:13.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427288,ok=427288,error=0, records=41
[INFO ] 2026-06-01 21:46:22.573 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:46:22.928 [14834] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:46:28.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 21:46:28.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427289,ok=427289,error=0, records=41
[INFO ] 2026-06-01 21:46:34.075 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17799/300s
[INFO ] 2026-06-01 21:46:34.076 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865160},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:46:34.306 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:46:34.306 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 21:46:34.306 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:46:34.306 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:46:34.306 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:46:34.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:46:34.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:46:34.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:46:34.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:46:34.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:46:34.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:46:35.779 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21366/300s
[INFO ] 2026-06-01 21:46:37.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:46:37.933 [14868] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:46:43.339 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21375/300s
[INFO ] 2026-06-01 21:46:43.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 21:46:43.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427290,ok=427290,error=0, records=41
[INFO ] 2026-06-01 21:46:52.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:46:52.939 [14834] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:46:58.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 21:46:58.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427291,ok=427291,error=0, records=41
[INFO ] 2026-06-01 21:47:07.575 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:47:07.575 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21378/300s
[WARN ] 2026-06-01 21:47:07.944 [14897] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:47:13.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 21:47:13.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427292,ok=427292,error=0, records=41
[INFO ] 2026-06-01 21:47:22.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:47:22.949 [14907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:47:28.610 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 21:47:28.610 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427293,ok=427293,error=0, records=41
[INFO ] 2026-06-01 21:47:37.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:47:37.953 [14929] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:47:43.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 21:47:43.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427294,ok=427294,error=0, records=41
[INFO ] 2026-06-01 21:47:49.752 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21376/300s
[INFO ] 2026-06-01 21:47:51.654 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21376/300s
[INFO ] 2026-06-01 21:47:52.577 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:47:52.957 [14914] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:47:58.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10149, records=41
[INFO ] 2026-06-01 21:47:58.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427295,ok=427295,error=0, records=41
[INFO ] 2026-06-01 21:47:58.860 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21376/300s
[INFO ] 2026-06-01 21:48:07.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:48:07.963 [14914] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:48:13.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 21:48:13.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427296,ok=427296,error=0, records=41
[INFO ] 2026-06-01 21:48:22.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:48:22.968 [14929] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:48:28.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 21:48:28.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427297,ok=427297,error=0, records=41
[INFO ] 2026-06-01 21:48:37.579 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:48:37.973 [14822] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:48:43.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 21:48:43.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427298,ok=427298,error=0, records=41
[INFO ] 2026-06-01 21:48:52.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:48:52.978 [14822] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:48:58.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 21:48:58.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427299,ok=427299,error=0, records=41
[INFO ] 2026-06-01 21:49:07.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:49:07.985 [14907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:49:13.651 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 21:49:13.651 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427300,ok=427300,error=0, records=41
[INFO ] 2026-06-01 21:49:22.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:49:22.991 [14914] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:49:28.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 21:49:28.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427301,ok=427301,error=0, records=41
[INFO ] 2026-06-01 21:49:34.308 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865076},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:49:34.473 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:49:34.474 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 21:49:34.474 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:49:34.474 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:49:34.474 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:49:34.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:49:34.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:49:34.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:49:34.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:49:34.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:49:34.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:49:37.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:49:37.995 [14914] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:49:43.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 21:49:43.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427302,ok=427302,error=0, records=41
[INFO ] 2026-06-01 21:49:52.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:49:53.000 [15054] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:49:58.666 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 21:49:58.666 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427303,ok=427303,error=0, records=41
[INFO ] 2026-06-01 21:50:01.443 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21380/300s
[INFO ] 2026-06-01 21:50:07.583 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:50:08.004 [15039] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:50:13.671 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-01 21:50:13.671 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427304,ok=427304,error=0, records=41
[INFO ] 2026-06-01 21:50:15.506 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21371/300s
[INFO ] 2026-06-01 21:50:22.584 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:50:23.009 [15087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:50:28.678 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-01 21:50:28.678 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427305,ok=427305,error=0, records=41
[INFO ] 2026-06-01 21:50:37.584 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:50:38.014 [15087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:50:42.210 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21380/300s
[INFO ] 2026-06-01 21:50:43.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-01 21:50:43.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427306,ok=427306,error=0, records=41
[INFO ] 2026-06-01 21:50:43.683 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21367/300s
[INFO ] 2026-06-01 21:50:52.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:50:53.018 [15101] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:50:58.688 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-01 21:50:58.688 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427307,ok=427307,error=0, records=41
[INFO ] 2026-06-01 21:51:07.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:51:08.024 [15073] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:51:13.693 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-01 21:51:13.693 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427308,ok=427308,error=0, records=41
[INFO ] 2026-06-01 21:51:22.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:51:23.030 [15101] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:51:28.698 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-01 21:51:28.698 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427309,ok=427309,error=0, records=41
[INFO ] 2026-06-01 21:51:35.963 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21367/300s
[INFO ] 2026-06-01 21:51:37.587 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:51:38.035 [15143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:51:43.394 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21376/300s
[INFO ] 2026-06-01 21:51:43.705 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 21:51:43.705 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427310,ok=427310,error=0, records=41
[INFO ] 2026-06-01 21:51:52.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:51:53.039 [15162] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:51:58.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-01 21:51:58.711 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427311,ok=427311,error=0, records=41
[INFO ] 2026-06-01 21:52:07.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:52:07.588 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21379/300s
[WARN ] 2026-06-01 21:52:08.045 [15143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:52:13.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 21:52:13.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427312,ok=427312,error=0, records=41
[INFO ] 2026-06-01 21:52:22.589 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:52:23.050 [15207] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:52:28.721 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 21:52:28.721 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427313,ok=427313,error=0, records=41
[INFO ] 2026-06-01 21:52:34.474 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17800/300s
[INFO ] 2026-06-01 21:52:34.475 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20865000},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:52:34.651 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:52:34.651 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 21:52:34.651 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:52:34.651 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:52:34.651 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:52:34.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:52:34.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:52:34.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:52:34.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:52:34.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:52:34.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 21:52:37.555 [15212] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:52:37.589 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:52:43.726 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 21:52:43.726 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427314,ok=427314,error=0, records=41
[INFO ] 2026-06-01 21:52:49.817 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21377/300s
[INFO ] 2026-06-01 21:52:51.718 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21377/300s
[WARN ] 2026-06-01 21:52:52.560 [15191] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:52:52.590 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:52:58.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 21:52:58.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427315,ok=427315,error=0, records=41
[INFO ] 2026-06-01 21:52:58.925 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21377/300s
[WARN ] 2026-06-01 21:53:07.566 [15252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:53:07.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:53:13.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 21:53:13.737 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427316,ok=427316,error=0, records=41
[WARN ] 2026-06-01 21:53:22.570 [15276] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:53:22.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:53:28.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 21:53:28.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427317,ok=427317,error=0, records=41
[WARN ] 2026-06-01 21:53:37.575 [15281] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:53:37.592 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 21:53:37.592 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 21:53:43.749 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 21:53:43.749 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427318,ok=427318,error=0, records=41
[WARN ] 2026-06-01 21:53:52.581 [15305] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:53:52.593 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:53:52.593 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 21:53:58.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 21:53:58.754 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427319,ok=427319,error=0, records=41
[WARN ] 2026-06-01 21:54:07.586 [15299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:54:07.594 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:54:13.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 21:54:13.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427320,ok=427320,error=0, records=41
[WARN ] 2026-06-01 21:54:22.591 [15335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:54:22.595 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:54:28.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 21:54:28.767 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427321,ok=427321,error=0, records=41
[INFO ] 2026-06-01 21:54:37.595 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:54:37.596 [15352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:54:43.773 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 21:54:43.773 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427322,ok=427322,error=0, records=41
[INFO ] 2026-06-01 21:54:52.596 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:54:52.601 [15347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:54:58.779 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 21:54:58.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427323,ok=427323,error=0, records=41
[INFO ] 2026-06-01 21:55:01.446 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21381/300s
[INFO ] 2026-06-01 21:55:07.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:55:07.606 [15352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:55:13.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 21:55:13.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427324,ok=427324,error=0, records=41
[INFO ] 2026-06-01 21:55:15.609 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21372/300s
[INFO ] 2026-06-01 21:55:22.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:55:22.611 [15352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:55:28.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 21:55:28.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427325,ok=427325,error=0, records=41
[INFO ] 2026-06-01 21:55:34.653 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864924},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:55:34.789 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:55:34.789 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 21:55:34.789 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:55:34.789 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:55:34.789 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:55:34.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:55:34.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:55:34.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:55:34.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:55:34.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:55:34.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:55:37.598 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:55:37.616 [15335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:55:42.216 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21381/300s
[INFO ] 2026-06-01 21:55:43.795 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 21:55:43.795 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427326,ok=427326,error=0, records=41
[INFO ] 2026-06-01 21:55:43.795 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21368/300s
[INFO ] 2026-06-01 21:55:52.598 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:55:52.620 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:55:58.800 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 21:55:58.800 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427327,ok=427327,error=0, records=41
[INFO ] 2026-06-01 21:56:07.599 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:56:07.625 [15347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:56:13.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 21:56:13.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427328,ok=427328,error=0, records=41
[INFO ] 2026-06-01 21:56:22.600 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:56:22.629 [15347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:56:28.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 21:56:28.814 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427329,ok=427329,error=0, records=41
[INFO ] 2026-06-01 21:56:36.143 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21368/300s
[INFO ] 2026-06-01 21:56:37.600 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:56:37.635 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:56:43.452 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21377/300s
[INFO ] 2026-06-01 21:56:43.818 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 21:56:43.818 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427330,ok=427330,error=0, records=41
[INFO ] 2026-06-01 21:56:52.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:56:52.639 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:56:58.825 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 21:56:58.825 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427331,ok=427331,error=0, records=41
[INFO ] 2026-06-01 21:57:07.602 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 21:57:07.602 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21380/300s
[WARN ] 2026-06-01 21:57:07.644 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:57:13.832 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 21:57:13.832 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427332,ok=427332,error=0, records=41
[INFO ] 2026-06-01 21:57:22.602 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:57:22.650 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:57:28.838 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 21:57:28.839 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427333,ok=427333,error=0, records=41
[INFO ] 2026-06-01 21:57:37.603 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:57:37.657 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:57:43.843 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 21:57:43.843 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427334,ok=427334,error=0, records=41
[INFO ] 2026-06-01 21:57:49.887 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21378/300s
[INFO ] 2026-06-01 21:57:51.788 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21378/300s
[INFO ] 2026-06-01 21:57:52.604 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:57:52.662 [15352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:57:58.851 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 21:57:58.851 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427335,ok=427335,error=0, records=41
[INFO ] 2026-06-01 21:57:58.995 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21378/300s
[INFO ] 2026-06-01 21:58:07.604 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:58:07.667 [15332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:58:13.857 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 21:58:13.858 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427336,ok=427336,error=0, records=41
[INFO ] 2026-06-01 21:58:22.605 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:58:22.672 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:58:28.864 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 21:58:28.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427337,ok=427337,error=0, records=41
[INFO ] 2026-06-01 21:58:34.790 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17801/300s
[INFO ] 2026-06-01 21:58:34.791 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864852},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 21:58:34.970 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 21:58:34.970 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 21:58:34.971 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 21:58:34.971 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 21:58:34.971 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 21:58:35.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:58:35.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 21:58:35.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:58:35.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 21:58:35.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:58:35.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 21:58:37.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:58:37.676 [15347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:58:43.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 21:58:43.890 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427338,ok=427338,error=0, records=41
[INFO ] 2026-06-01 21:58:52.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:58:52.681 [15347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:58:58.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 21:58:58.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427339,ok=427339,error=0, records=41
[INFO ] 2026-06-01 21:59:07.607 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:59:07.688 [15347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:59:13.933 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 21:59:13.933 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427340,ok=427340,error=0, records=41
[INFO ] 2026-06-01 21:59:22.607 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:59:22.693 [15335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:59:28.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 21:59:28.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427341,ok=427341,error=0, records=41
[INFO ] 2026-06-01 21:59:37.608 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:59:37.698 [15352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:59:43.945 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 21:59:43.946 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427342,ok=427342,error=0, records=41
[INFO ] 2026-06-01 21:59:52.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 21:59:52.704 [15335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 21:59:58.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 21:59:58.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427343,ok=427343,error=0, records=41
[INFO ] 2026-06-01 22:00:01.450 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21382/300s
[INFO ] 2026-06-01 22:00:07.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:00:07.709 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:00:13.957 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 22:00:13.957 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427344,ok=427344,error=0, records=41
[INFO ] 2026-06-01 22:00:15.711 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21373/300s
[INFO ] 2026-06-01 22:00:22.610 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:00:22.714 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:00:28.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-01 22:00:28.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427345,ok=427345,error=0, records=41
[INFO ] 2026-06-01 22:00:37.611 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:00:37.719 [15347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:00:42.222 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21382/300s
[INFO ] 2026-06-01 22:00:43.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 22:00:43.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427346,ok=427346,error=0, records=41
[INFO ] 2026-06-01 22:00:43.969 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21369/300s
[INFO ] 2026-06-01 22:00:52.611 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:00:52.723 [15335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:00:58.974 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 22:00:58.974 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427347,ok=427347,error=0, records=41
[INFO ] 2026-06-01 22:01:07.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:01:07.728 [15347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:01:13.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 22:01:13.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427348,ok=427348,error=0, records=41
[INFO ] 2026-06-01 22:01:22.613 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:01:22.733 [15332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:01:28.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 22:01:28.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427349,ok=427349,error=0, records=41
[INFO ] 2026-06-01 22:01:34.972 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864768},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:01:35.114 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:01:35.114 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 22:01:35.115 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:01:35.115 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:01:35.115 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:01:35.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:01:35.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:01:35.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:01:35.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:01:35.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:01:35.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:01:36.313 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21369/300s
[INFO ] 2026-06-01 22:01:37.613 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:01:37.738 [15335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:01:43.508 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21378/300s
[INFO ] 2026-06-01 22:01:43.998 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 22:01:43.998 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427350,ok=427350,error=0, records=41
[INFO ] 2026-06-01 22:01:52.614 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:01:52.743 [15352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:01:59.007 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 22:01:59.007 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427351,ok=427351,error=0, records=41
[INFO ] 2026-06-01 22:02:07.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:02:07.615 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21381/300s
[WARN ] 2026-06-01 22:02:07.748 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:02:14.014 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-01 22:02:14.014 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427352,ok=427352,error=0, records=41
[INFO ] 2026-06-01 22:02:22.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:02:22.754 [15347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:02:29.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 22:02:29.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427353,ok=427353,error=0, records=41
[INFO ] 2026-06-01 22:02:37.616 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:02:37.759 [15332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:02:44.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-01 22:02:44.023 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427354,ok=427354,error=0, records=41
[INFO ] 2026-06-01 22:02:49.952 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21379/300s
[INFO ] 2026-06-01 22:02:51.854 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21379/300s
[INFO ] 2026-06-01 22:02:52.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:02:52.764 [15335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:02:59.030 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-01 22:02:59.030 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427355,ok=427355,error=0, records=41
[INFO ] 2026-06-01 22:02:59.060 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21379/300s
[INFO ] 2026-06-01 22:03:07.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:03:07.770 [15335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:03:14.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-01 22:03:14.036 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427356,ok=427356,error=0, records=41
[INFO ] 2026-06-01 22:03:22.618 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:03:22.774 [15332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:03:29.090 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 22:03:29.090 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427357,ok=427357,error=0, records=41
[INFO ] 2026-06-01 22:03:37.618 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 22:03:37.619 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 22:03:37.780 [15352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:03:44.096 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 22:03:44.096 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427358,ok=427358,error=0, records=41
[INFO ] 2026-06-01 22:03:52.619 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:03:52.786 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:03:59.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-01 22:03:59.105 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427359,ok=427359,error=0, records=41
[INFO ] 2026-06-01 22:04:07.620 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:04:07.791 [15347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:04:14.110 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 22:04:14.110 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427360,ok=427360,error=0, records=41
[INFO ] 2026-06-01 22:04:22.620 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:04:22.796 [15352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:04:29.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 22:04:29.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427361,ok=427361,error=0, records=41
[INFO ] 2026-06-01 22:04:35.115 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17802/300s
[INFO ] 2026-06-01 22:04:35.116 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864684},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:04:35.271 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:04:35.271 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 22:04:35.271 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:04:35.271 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:04:35.271 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:04:35.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:04:35.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:04:35.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:04:35.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:04:35.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:04:35.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:04:37.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:04:37.803 [15909] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:04:44.128 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 22:04:44.128 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427362,ok=427362,error=0, records=41
[INFO ] 2026-06-01 22:04:52.622 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:04:52.808 [15332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:04:59.133 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 22:04:59.133 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427363,ok=427363,error=0, records=41
[INFO ] 2026-06-01 22:05:01.454 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21383/300s
[INFO ] 2026-06-01 22:05:07.622 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:05:07.814 [15347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:05:14.140 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 22:05:14.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427364,ok=427364,error=0, records=41
[INFO ] 2026-06-01 22:05:15.816 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21374/300s
[INFO ] 2026-06-01 22:05:22.623 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:05:22.819 [15947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:05:29.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 22:05:29.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427365,ok=427365,error=0, records=41
[INFO ] 2026-06-01 22:05:37.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:05:37.824 [15947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:05:42.229 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21383/300s
[INFO ] 2026-06-01 22:05:44.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 22:05:44.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427366,ok=427366,error=0, records=41
[INFO ] 2026-06-01 22:05:44.154 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21370/300s
[INFO ] 2026-06-01 22:05:52.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:05:52.829 [15909] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:05:59.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 22:05:59.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427367,ok=427367,error=0, records=41
[INFO ] 2026-06-01 22:06:07.625 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:06:07.834 [15966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:06:14.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 22:06:14.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427368,ok=427368,error=0, records=41
[INFO ] 2026-06-01 22:06:22.626 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:06:22.839 [15981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:06:29.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 22:06:29.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427369,ok=427369,error=0, records=41
[INFO ] 2026-06-01 22:06:36.498 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21370/300s
[INFO ] 2026-06-01 22:06:37.626 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:06:37.844 [15952] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:06:43.565 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21379/300s
[INFO ] 2026-06-01 22:06:44.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 22:06:44.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427370,ok=427370,error=0, records=41
[INFO ] 2026-06-01 22:06:52.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:06:52.850 [16031] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:06:59.182 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 22:06:59.182 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427371,ok=427371,error=0, records=41
[INFO ] 2026-06-01 22:07:07.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:07:07.628 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21382/300s
[WARN ] 2026-06-01 22:07:07.856 [16045] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:07:14.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 22:07:14.190 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427372,ok=427372,error=0, records=41
[INFO ] 2026-06-01 22:07:22.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:07:22.861 [16045] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:07:29.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 22:07:29.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427373,ok=427373,error=0, records=41
[INFO ] 2026-06-01 22:07:35.273 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864608},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:07:35.446 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:07:35.446 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 22:07:35.447 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:07:35.447 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:07:35.447 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:07:35.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:07:35.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:07:35.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:07:35.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:07:35.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:07:35.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:07:37.629 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:07:37.867 [15919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:07:44.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 22:07:44.262 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427374,ok=427374,error=0, records=41
[INFO ] 2026-06-01 22:07:50.013 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21380/300s
[INFO ] 2026-06-01 22:07:51.915 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21380/300s
[INFO ] 2026-06-01 22:07:52.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:07:52.872 [16072] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:07:59.121 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21380/300s
[INFO ] 2026-06-01 22:07:59.267 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 22:07:59.267 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427375,ok=427375,error=0, records=41
[INFO ] 2026-06-01 22:08:07.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:08:07.878 [16108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:08:14.272 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 22:08:14.272 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427376,ok=427376,error=0, records=41
[INFO ] 2026-06-01 22:08:22.631 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:08:22.883 [16102] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:08:29.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 22:08:29.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427377,ok=427377,error=0, records=41
[INFO ] 2026-06-01 22:08:37.631 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:08:37.887 [16136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:08:44.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-01 22:08:44.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427378,ok=427378,error=0, records=41
[INFO ] 2026-06-01 22:08:52.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:08:52.632 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 22:08:52.893 [16146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:08:59.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 22:08:59.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427379,ok=427379,error=0, records=41
[INFO ] 2026-06-01 22:09:07.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:09:07.900 [16173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:09:14.339 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 22:09:14.339 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427380,ok=427380,error=0, records=41
[INFO ] 2026-06-01 22:09:22.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:09:22.906 [16188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:09:29.345 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 22:09:29.345 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427381,ok=427381,error=0, records=41
[INFO ] 2026-06-01 22:09:37.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:09:37.911 [16146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:09:44.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 22:09:44.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427382,ok=427382,error=0, records=41
[INFO ] 2026-06-01 22:09:52.636 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:09:52.917 [16146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:09:59.359 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 22:09:59.359 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427383,ok=427383,error=0, records=41
[INFO ] 2026-06-01 22:10:01.457 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21384/300s
[INFO ] 2026-06-01 22:10:07.636 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:10:07.922 [16199] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:10:14.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 22:10:14.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427384,ok=427384,error=0, records=41
[INFO ] 2026-06-01 22:10:15.925 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21375/300s
[INFO ] 2026-06-01 22:10:22.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:10:22.928 [16258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:10:29.369 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 22:10:29.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427385,ok=427385,error=0, records=41
[INFO ] 2026-06-01 22:10:35.447 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17803/300s
[INFO ] 2026-06-01 22:10:35.448 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864536},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:10:35.609 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:10:35.609 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 22:10:35.609 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:10:35.609 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:10:35.609 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:10:35.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:10:35.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:10:35.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:10:35.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:10:35.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:10:35.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:10:37.638 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:10:37.933 [16276] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:10:42.236 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21384/300s
[INFO ] 2026-06-01 22:10:44.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 22:10:44.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427386,ok=427386,error=0, records=41
[INFO ] 2026-06-01 22:10:44.378 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21371/300s
[INFO ] 2026-06-01 22:10:52.638 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:10:52.938 [16269] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:10:59.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 22:10:59.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427387,ok=427387,error=0, records=41
[INFO ] 2026-06-01 22:11:07.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:11:07.944 [16276] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:11:14.388 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 22:11:14.388 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427388,ok=427388,error=0, records=41
[INFO ] 2026-06-01 22:11:22.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:11:22.950 [16293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:11:29.396 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 22:11:29.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427389,ok=427389,error=0, records=41
[INFO ] 2026-06-01 22:11:36.679 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21371/300s
[INFO ] 2026-06-01 22:11:37.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:11:37.955 [16336] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:11:43.619 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21380/300s
[INFO ] 2026-06-01 22:11:44.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-01 22:11:44.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427390,ok=427390,error=0, records=41
[INFO ] 2026-06-01 22:11:52.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:11:52.961 [16350] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:11:59.411 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-01 22:11:59.411 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427391,ok=427391,error=0, records=41
[INFO ] 2026-06-01 22:12:07.641 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:12:07.641 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21383/300s
[WARN ] 2026-06-01 22:12:07.965 [16321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:12:14.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 22:12:14.417 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427392,ok=427392,error=0, records=41
[INFO ] 2026-06-01 22:12:22.642 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:12:22.970 [16378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:12:29.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 22:12:29.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427393,ok=427393,error=0, records=41
[INFO ] 2026-06-01 22:12:37.642 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:12:37.975 [16392] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:12:44.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 22:12:44.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427394,ok=427394,error=0, records=41
[INFO ] 2026-06-01 22:12:50.082 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21381/300s
[INFO ] 2026-06-01 22:12:51.984 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21381/300s
[INFO ] 2026-06-01 22:12:52.643 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:12:52.979 [16321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:12:59.190 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21381/300s
[INFO ] 2026-06-01 22:12:59.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 22:12:59.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427395,ok=427395,error=0, records=41
[INFO ] 2026-06-01 22:13:07.644 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:13:07.985 [16315] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:13:14.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-01 22:13:14.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427396,ok=427396,error=0, records=41
[INFO ] 2026-06-01 22:13:22.644 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:13:22.990 [16315] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:13:29.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 22:13:29.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427397,ok=427397,error=0, records=41
[INFO ] 2026-06-01 22:13:35.611 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864460},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:13:35.781 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:13:35.781 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 22:13:35.781 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:13:35.781 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:13:35.781 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:13:35.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:13:35.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:13:35.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:13:35.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:13:35.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:13:35.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:13:37.645 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 22:13:37.645 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 22:13:37.994 [16315] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:13:44.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 22:13:44.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427398,ok=427398,error=0, records=41
[INFO ] 2026-06-01 22:13:52.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:13:52.999 [16434] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:13:59.466 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 22:13:59.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427399,ok=427399,error=0, records=41
[INFO ] 2026-06-01 22:14:07.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:14:08.005 [16448] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:14:14.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 22:14:14.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427400,ok=427400,error=0, records=41
[INFO ] 2026-06-01 22:14:22.647 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:14:23.012 [16448] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:14:29.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 22:14:29.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427401,ok=427401,error=0, records=41
[INFO ] 2026-06-01 22:14:37.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:14:38.016 [16321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:14:44.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 22:14:44.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427402,ok=427402,error=0, records=41
[INFO ] 2026-06-01 22:14:52.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:14:53.021 [16518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:14:59.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 22:14:59.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427403,ok=427403,error=0, records=41
[INFO ] 2026-06-01 22:15:01.461 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21385/300s
[INFO ] 2026-06-01 22:15:07.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:15:08.025 [16321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:15:14.497 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 22:15:14.497 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427404,ok=427404,error=0, records=41
[INFO ] 2026-06-01 22:15:16.027 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21376/300s
[INFO ] 2026-06-01 22:15:22.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:15:23.030 [16532] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:15:29.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 22:15:29.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427405,ok=427405,error=0, records=41
[INFO ] 2026-06-01 22:15:37.650 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:15:38.035 [16560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:15:42.242 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21385/300s
[INFO ] 2026-06-01 22:15:44.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 22:15:44.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427406,ok=427406,error=0, records=41
[INFO ] 2026-06-01 22:15:44.515 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21372/300s
[INFO ] 2026-06-01 22:15:52.651 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:15:53.040 [16580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:15:59.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 22:15:59.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427407,ok=427407,error=0, records=41
[INFO ] 2026-06-01 22:16:07.651 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:16:08.046 [16518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:16:14.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 22:16:14.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427408,ok=427408,error=0, records=41
[INFO ] 2026-06-01 22:16:22.652 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:16:23.052 [16593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:16:29.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 22:16:29.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427409,ok=427409,error=0, records=41
[INFO ] 2026-06-01 22:16:35.782 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17804/300s
[INFO ] 2026-06-01 22:16:35.783 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864380},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:16:35.948 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:16:35.948 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 22:16:35.948 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:16:35.948 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:16:35.948 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:16:35.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:16:35.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:16:35.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:16:35.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:16:35.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:16:35.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:16:36.864 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21372/300s
[WARN ] 2026-06-01 22:16:37.556 [16610] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:16:37.653 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:16:43.675 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21381/300s
[INFO ] 2026-06-01 22:16:44.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 22:16:44.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427410,ok=427410,error=0, records=41
[WARN ] 2026-06-01 22:16:52.562 [16623] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:16:52.653 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:16:59.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 22:16:59.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427411,ok=427411,error=0, records=41
[WARN ] 2026-06-01 22:17:07.566 [16660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:17:07.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:17:07.654 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21384/300s
[INFO ] 2026-06-01 22:17:14.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 22:17:14.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427412,ok=427412,error=0, records=41
[WARN ] 2026-06-01 22:17:22.572 [16683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:17:22.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:17:29.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 22:17:29.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427413,ok=427413,error=0, records=41
[WARN ] 2026-06-01 22:17:37.577 [16700] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:17:37.655 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:17:44.580 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 22:17:44.580 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427414,ok=427414,error=0, records=41
[INFO ] 2026-06-01 22:17:50.150 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21382/300s
[INFO ] 2026-06-01 22:17:52.051 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21382/300s
[WARN ] 2026-06-01 22:17:52.581 [16700] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:17:52.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:17:59.258 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21382/300s
[INFO ] 2026-06-01 22:17:59.585 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 22:17:59.585 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427415,ok=427415,error=0, records=41
[WARN ] 2026-06-01 22:18:07.588 [16666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:18:07.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:18:14.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 22:18:14.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427416,ok=427416,error=0, records=41
[WARN ] 2026-06-01 22:18:22.594 [16734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:18:22.657 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:18:29.598 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 22:18:29.598 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427417,ok=427417,error=0, records=41
[WARN ] 2026-06-01 22:18:37.598 [16728] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:18:37.658 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:18:44.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 22:18:44.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427418,ok=427418,error=0, records=41
[WARN ] 2026-06-01 22:18:52.604 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:18:52.658 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:18:59.609 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 22:18:59.609 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427419,ok=427419,error=0, records=41
[WARN ] 2026-06-01 22:19:07.611 [16734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:19:07.659 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:19:14.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 22:19:14.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427420,ok=427420,error=0, records=41
[WARN ] 2026-06-01 22:19:22.617 [16735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:19:22.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:19:29.713 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 22:19:29.713 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427421,ok=427421,error=0, records=41
[INFO ] 2026-06-01 22:19:35.950 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864304},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:19:36.114 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:19:36.114 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 22:19:36.114 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:19:36.114 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:19:36.114 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:19:36.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:19:36.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:19:36.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:19:36.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:19:36.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:19:36.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 22:19:37.621 [16761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:19:37.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:19:44.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 22:19:44.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427422,ok=427422,error=0, records=41
[WARN ] 2026-06-01 22:19:52.627 [16761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:19:52.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:19:59.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 22:19:59.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427423,ok=427423,error=0, records=41
[INFO ] 2026-06-01 22:20:01.464 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21386/300s
[WARN ] 2026-06-01 22:20:07.632 [16735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:20:07.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:20:14.746 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 22:20:14.746 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427424,ok=427424,error=0, records=41
[INFO ] 2026-06-01 22:20:16.134 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21377/300s
[WARN ] 2026-06-01 22:20:22.637 [16734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:20:22.662 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:20:29.751 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 22:20:29.751 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427425,ok=427425,error=0, records=41
[WARN ] 2026-06-01 22:20:37.642 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:20:37.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:20:42.250 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21386/300s
[INFO ] 2026-06-01 22:20:44.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 22:20:44.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427426,ok=427426,error=0, records=41
[INFO ] 2026-06-01 22:20:44.758 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21373/300s
[WARN ] 2026-06-01 22:20:52.647 [16734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:20:52.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:20:59.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 22:20:59.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427427,ok=427427,error=0, records=41
[WARN ] 2026-06-01 22:21:07.652 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:21:07.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:21:14.838 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 22:21:14.838 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427428,ok=427428,error=0, records=41
[WARN ] 2026-06-01 22:21:22.657 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:21:22.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:21:29.844 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 22:21:29.844 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427429,ok=427429,error=0, records=41
[INFO ] 2026-06-01 22:21:37.050 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21373/300s
[WARN ] 2026-06-01 22:21:37.662 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:21:37.665 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:21:43.734 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21382/300s
[INFO ] 2026-06-01 22:21:44.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 22:21:44.850 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427430,ok=427430,error=0, records=41
[INFO ] 2026-06-01 22:21:52.665 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:21:52.666 [16734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:21:59.857 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 22:21:59.857 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427431,ok=427431,error=0, records=41
[INFO ] 2026-06-01 22:22:07.666 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:22:07.666 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21385/300s
[WARN ] 2026-06-01 22:22:07.671 [16734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:22:14.862 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 22:22:14.862 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427432,ok=427432,error=0, records=41
[INFO ] 2026-06-01 22:22:22.667 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:22:22.675 [16746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:22:29.868 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-01 22:22:29.868 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427433,ok=427433,error=0, records=41
[INFO ] 2026-06-01 22:22:36.114 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17805/300s
[INFO ] 2026-06-01 22:22:36.116 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864224},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:22:36.276 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:22:36.276 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 22:22:36.276 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:22:36.276 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:22:36.276 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:22:36.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:22:36.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:22:36.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:22:36.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:22:36.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:22:36.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:22:37.667 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:22:37.681 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:22:44.874 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 22:22:44.874 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427434,ok=427434,error=0, records=41
[INFO ] 2026-06-01 22:22:50.218 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21383/300s
[INFO ] 2026-06-01 22:22:52.120 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21383/300s
[INFO ] 2026-06-01 22:22:52.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:22:52.685 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:22:59.327 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21383/300s
[INFO ] 2026-06-01 22:22:59.881 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 22:22:59.881 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427435,ok=427435,error=0, records=41
[INFO ] 2026-06-01 22:23:07.669 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:23:07.691 [16734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:23:14.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 22:23:14.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427436,ok=427436,error=0, records=41
[INFO ] 2026-06-01 22:23:22.669 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:23:22.696 [16735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:23:29.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 22:23:29.893 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427437,ok=427437,error=0, records=41
[INFO ] 2026-06-01 22:23:37.670 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 22:23:37.670 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 22:23:37.702 [16761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:23:44.899 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 22:23:44.899 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427438,ok=427438,error=0, records=41
[INFO ] 2026-06-01 22:23:52.671 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:23:52.671 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 22:23:52.708 [16761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:23:59.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 22:23:59.906 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427439,ok=427439,error=0, records=41
[INFO ] 2026-06-01 22:24:07.672 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:24:07.712 [16761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:24:14.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 22:24:14.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427440,ok=427440,error=0, records=41
[INFO ] 2026-06-01 22:24:22.673 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:24:22.717 [16735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:24:29.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 22:24:29.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427441,ok=427441,error=0, records=41
[INFO ] 2026-06-01 22:24:37.673 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:24:37.722 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:24:44.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 22:24:44.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427442,ok=427442,error=0, records=41
[INFO ] 2026-06-01 22:24:52.674 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:24:52.727 [16735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:24:59.928 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 22:24:59.928 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427443,ok=427443,error=0, records=41
[INFO ] 2026-06-01 22:25:01.467 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21387/300s
[INFO ] 2026-06-01 22:25:07.674 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:25:07.732 [16735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:25:14.934 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 22:25:14.934 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427444,ok=427444,error=0, records=41
[INFO ] 2026-06-01 22:25:16.235 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21378/300s
[INFO ] 2026-06-01 22:25:22.675 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:25:22.738 [16761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:25:29.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 22:25:29.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427445,ok=427445,error=0, records=41
[INFO ] 2026-06-01 22:25:36.278 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864148},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:25:36.454 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:25:36.454 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 22:25:36.455 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:25:36.455 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:25:36.455 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:25:36.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:25:36.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:25:36.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:25:36.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:25:36.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:25:36.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:25:37.676 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:25:37.742 [16761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:25:42.257 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21387/300s
[INFO ] 2026-06-01 22:25:44.946 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 22:25:44.946 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427446,ok=427446,error=0, records=41
[INFO ] 2026-06-01 22:25:44.946 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21374/300s
[INFO ] 2026-06-01 22:25:52.676 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:25:52.748 [16735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:25:59.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 22:25:59.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427447,ok=427447,error=0, records=41
[INFO ] 2026-06-01 22:26:07.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:26:07.754 [16735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:26:14.956 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-01 22:26:14.956 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427448,ok=427448,error=0, records=41
[INFO ] 2026-06-01 22:26:22.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:26:22.760 [16746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:26:29.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-01 22:26:29.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427449,ok=427449,error=0, records=41
[INFO ] 2026-06-01 22:26:37.236 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21374/300s
[INFO ] 2026-06-01 22:26:37.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:26:37.764 [16734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:26:43.789 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21383/300s
[INFO ] 2026-06-01 22:26:44.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 22:26:44.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427450,ok=427450,error=0, records=41
[INFO ] 2026-06-01 22:26:52.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:26:52.769 [16735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:26:59.975 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-01 22:26:59.975 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427451,ok=427451,error=0, records=41
[INFO ] 2026-06-01 22:27:07.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:27:07.679 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21386/300s
[WARN ] 2026-06-01 22:27:07.774 [16735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:27:14.982 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 22:27:14.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427452,ok=427452,error=0, records=41
[INFO ] 2026-06-01 22:27:22.680 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:27:22.778 [16734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:27:29.988 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 22:27:29.988 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427453,ok=427453,error=0, records=41
[INFO ] 2026-06-01 22:27:37.681 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:27:37.783 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:27:44.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 22:27:44.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427454,ok=427454,error=0, records=41
[INFO ] 2026-06-01 22:27:50.289 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21384/300s
[INFO ] 2026-06-01 22:27:52.191 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21384/300s
[INFO ] 2026-06-01 22:27:52.681 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:27:52.788 [16734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:27:59.398 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21384/300s
[INFO ] 2026-06-01 22:28:00.003 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 22:28:00.003 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427455,ok=427455,error=0, records=41
[INFO ] 2026-06-01 22:28:07.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:28:07.792 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:28:15.009 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 22:28:15.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427456,ok=427456,error=0, records=41
[INFO ] 2026-06-01 22:28:22.683 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:28:22.797 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:28:30.016 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 22:28:30.016 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427457,ok=427457,error=0, records=41
[INFO ] 2026-06-01 22:28:36.455 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17806/300s
[INFO ] 2026-06-01 22:28:36.457 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20864072},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:28:36.618 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:28:36.618 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 22:28:36.618 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:28:36.618 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:28:36.618 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:28:36.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:28:36.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:28:36.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:28:36.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:28:36.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:28:36.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:28:37.683 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:28:37.804 [16751] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:28:45.022 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 22:28:45.022 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427458,ok=427458,error=0, records=41
[INFO ] 2026-06-01 22:28:52.684 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:28:52.809 [16735] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:29:00.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 22:29:00.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427459,ok=427459,error=0, records=41
[INFO ] 2026-06-01 22:29:07.684 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:29:07.814 [17326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:29:15.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 22:29:15.036 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427460,ok=427460,error=0, records=41
[INFO ] 2026-06-01 22:29:22.685 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:29:22.819 [17326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:29:30.041 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 22:29:30.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427461,ok=427461,error=0, records=41
[INFO ] 2026-06-01 22:29:37.686 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:29:37.824 [17341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:29:45.050 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 22:29:45.050 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427462,ok=427462,error=0, records=41
[INFO ] 2026-06-01 22:29:52.687 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:29:52.830 [17341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:30:00.056 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 22:30:00.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427463,ok=427463,error=0, records=41
[INFO ] 2026-06-01 22:30:01.470 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21388/300s
[INFO ] 2026-06-01 22:30:07.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:30:07.835 [17387] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:30:15.066 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-01 22:30:15.066 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427464,ok=427464,error=0, records=41
[INFO ] 2026-06-01 22:30:16.337 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21379/300s
[INFO ] 2026-06-01 22:30:22.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:30:22.840 [17326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:30:30.071 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 22:30:30.071 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427465,ok=427465,error=0, records=41
[INFO ] 2026-06-01 22:30:37.689 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:30:37.844 [17410] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:30:42.263 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21388/300s
[INFO ] 2026-06-01 22:30:45.076 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-01 22:30:45.076 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427466,ok=427466,error=0, records=41
[INFO ] 2026-06-01 22:30:45.076 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21375/300s
[INFO ] 2026-06-01 22:30:52.689 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:30:52.848 [17321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:31:00.081 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 22:31:00.081 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427467,ok=427467,error=0, records=41
[INFO ] 2026-06-01 22:31:07.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:31:07.854 [17341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:31:15.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 22:31:15.089 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427468,ok=427468,error=0, records=41
[INFO ] 2026-06-01 22:31:22.691 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:31:22.859 [17321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:31:30.095 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-01 22:31:30.095 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427469,ok=427469,error=0, records=41
[INFO ] 2026-06-01 22:31:36.620 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863996},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:31:36.780 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:31:36.780 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-01 22:31:36.780 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:31:36.780 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:31:36.780 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:31:36.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:31:36.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:31:36.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:31:36.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:31:36.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:31:36.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:31:37.415 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21375/300s
[INFO ] 2026-06-01 22:31:37.691 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:31:37.864 [17438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:31:43.849 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21384/300s
[INFO ] 2026-06-01 22:31:45.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 22:31:45.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427470,ok=427470,error=0, records=41
[INFO ] 2026-06-01 22:31:52.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:31:52.869 [17424] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:32:00.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 22:32:00.114 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427471,ok=427471,error=0, records=41
[INFO ] 2026-06-01 22:32:07.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:32:07.693 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21387/300s
[WARN ] 2026-06-01 22:32:07.874 [17438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:32:15.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 22:32:15.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427472,ok=427472,error=0, records=41
[INFO ] 2026-06-01 22:32:22.693 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:32:22.880 [17321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:32:30.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 22:32:30.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427473,ok=427473,error=0, records=41
[INFO ] 2026-06-01 22:32:37.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:32:37.885 [17531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:32:45.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-01 22:32:45.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427474,ok=427474,error=0, records=41
[INFO ] 2026-06-01 22:32:50.364 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21385/300s
[INFO ] 2026-06-01 22:32:52.266 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21385/300s
[INFO ] 2026-06-01 22:32:52.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:32:52.890 [17542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:32:59.471 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21385/300s
[INFO ] 2026-06-01 22:33:00.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 22:33:00.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427475,ok=427475,error=0, records=41
[INFO ] 2026-06-01 22:33:07.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:33:07.896 [17548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:33:15.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 22:33:15.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427476,ok=427476,error=0, records=41
[INFO ] 2026-06-01 22:33:22.696 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:33:22.901 [17559] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:33:30.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 22:33:30.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427477,ok=427477,error=0, records=41
[INFO ] 2026-06-01 22:33:37.696 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 22:33:37.696 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 22:33:37.906 [17593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:33:45.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 22:33:45.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427478,ok=427478,error=0, records=41
[INFO ] 2026-06-01 22:33:52.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:33:52.912 [17608] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:34:00.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 22:34:00.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427479,ok=427479,error=0, records=41
[INFO ] 2026-06-01 22:34:07.698 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:34:07.918 [17626] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:34:15.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 22:34:15.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427480,ok=427480,error=0, records=41
[INFO ] 2026-06-01 22:34:22.698 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:34:22.923 [17608] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:34:30.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 22:34:30.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427481,ok=427481,error=0, records=41
[INFO ] 2026-06-01 22:34:36.780 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17807/300s
[INFO ] 2026-06-01 22:34:36.782 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863912},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:34:36.940 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:34:36.941 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 22:34:36.941 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:34:36.941 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:34:36.941 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:34:36.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:34:36.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:34:36.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:34:36.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:34:36.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:34:36.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:34:37.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:34:37.929 [17648] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:34:45.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 22:34:45.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427482,ok=427482,error=0, records=41
[INFO ] 2026-06-01 22:34:52.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:34:52.934 [17675] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:35:00.304 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 22:35:00.304 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427483,ok=427483,error=0, records=41
[INFO ] 2026-06-01 22:35:01.473 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21389/300s
[INFO ] 2026-06-01 22:35:07.700 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:35:07.940 [17685] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:35:15.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-01 22:35:15.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427484,ok=427484,error=0, records=41
[INFO ] 2026-06-01 22:35:16.442 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21380/300s
[INFO ] 2026-06-01 22:35:22.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:35:22.946 [17701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:35:30.319 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 22:35:30.319 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427485,ok=427485,error=0, records=41
[INFO ] 2026-06-01 22:35:37.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:35:37.951 [17724] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:35:42.269 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21389/300s
[INFO ] 2026-06-01 22:35:45.326 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 22:35:45.326 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427486,ok=427486,error=0, records=41
[INFO ] 2026-06-01 22:35:45.326 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21376/300s
[INFO ] 2026-06-01 22:35:52.702 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:35:52.955 [17724] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:36:00.332 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-01 22:36:00.332 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427487,ok=427487,error=0, records=41
[INFO ] 2026-06-01 22:36:07.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:36:07.960 [17724] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:36:15.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-01 22:36:15.337 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427488,ok=427488,error=0, records=41
[INFO ] 2026-06-01 22:36:22.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:36:22.964 [17701] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:36:30.344 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 22:36:30.344 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427489,ok=427489,error=0, records=41
[INFO ] 2026-06-01 22:36:37.602 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21376/300s
[INFO ] 2026-06-01 22:36:37.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:36:37.968 [17724] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:36:43.903 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21385/300s
[INFO ] 2026-06-01 22:36:45.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 22:36:45.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427490,ok=427490,error=0, records=41
[INFO ] 2026-06-01 22:36:52.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:36:52.972 [17794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:37:00.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 22:37:00.357 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427491,ok=427491,error=0, records=41
[INFO ] 2026-06-01 22:37:07.705 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:37:07.705 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21388/300s
[WARN ] 2026-06-01 22:37:07.976 [17738] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:37:15.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 22:37:15.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427492,ok=427492,error=0, records=41
[INFO ] 2026-06-01 22:37:22.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:37:22.980 [17794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:37:30.366 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 22:37:30.366 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427493,ok=427493,error=0, records=41
[INFO ] 2026-06-01 22:37:36.942 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863840},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:37:37.112 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:37:37.112 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 22:37:37.112 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:37:37.112 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:37:37.112 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:37:37.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:37:37.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:37:37.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:37:37.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:37:37.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:37:37.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:37:37.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:37:37.985 [17794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:37:45.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 22:37:45.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427494,ok=427494,error=0, records=41
[INFO ] 2026-06-01 22:37:50.428 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21386/300s
[INFO ] 2026-06-01 22:37:52.330 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21386/300s
[INFO ] 2026-06-01 22:37:52.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:37:52.989 [17851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:37:59.536 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21386/300s
[INFO ] 2026-06-01 22:38:00.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 22:38:00.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427495,ok=427495,error=0, records=41
[INFO ] 2026-06-01 22:38:07.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:38:07.997 [17836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:38:15.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 22:38:15.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427496,ok=427496,error=0, records=41
[INFO ] 2026-06-01 22:38:22.708 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:38:23.002 [17851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:38:30.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 22:38:30.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427497,ok=427497,error=0, records=41
[INFO ] 2026-06-01 22:38:37.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:38:38.006 [17851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:38:45.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 22:38:45.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427498,ok=427498,error=0, records=41
[INFO ] 2026-06-01 22:38:52.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:38:52.709 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 22:38:53.012 [17738] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:39:00.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 22:39:00.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427499,ok=427499,error=0, records=41
[INFO ] 2026-06-01 22:39:07.710 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:39:08.017 [17892] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:39:15.405 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 22:39:15.405 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427500,ok=427500,error=0, records=41
[INFO ] 2026-06-01 22:39:22.711 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:39:23.023 [17934] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:39:30.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 22:39:30.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427501,ok=427501,error=0, records=41
[INFO ] 2026-06-01 22:39:37.712 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:39:38.028 [17878] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:39:45.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 22:39:45.417 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427502,ok=427502,error=0, records=41
[INFO ] 2026-06-01 22:39:52.712 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:39:53.033 [17878] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:40:00.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 22:40:00.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427503,ok=427503,error=0, records=41
[INFO ] 2026-06-01 22:40:01.476 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21390/300s
[INFO ] 2026-06-01 22:40:07.713 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:40:08.038 [17836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:40:15.429 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-01 22:40:15.429 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427504,ok=427504,error=0, records=41
[INFO ] 2026-06-01 22:40:16.541 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21381/300s
[INFO ] 2026-06-01 22:40:22.714 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:40:23.044 [17983] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:40:30.434 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 22:40:30.434 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427505,ok=427505,error=0, records=41
[INFO ] 2026-06-01 22:40:37.112 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17808/300s
[INFO ] 2026-06-01 22:40:37.113 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863628},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:40:37.293 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:40:37.293 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 22:40:37.293 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:40:37.293 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:40:37.293 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:40:37.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:40:37.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:40:37.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:40:37.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:40:37.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:40:37.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:40:37.714 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:40:38.049 [18022] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:40:42.275 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21390/300s
[INFO ] 2026-06-01 22:40:45.439 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 22:40:45.439 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427506,ok=427506,error=0, records=41
[INFO ] 2026-06-01 22:40:45.439 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21377/300s
[WARN ] 2026-06-01 22:40:52.554 [18033] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:40:52.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:41:00.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-01 22:41:00.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427507,ok=427507,error=0, records=41
[WARN ] 2026-06-01 22:41:07.560 [17971] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:41:07.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:41:15.454 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 22:41:15.454 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427508,ok=427508,error=0, records=41
[WARN ] 2026-06-01 22:41:22.565 [18056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:41:22.716 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:41:30.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 22:41:30.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427509,ok=427509,error=0, records=41
[WARN ] 2026-06-01 22:41:37.570 [18074] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:41:37.717 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:41:37.786 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21377/300s
[INFO ] 2026-06-01 22:41:43.959 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21386/300s
[INFO ] 2026-06-01 22:41:45.466 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 22:41:45.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427510,ok=427510,error=0, records=41
[WARN ] 2026-06-01 22:41:52.574 [18111] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:41:52.717 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:42:00.482 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 22:42:00.482 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427511,ok=427511,error=0, records=41
[WARN ] 2026-06-01 22:42:07.580 [18110] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:42:07.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:42:07.718 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21389/300s
[INFO ] 2026-06-01 22:42:15.487 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 22:42:15.487 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427512,ok=427512,error=0, records=41
[WARN ] 2026-06-01 22:42:22.585 [18122] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:42:22.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:42:30.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 22:42:30.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427513,ok=427513,error=0, records=41
[WARN ] 2026-06-01 22:42:37.590 [18164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:42:37.719 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:42:45.497 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 22:42:45.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427514,ok=427514,error=0, records=41
[INFO ] 2026-06-01 22:42:50.481 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21387/300s
[INFO ] 2026-06-01 22:42:52.382 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21387/300s
[WARN ] 2026-06-01 22:42:52.594 [18164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:42:52.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:42:59.589 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21387/300s
[INFO ] 2026-06-01 22:43:00.504 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 22:43:00.504 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427515,ok=427515,error=0, records=41
[WARN ] 2026-06-01 22:43:07.600 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:43:07.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:43:15.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 22:43:15.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427516,ok=427516,error=0, records=41
[WARN ] 2026-06-01 22:43:22.604 [18189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:43:22.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:43:30.568 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 22:43:30.568 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427517,ok=427517,error=0, records=41
[INFO ] 2026-06-01 22:43:37.295 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863556},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:43:37.472 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:43:37.472 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 22:43:37.472 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:43:37.472 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:43:37.472 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:43:37.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:43:37.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:43:37.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:43:37.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:43:37.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:43:37.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 22:43:37.609 [18147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:43:37.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 22:43:37.721 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 22:43:45.573 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 22:43:45.574 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427518,ok=427518,error=0, records=41
[WARN ] 2026-06-01 22:43:52.614 [18146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:43:52.722 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:44:00.580 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 22:44:00.580 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427519,ok=427519,error=0, records=41
[WARN ] 2026-06-01 22:44:07.619 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:44:07.723 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:44:15.585 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-01 22:44:15.585 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427520,ok=427520,error=0, records=41
[WARN ] 2026-06-01 22:44:22.625 [18189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:44:22.723 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:44:30.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 22:44:30.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427521,ok=427521,error=0, records=41
[WARN ] 2026-06-01 22:44:37.630 [18147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:44:37.724 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:44:45.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-01 22:44:45.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427522,ok=427522,error=0, records=41
[WARN ] 2026-06-01 22:44:52.636 [18189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:44:52.725 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:45:00.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 22:45:00.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427523,ok=427523,error=0, records=41
[INFO ] 2026-06-01 22:45:01.480 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21391/300s
[WARN ] 2026-06-01 22:45:07.641 [18110] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:45:07.725 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:45:15.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 22:45:15.606 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427524,ok=427524,error=0, records=41
[INFO ] 2026-06-01 22:45:16.645 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21382/300s
[WARN ] 2026-06-01 22:45:22.647 [18146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:45:22.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:45:30.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 22:45:30.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427525,ok=427525,error=0, records=41
[WARN ] 2026-06-01 22:45:37.652 [18110] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:45:37.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:45:42.281 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21391/300s
[INFO ] 2026-06-01 22:45:45.616 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 22:45:45.616 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427526,ok=427526,error=0, records=41
[INFO ] 2026-06-01 22:45:45.617 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21378/300s
[WARN ] 2026-06-01 22:45:52.657 [18189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:45:52.727 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:46:00.623 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 22:46:00.623 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427527,ok=427527,error=0, records=41
[WARN ] 2026-06-01 22:46:07.662 [18189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:46:07.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:46:15.628 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 22:46:15.628 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427528,ok=427528,error=0, records=41
[WARN ] 2026-06-01 22:46:22.667 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:46:22.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:46:30.636 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 22:46:30.636 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427529,ok=427529,error=0, records=41
[INFO ] 2026-06-01 22:46:37.473 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17809/300s
[INFO ] 2026-06-01 22:46:37.474 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863480},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:46:37.627 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:46:37.628 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 22:46:37.628 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:46:37.628 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:46:37.628 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:46:37.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:46:37.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:46:37.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:46:37.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:46:37.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:46:37.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-01 22:46:37.672 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:46:37.729 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:46:37.970 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21378/300s
[INFO ] 2026-06-01 22:46:44.017 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21387/300s
[INFO ] 2026-06-01 22:46:45.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-01 22:46:45.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427530,ok=427530,error=0, records=41
[WARN ] 2026-06-01 22:46:52.677 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:46:52.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:47:00.647 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 22:47:00.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427531,ok=427531,error=0, records=41
[WARN ] 2026-06-01 22:47:07.682 [18147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:47:07.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:47:07.730 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21390/300s
[INFO ] 2026-06-01 22:47:15.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 22:47:15.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427532,ok=427532,error=0, records=41
[WARN ] 2026-06-01 22:47:22.687 [18189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:47:22.731 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:47:30.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 22:47:30.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427533,ok=427533,error=0, records=41
[WARN ] 2026-06-01 22:47:37.692 [18110] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:47:37.731 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:47:45.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-01 22:47:45.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427534,ok=427534,error=0, records=41
[INFO ] 2026-06-01 22:47:50.552 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21388/300s
[INFO ] 2026-06-01 22:47:52.454 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21388/300s
[WARN ] 2026-06-01 22:47:52.697 [18147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:47:52.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:47:59.661 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21388/300s
[INFO ] 2026-06-01 22:48:00.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 22:48:00.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427535,ok=427535,error=0, records=41
[WARN ] 2026-06-01 22:48:07.702 [18146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:48:07.733 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:48:15.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 22:48:15.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427536,ok=427536,error=0, records=41
[WARN ] 2026-06-01 22:48:22.707 [18110] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:48:22.733 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:48:30.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 22:48:30.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427537,ok=427537,error=0, records=41
[WARN ] 2026-06-01 22:48:37.712 [18147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:48:37.734 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:48:45.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 22:48:45.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427538,ok=427538,error=0, records=41
[WARN ] 2026-06-01 22:48:52.717 [18147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:48:52.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:49:00.708 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 22:49:00.708 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427539,ok=427539,error=0, records=41
[WARN ] 2026-06-01 22:49:07.722 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:49:07.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:49:15.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 22:49:15.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427540,ok=427540,error=0, records=41
[WARN ] 2026-06-01 22:49:22.727 [18110] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:49:22.736 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:49:30.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 22:49:30.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427541,ok=427541,error=0, records=41
[INFO ] 2026-06-01 22:49:37.629 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863404},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-01 22:49:37.732 [18147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:49:37.736 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-01 22:49:37.800 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:49:37.800 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 22:49:37.801 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:49:37.801 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:49:37.801 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:49:37.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:49:37.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:49:37.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:49:37.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:49:37.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:49:37.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:49:45.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 22:49:45.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427542,ok=427542,error=0, records=41
[WARN ] 2026-06-01 22:49:52.737 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:49:52.737 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:50:00.731 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 22:50:00.731 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427543,ok=427543,error=0, records=41
[INFO ] 2026-06-01 22:50:01.484 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21392/300s
[INFO ] 2026-06-01 22:50:07.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:50:07.742 [18110] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:50:15.736 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-01 22:50:15.736 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427544,ok=427544,error=0, records=41
[INFO ] 2026-06-01 22:50:16.745 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21383/300s
[INFO ] 2026-06-01 22:50:22.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:50:22.748 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:50:30.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-01 22:50:30.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427545,ok=427545,error=0, records=41
[INFO ] 2026-06-01 22:50:37.739 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:50:37.753 [18110] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:50:42.288 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21392/300s
[INFO ] 2026-06-01 22:50:45.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-01 22:50:45.748 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427546,ok=427546,error=0, records=41
[INFO ] 2026-06-01 22:50:45.748 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21379/300s
[INFO ] 2026-06-01 22:50:52.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:50:52.758 [18147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:51:00.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10379, records=41
[INFO ] 2026-06-01 22:51:00.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427547,ok=427547,error=0, records=41
[INFO ] 2026-06-01 22:51:07.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:51:07.763 [18146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:51:15.761 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 22:51:15.761 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427548,ok=427548,error=0, records=41
[INFO ] 2026-06-01 22:51:22.741 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:51:22.768 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:51:30.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 22:51:30.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427549,ok=427549,error=0, records=41
[INFO ] 2026-06-01 22:51:37.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:51:37.774 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:51:38.154 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21379/300s
[INFO ] 2026-06-01 22:51:44.074 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21388/300s
[INFO ] 2026-06-01 22:51:45.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 22:51:45.772 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427550,ok=427550,error=0, records=41
[INFO ] 2026-06-01 22:51:52.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:51:52.778 [18147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:52:00.778 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 22:52:00.778 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427551,ok=427551,error=0, records=41
[INFO ] 2026-06-01 22:52:07.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:52:07.743 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21391/300s
[WARN ] 2026-06-01 22:52:07.782 [18110] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:52:15.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-01 22:52:15.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427552,ok=427552,error=0, records=41
[INFO ] 2026-06-01 22:52:22.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:52:22.787 [18189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:52:30.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 22:52:30.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427553,ok=427553,error=0, records=41
[INFO ] 2026-06-01 22:52:37.744 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:52:37.793 [18146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:52:37.801 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17810/300s
[INFO ] 2026-06-01 22:52:37.802 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863328},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:52:37.981 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:52:37.981 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 22:52:37.981 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:52:37.982 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:52:37.982 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:52:38.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:52:38.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:52:38.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:52:38.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:52:38.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:52:38.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:52:45.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 22:52:45.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427554,ok=427554,error=0, records=41
[INFO ] 2026-06-01 22:52:50.624 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21389/300s
[INFO ] 2026-06-01 22:52:52.526 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21389/300s
[INFO ] 2026-06-01 22:52:52.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:52:52.798 [18146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:52:59.732 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21389/300s
[INFO ] 2026-06-01 22:53:00.872 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 22:53:00.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427555,ok=427555,error=0, records=41
[INFO ] 2026-06-01 22:53:07.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:53:07.803 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:53:15.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 22:53:15.877 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427556,ok=427556,error=0, records=41
[INFO ] 2026-06-01 22:53:22.746 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:53:22.808 [18734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:53:30.884 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 22:53:30.884 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427557,ok=427557,error=0, records=41
[INFO ] 2026-06-01 22:53:37.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 22:53:37.747 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 22:53:37.813 [18146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:53:45.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 22:53:45.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427558,ok=427558,error=0, records=41
[INFO ] 2026-06-01 22:53:52.748 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:53:52.748 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 22:53:52.818 [18756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:54:00.896 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 22:54:00.896 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427559,ok=427559,error=0, records=41
[INFO ] 2026-06-01 22:54:07.749 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:54:07.823 [18179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:54:15.901 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 22:54:15.901 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427560,ok=427560,error=0, records=41
[INFO ] 2026-06-01 22:54:22.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:54:22.829 [18799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:54:30.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 22:54:30.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427561,ok=427561,error=0, records=41
[INFO ] 2026-06-01 22:54:37.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:54:37.835 [18766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:54:45.912 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 22:54:45.912 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427562,ok=427562,error=0, records=41
[INFO ] 2026-06-01 22:54:52.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:54:52.839 [18766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:55:00.945 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 22:55:00.945 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427563,ok=427563,error=0, records=41
[INFO ] 2026-06-01 22:55:01.487 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21393/300s
[INFO ] 2026-06-01 22:55:07.752 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:55:07.844 [18785] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:55:16.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 22:55:16.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427564,ok=427564,error=0, records=41
[INFO ] 2026-06-01 22:55:16.847 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21384/300s
[INFO ] 2026-06-01 22:55:22.752 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:55:22.849 [18836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:55:31.034 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 22:55:31.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427565,ok=427565,error=0, records=41
[INFO ] 2026-06-01 22:55:37.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:55:37.855 [18836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:55:37.983 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863248},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:55:38.140 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:55:38.140 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 22:55:38.140 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:55:38.140 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:55:38.140 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:55:38.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:55:38.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:55:38.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:55:38.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:55:38.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:55:38.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:55:42.294 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21393/300s
[INFO ] 2026-06-01 22:55:46.039 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 22:55:46.039 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427566,ok=427566,error=0, records=41
[INFO ] 2026-06-01 22:55:46.040 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21380/300s
[INFO ] 2026-06-01 22:55:52.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:55:52.861 [18853] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:56:01.044 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 22:56:01.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427567,ok=427567,error=0, records=41
[INFO ] 2026-06-01 22:56:07.754 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:56:07.866 [18896] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:56:16.050 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-01 22:56:16.050 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427568,ok=427568,error=0, records=41
[INFO ] 2026-06-01 22:56:22.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:56:22.871 [18882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:56:31.056 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-01 22:56:31.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427569,ok=427569,error=0, records=41
[INFO ] 2026-06-01 22:56:37.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:56:37.875 [18813] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:56:38.327 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21380/300s
[INFO ] 2026-06-01 22:56:44.128 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21389/300s
[INFO ] 2026-06-01 22:56:46.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 22:56:46.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427570,ok=427570,error=0, records=41
[INFO ] 2026-06-01 22:56:52.756 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:56:52.880 [18939] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:57:01.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-01 22:57:01.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427571,ok=427571,error=0, records=41
[INFO ] 2026-06-01 22:57:07.756 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 22:57:07.756 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21392/300s
[WARN ] 2026-06-01 22:57:07.885 [18813] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:57:16.082 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 22:57:16.082 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427572,ok=427572,error=0, records=41
[INFO ] 2026-06-01 22:57:22.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:57:22.889 [18939] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:57:31.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 22:57:31.089 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427573,ok=427573,error=0, records=41
[INFO ] 2026-06-01 22:57:37.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:57:37.895 [18981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:57:46.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-01 22:57:46.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427574,ok=427574,error=0, records=41
[INFO ] 2026-06-01 22:57:50.677 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21390/300s
[INFO ] 2026-06-01 22:57:52.578 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21390/300s
[INFO ] 2026-06-01 22:57:52.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:57:52.900 [18971] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:57:59.783 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21390/300s
[INFO ] 2026-06-01 22:58:01.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-01 22:58:01.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427575,ok=427575,error=0, records=41
[INFO ] 2026-06-01 22:58:07.759 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:58:07.906 [19021] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:58:16.106 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-01 22:58:16.106 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427576,ok=427576,error=0, records=41
[INFO ] 2026-06-01 22:58:22.759 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:58:22.911 [19004] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:58:31.113 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 22:58:31.113 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427577,ok=427577,error=0, records=41
[INFO ] 2026-06-01 22:58:37.760 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:58:37.916 [19038] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:58:38.140 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17811/300s
[INFO ] 2026-06-01 22:58:38.142 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863172},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 22:58:38.305 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 22:58:38.305 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 22:58:38.306 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 22:58:38.306 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 22:58:38.306 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 22:58:38.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:58:38.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 22:58:38.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:58:38.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 22:58:38.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:58:38.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 22:58:46.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 22:58:46.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427578,ok=427578,error=0, records=41
[INFO ] 2026-06-01 22:58:52.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:58:52.921 [19047] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:59:01.206 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 22:59:01.206 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427579,ok=427579,error=0, records=41
[INFO ] 2026-06-01 22:59:07.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:59:07.926 [19088] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:59:16.212 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 22:59:16.212 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427580,ok=427580,error=0, records=41
[INFO ] 2026-06-01 22:59:22.762 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:59:22.930 [19100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:59:31.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-01 22:59:31.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427581,ok=427581,error=0, records=41
[INFO ] 2026-06-01 22:59:37.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:59:37.936 [19073] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 22:59:46.225 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 22:59:46.225 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427582,ok=427582,error=0, records=41
[INFO ] 2026-06-01 22:59:52.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 22:59:52.942 [19136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:00:01.232 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 23:00:01.232 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427583,ok=427583,error=0, records=41
[INFO ] 2026-06-01 23:00:01.491 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21394/300s
[INFO ] 2026-06-01 23:00:07.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:00:07.946 [19099] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:00:16.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 23:00:16.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427584,ok=427584,error=0, records=41
[INFO ] 2026-06-01 23:00:16.949 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21385/300s
[INFO ] 2026-06-01 23:00:22.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:00:22.952 [19152] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:00:31.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 23:00:31.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427585,ok=427585,error=0, records=41
[INFO ] 2026-06-01 23:00:37.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:00:37.956 [19099] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:00:42.300 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21394/300s
[INFO ] 2026-06-01 23:00:46.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 23:00:46.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427586,ok=427586,error=0, records=41
[INFO ] 2026-06-01 23:00:46.249 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21381/300s
[INFO ] 2026-06-01 23:00:52.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:00:52.966 [19099] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:01:01.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-01 23:01:01.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427587,ok=427587,error=0, records=41
[INFO ] 2026-06-01 23:01:07.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:01:07.970 [19093] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:01:16.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 23:01:16.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427588,ok=427588,error=0, records=41
[WARN ] 2026-06-01 23:01:17.474 [19323] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/9849/stat), No such file or directory
[WARN ] 2026-06-01 23:01:17.474 [19323] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10538/stat), No such file or directory
[INFO ] 2026-06-01 23:01:22.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:01:22.975 [19099] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:01:31.267 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 23:01:31.267 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427589,ok=427589,error=0, records=41
[WARN ] 2026-06-01 23:01:32.479 [19099] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/9849/stat), No such file or directory
[WARN ] 2026-06-01 23:01:32.479 [19099] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10538/stat), No such file or directory
[INFO ] 2026-06-01 23:01:37.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:01:37.981 [19351] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:01:38.307 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20863032},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:01:38.467 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:01:38.467 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 23:01:38.467 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:01:38.467 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:01:38.467 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:01:38.510 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21381/300s
[INFO ] 2026-06-01 23:01:38.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:01:38.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:01:38.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:01:38.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:01:38.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:01:38.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:01:44.190 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21390/300s
[INFO ] 2026-06-01 23:01:46.276 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-01 23:01:46.276 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427590,ok=427590,error=0, records=41
[WARN ] 2026-06-01 23:01:47.484 [19366] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/9849/stat), No such file or directory
[WARN ] 2026-06-01 23:01:47.484 [19366] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10538/stat), No such file or directory
[INFO ] 2026-06-01 23:01:52.768 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:01:52.986 [19153] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:02:01.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-01 23:02:01.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427591,ok=427591,error=0, records=41
[INFO ] 2026-06-01 23:02:07.768 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:02:07.768 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21393/300s
[WARN ] 2026-06-01 23:02:07.991 [19183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:02:16.290 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 23:02:16.290 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427592,ok=427592,error=0, records=41
[INFO ] 2026-06-01 23:02:22.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:02:22.996 [19099] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:02:31.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 23:02:31.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427593,ok=427593,error=0, records=41
[WARN ] 2026-06-01 23:02:32.501 [19099] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/13456/stat), No such file or directory
[WARN ] 2026-06-01 23:02:32.501 [19099] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/9848/stat), No such file or directory
[INFO ] 2026-06-01 23:02:37.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:02:38.001 [19153] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:02:46.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 23:02:46.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427594,ok=427594,error=0, records=41
[WARN ] 2026-06-01 23:02:47.506 [19408] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/13456/stat), No such file or directory
[WARN ] 2026-06-01 23:02:47.507 [19408] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/9848/stat), No such file or directory
[INFO ] 2026-06-01 23:02:50.721 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21391/300s
[INFO ] 2026-06-01 23:02:52.628 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21391/300s
[INFO ] 2026-06-01 23:02:52.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:02:53.007 [19431] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:02:59.827 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21391/300s
[INFO ] 2026-06-01 23:03:01.305 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 23:03:01.305 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427595,ok=427595,error=0, records=41
[INFO ] 2026-06-01 23:03:07.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:03:08.012 [19408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:03:16.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-01 23:03:16.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427596,ok=427596,error=0, records=41
[INFO ] 2026-06-01 23:03:22.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:03:23.017 [19460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:03:31.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 23:03:31.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427597,ok=427597,error=0, records=41
[INFO ] 2026-06-01 23:03:37.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 23:03:37.772 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 23:03:38.021 [19475] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:03:46.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-01 23:03:46.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427598,ok=427598,error=0, records=41
[INFO ] 2026-06-01 23:03:52.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:03:53.027 [19491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:04:01.331 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-01 23:04:01.331 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427599,ok=427599,error=0, records=41
[INFO ] 2026-06-01 23:04:07.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:04:08.033 [19183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:04:16.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-01 23:04:16.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427600,ok=427600,error=0, records=41
[INFO ] 2026-06-01 23:04:22.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:04:23.038 [19521] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:04:31.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 23:04:31.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427601,ok=427601,error=0, records=41
[INFO ] 2026-06-01 23:04:37.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:04:38.043 [19526] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:04:38.468 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17812/300s
[INFO ] 2026-06-01 23:04:38.469 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862932},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:04:38.626 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:04:38.626 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 23:04:38.626 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:04:38.626 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:04:38.626 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:04:38.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:04:38.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:04:38.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:04:38.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:04:38.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:04:38.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:04:46.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-01 23:04:46.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427602,ok=427602,error=0, records=41
[INFO ] 2026-06-01 23:04:52.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:04:53.048 [19548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:05:01.354 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 23:05:01.354 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427603,ok=427603,error=0, records=41
[INFO ] 2026-06-01 23:05:01.495 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21395/300s
[WARN ] 2026-06-01 23:05:07.555 [19548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:05:07.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:05:16.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-01 23:05:16.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427604,ok=427604,error=0, records=41
[INFO ] 2026-06-01 23:05:17.058 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21386/300s
[WARN ] 2026-06-01 23:05:22.561 [19578] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:05:22.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:05:31.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 23:05:31.368 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427605,ok=427605,error=0, records=41
[WARN ] 2026-06-01 23:05:37.565 [19612] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:05:37.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:05:42.307 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21395/300s
[INFO ] 2026-06-01 23:05:46.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 23:05:46.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427606,ok=427606,error=0, records=41
[INFO ] 2026-06-01 23:05:46.375 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21382/300s
[WARN ] 2026-06-01 23:05:52.570 [19625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:05:52.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:06:01.382 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 23:06:01.382 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427607,ok=427607,error=0, records=41
[WARN ] 2026-06-01 23:06:07.575 [19644] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:06:07.778 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:06:16.389 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 23:06:16.389 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427608,ok=427608,error=0, records=41
[WARN ] 2026-06-01 23:06:22.580 [19667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:06:22.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:06:31.396 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 23:06:31.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427609,ok=427609,error=0, records=41
[WARN ] 2026-06-01 23:06:37.588 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:06:37.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:06:38.691 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21382/300s
[INFO ] 2026-06-01 23:06:44.246 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21391/300s
[INFO ] 2026-06-01 23:06:46.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 23:06:46.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427610,ok=427610,error=0, records=41
[WARN ] 2026-06-01 23:06:52.593 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:06:52.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:07:01.411 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 23:07:01.411 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427611,ok=427611,error=0, records=41
[WARN ] 2026-06-01 23:07:07.597 [19714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:07:07.781 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:07:07.781 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21394/300s
[INFO ] 2026-06-01 23:07:16.416 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 23:07:16.416 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427612,ok=427612,error=0, records=41
[WARN ] 2026-06-01 23:07:22.603 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:07:22.781 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:07:31.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 23:07:31.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427613,ok=427613,error=0, records=41
[WARN ] 2026-06-01 23:07:37.608 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:07:37.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:07:38.627 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862860},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:07:38.779 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:07:38.779 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 23:07:38.779 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:07:38.779 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:07:38.779 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:07:38.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:07:38.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:07:38.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:07:38.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:07:38.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:07:38.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:07:46.434 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 23:07:46.434 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427614,ok=427614,error=0, records=41
[INFO ] 2026-06-01 23:07:50.775 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21392/300s
[WARN ] 2026-06-01 23:07:52.613 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:07:52.676 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21392/300s
[INFO ] 2026-06-01 23:07:52.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:07:59.880 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21392/300s
[INFO ] 2026-06-01 23:08:01.439 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 23:08:01.439 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427615,ok=427615,error=0, records=41
[WARN ] 2026-06-01 23:08:07.618 [19714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:08:07.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:08:16.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-01 23:08:16.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427616,ok=427616,error=0, records=41
[WARN ] 2026-06-01 23:08:22.624 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:08:22.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:08:31.449 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-01 23:08:31.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427617,ok=427617,error=0, records=41
[WARN ] 2026-06-01 23:08:37.629 [19667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:08:37.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:08:46.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-01 23:08:46.456 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427618,ok=427618,error=0, records=41
[WARN ] 2026-06-01 23:08:52.635 [19689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:08:52.785 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:08:52.785 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 23:09:01.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 23:09:01.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427619,ok=427619,error=0, records=41
[WARN ] 2026-06-01 23:09:07.640 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:09:07.786 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:09:16.467 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 23:09:16.467 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427620,ok=427620,error=0, records=41
[WARN ] 2026-06-01 23:09:22.645 [19667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:09:22.787 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:09:31.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-01 23:09:31.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427621,ok=427621,error=0, records=41
[WARN ] 2026-06-01 23:09:37.649 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:09:37.787 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:09:46.478 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-01 23:09:46.478 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427622,ok=427622,error=0, records=41
[WARN ] 2026-06-01 23:09:52.655 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:09:52.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:10:01.483 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10127, records=41
[INFO ] 2026-06-01 23:10:01.483 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427623,ok=427623,error=0, records=41
[INFO ] 2026-06-01 23:10:01.499 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21396/300s
[WARN ] 2026-06-01 23:10:07.660 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:10:07.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:10:16.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-01 23:10:16.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427624,ok=427624,error=0, records=41
[INFO ] 2026-06-01 23:10:17.162 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21387/300s
[WARN ] 2026-06-01 23:10:22.664 [19714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:10:22.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:10:31.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-01 23:10:31.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427625,ok=427625,error=0, records=41
[WARN ] 2026-06-01 23:10:37.669 [19689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:10:37.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:10:38.779 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17813/300s
[INFO ] 2026-06-01 23:10:38.781 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862772},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:10:38.946 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:10:38.946 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 23:10:38.946 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:10:38.946 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:10:38.946 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:10:38.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:10:38.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:10:38.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:10:38.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:10:38.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:10:38.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:10:42.313 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21396/300s
[INFO ] 2026-06-01 23:10:46.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-01 23:10:46.504 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427626,ok=427626,error=0, records=41
[INFO ] 2026-06-01 23:10:46.504 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21383/300s
[WARN ] 2026-06-01 23:10:52.673 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:10:52.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=28.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:11:01.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10354, records=41
[INFO ] 2026-06-01 23:11:01.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427627,ok=427627,error=0, records=41
[WARN ] 2026-06-01 23:11:07.677 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:11:07.791 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:11:16.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 23:11:16.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427628,ok=427628,error=0, records=41
[WARN ] 2026-06-01 23:11:22.682 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:11:22.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:11:31.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-01 23:11:31.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427629,ok=427629,error=0, records=41
[WARN ] 2026-06-01 23:11:37.688 [19714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:11:37.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:11:38.870 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21383/300s
[INFO ] 2026-06-01 23:11:44.297 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21392/300s
[INFO ] 2026-06-01 23:11:46.544 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 23:11:46.544 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427630,ok=427630,error=0, records=41
[WARN ] 2026-06-01 23:11:52.694 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:11:52.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:12:01.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 23:12:01.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427631,ok=427631,error=0, records=41
[WARN ] 2026-06-01 23:12:07.699 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:12:07.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:12:07.794 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21395/300s
[INFO ] 2026-06-01 23:12:16.562 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 23:12:16.562 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427632,ok=427632,error=0, records=41
[WARN ] 2026-06-01 23:12:22.704 [19714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:12:22.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:12:31.567 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 23:12:31.567 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427633,ok=427633,error=0, records=41
[WARN ] 2026-06-01 23:12:37.708 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:12:37.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:12:46.574 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 23:12:46.574 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427634,ok=427634,error=0, records=41
[INFO ] 2026-06-01 23:12:50.821 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21393/300s
[WARN ] 2026-06-01 23:12:52.713 [19667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:12:52.723 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21393/300s
[INFO ] 2026-06-01 23:12:52.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:12:59.929 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21393/300s
[INFO ] 2026-06-01 23:13:01.579 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 23:13:01.579 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427635,ok=427635,error=0, records=41
[WARN ] 2026-06-01 23:13:07.718 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:13:07.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:13:16.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 23:13:16.584 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427636,ok=427636,error=0, records=41
[WARN ] 2026-06-01 23:13:22.724 [19714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:13:22.797 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:13:31.591 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 23:13:31.591 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427637,ok=427637,error=0, records=41
[WARN ] 2026-06-01 23:13:37.730 [19714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:13:37.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 23:13:37.798 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 23:13:38.948 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862696},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:13:39.106 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:13:39.106 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 23:13:39.106 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:13:39.106 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:13:39.106 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:13:39.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:13:39.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:13:39.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:13:39.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:13:39.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:13:39.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:13:46.599 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 23:13:46.599 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427638,ok=427638,error=0, records=41
[WARN ] 2026-06-01 23:13:52.735 [19714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:13:52.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:14:01.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-01 23:14:01.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427639,ok=427639,error=0, records=41
[WARN ] 2026-06-01 23:14:07.742 [19714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:14:07.799 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:14:16.614 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 23:14:16.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427640,ok=427640,error=0, records=41
[WARN ] 2026-06-01 23:14:22.747 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:14:22.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:14:31.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 23:14:31.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427641,ok=427641,error=0, records=41
[WARN ] 2026-06-01 23:14:37.752 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:14:37.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:14:46.626 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 23:14:46.626 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427642,ok=427642,error=0, records=41
[WARN ] 2026-06-01 23:14:52.757 [19667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:14:52.801 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:15:01.502 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21397/300s
[INFO ] 2026-06-01 23:15:01.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 23:15:01.631 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427643,ok=427643,error=0, records=41
[WARN ] 2026-06-01 23:15:07.761 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:15:07.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:15:16.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 23:15:16.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427644,ok=427644,error=0, records=41
[INFO ] 2026-06-01 23:15:17.264 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21388/300s
[WARN ] 2026-06-01 23:15:22.766 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:15:22.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:15:31.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 23:15:31.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427645,ok=427645,error=0, records=41
[WARN ] 2026-06-01 23:15:37.771 [19678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:15:37.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:15:42.320 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21397/300s
[INFO ] 2026-06-01 23:15:46.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 23:15:46.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427646,ok=427646,error=0, records=41
[INFO ] 2026-06-01 23:15:46.669 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21384/300s
[WARN ] 2026-06-01 23:15:52.776 [19667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:15:52.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:16:01.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 23:16:01.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427647,ok=427647,error=0, records=41
[WARN ] 2026-06-01 23:16:07.780 [19689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:16:07.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:16:16.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 23:16:16.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427648,ok=427648,error=0, records=41
[WARN ] 2026-06-01 23:16:22.785 [19667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:16:22.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:16:31.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-01 23:16:31.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427649,ok=427649,error=0, records=41
[WARN ] 2026-06-01 23:16:37.790 [19689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:16:37.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:16:39.054 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21384/300s
[INFO ] 2026-06-01 23:16:39.106 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17814/300s
[INFO ] 2026-06-01 23:16:39.108 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862620},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:16:39.277 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:16:39.277 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 23:16:39.277 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:16:39.277 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:16:39.277 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:16:39.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:16:39.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:16:39.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:16:39.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:16:39.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:16:39.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:16:44.354 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21393/300s
[INFO ] 2026-06-01 23:16:46.697 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 23:16:46.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427650,ok=427650,error=0, records=41
[WARN ] 2026-06-01 23:16:52.795 [19689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:16:52.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:17:01.704 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-01 23:17:01.704 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427651,ok=427651,error=0, records=41
[WARN ] 2026-06-01 23:17:07.800 [19683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:17:07.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:17:07.807 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21396/300s
[INFO ] 2026-06-01 23:17:16.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-01 23:17:16.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427652,ok=427652,error=0, records=41
[WARN ] 2026-06-01 23:17:22.805 [19667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:17:22.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:17:31.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-01 23:17:31.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427653,ok=427653,error=0, records=41
[INFO ] 2026-06-01 23:17:37.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:17:37.811 [20276] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:17:46.722 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 23:17:46.722 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427654,ok=427654,error=0, records=41
[INFO ] 2026-06-01 23:17:50.888 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21394/300s
[INFO ] 2026-06-01 23:17:52.790 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21394/300s
[INFO ] 2026-06-01 23:17:52.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:17:52.816 [19667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:17:59.996 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21394/300s
[INFO ] 2026-06-01 23:18:01.728 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 23:18:01.728 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427655,ok=427655,error=0, records=41
[INFO ] 2026-06-01 23:18:07.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:18:07.821 [20260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:18:16.734 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-01 23:18:16.734 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427656,ok=427656,error=0, records=41
[INFO ] 2026-06-01 23:18:22.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:18:22.826 [20323] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:18:31.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 23:18:31.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427657,ok=427657,error=0, records=41
[INFO ] 2026-06-01 23:18:37.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:18:37.831 [20265] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:18:46.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 23:18:46.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427658,ok=427658,error=0, records=41
[INFO ] 2026-06-01 23:18:52.811 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:18:52.836 [20351] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:19:01.752 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-01 23:19:01.752 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427659,ok=427659,error=0, records=41
[INFO ] 2026-06-01 23:19:07.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:19:07.842 [19667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:19:16.763 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-01 23:19:16.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427660,ok=427660,error=0, records=41
[INFO ] 2026-06-01 23:19:22.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:19:22.847 [20323] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:19:31.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 23:19:31.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427661,ok=427661,error=0, records=41
[INFO ] 2026-06-01 23:19:37.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:19:37.853 [20351] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:19:39.279 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862548},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:19:39.445 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:19:39.445 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 23:19:39.445 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:19:39.445 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:19:39.445 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:19:39.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:19:39.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:19:39.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:19:39.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:19:39.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:19:39.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:19:46.775 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 23:19:46.775 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427662,ok=427662,error=0, records=41
[INFO ] 2026-06-01 23:19:52.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:19:52.858 [19667] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:20:01.506 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21398/300s
[INFO ] 2026-06-01 23:20:01.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-01 23:20:01.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427663,ok=427663,error=0, records=41
[INFO ] 2026-06-01 23:20:07.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:20:07.864 [20323] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:20:16.787 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-01 23:20:16.787 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427664,ok=427664,error=0, records=41
[INFO ] 2026-06-01 23:20:17.366 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21389/300s
[INFO ] 2026-06-01 23:20:22.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:20:22.869 [20265] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:20:31.793 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-01 23:20:31.793 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427665,ok=427665,error=0, records=41
[INFO ] 2026-06-01 23:20:37.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:20:37.875 [20375] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:20:42.327 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21398/300s
[INFO ] 2026-06-01 23:20:46.799 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-01 23:20:46.799 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427666,ok=427666,error=0, records=41
[INFO ] 2026-06-01 23:20:46.799 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21385/300s
[INFO ] 2026-06-01 23:20:52.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:20:52.880 [20464] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:21:01.805 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-01 23:21:01.805 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427667,ok=427667,error=0, records=41
[INFO ] 2026-06-01 23:21:07.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:21:07.885 [20474] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:21:16.809 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-01 23:21:16.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427668,ok=427668,error=0, records=41
[INFO ] 2026-06-01 23:21:22.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:21:22.891 [20502] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:21:31.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-01 23:21:31.814 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427669,ok=427669,error=0, records=41
[INFO ] 2026-06-01 23:21:37.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:21:37.895 [20519] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:21:39.238 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21385/300s
[INFO ] 2026-06-01 23:21:44.407 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21394/300s
[INFO ] 2026-06-01 23:21:46.820 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-01 23:21:46.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427670,ok=427670,error=0, records=41
[INFO ] 2026-06-01 23:21:52.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:21:52.903 [20513] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:22:01.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 23:22:01.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427671,ok=427671,error=0, records=41
[INFO ] 2026-06-01 23:22:07.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:22:07.819 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21397/300s
[WARN ] 2026-06-01 23:22:07.907 [20545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:22:16.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 23:22:16.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427672,ok=427672,error=0, records=41
[INFO ] 2026-06-01 23:22:22.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:22:22.912 [20567] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:22:31.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 23:22:31.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427673,ok=427673,error=0, records=41
[INFO ] 2026-06-01 23:22:37.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:22:37.916 [20561] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:22:39.445 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17815/300s
[INFO ] 2026-06-01 23:22:39.447 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862472},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:22:39.605 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:22:39.605 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 23:22:39.605 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:22:39.605 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:22:39.605 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:22:39.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:22:39.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:22:39.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:22:39.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:22:39.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:22:39.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:22:46.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 23:22:46.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427674,ok=427674,error=0, records=41
[INFO ] 2026-06-01 23:22:50.957 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21395/300s
[INFO ] 2026-06-01 23:22:52.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:22:52.859 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21395/300s
[WARN ] 2026-06-01 23:22:52.922 [20601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:23:00.065 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21395/300s
[INFO ] 2026-06-01 23:23:01.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 23:23:01.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427675,ok=427675,error=0, records=41
[INFO ] 2026-06-01 23:23:07.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:23:07.927 [20617] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:23:16.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 23:23:16.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427676,ok=427676,error=0, records=41
[INFO ] 2026-06-01 23:23:22.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:23:22.931 [20628] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:23:31.906 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 23:23:31.906 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427677,ok=427677,error=0, records=41
[INFO ] 2026-06-01 23:23:37.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 23:23:37.823 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 23:23:37.936 [20646] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:23:46.911 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 23:23:46.911 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427678,ok=427678,error=0, records=41
[INFO ] 2026-06-01 23:23:52.824 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:23:52.824 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-01 23:23:52.942 [20662] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:24:01.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-01 23:24:01.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427679,ok=427679,error=0, records=41
[INFO ] 2026-06-01 23:24:07.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:24:07.947 [20682] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:24:16.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-01 23:24:16.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427680,ok=427680,error=0, records=41
[INFO ] 2026-06-01 23:24:22.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:24:22.952 [20662] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:24:31.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-01 23:24:31.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427681,ok=427681,error=0, records=41
[INFO ] 2026-06-01 23:24:37.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:24:37.957 [20662] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:24:47.021 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-01 23:24:47.021 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427682,ok=427682,error=0, records=41
[INFO ] 2026-06-01 23:24:52.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:24:52.961 [20662] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:25:01.509 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21399/300s
[INFO ] 2026-06-01 23:25:02.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-01 23:25:02.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427683,ok=427683,error=0, records=41
[INFO ] 2026-06-01 23:25:07.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:25:07.967 [20706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:25:17.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-01 23:25:17.036 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427684,ok=427684,error=0, records=41
[INFO ] 2026-06-01 23:25:17.469 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21390/300s
[INFO ] 2026-06-01 23:25:22.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:25:22.971 [20734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:25:32.041 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 23:25:32.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427685,ok=427685,error=0, records=41
[INFO ] 2026-06-01 23:25:37.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:25:37.975 [20762] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:25:39.607 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862392},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:25:39.785 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:25:39.785 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 23:25:39.785 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:25:39.785 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:25:39.785 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:25:39.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:25:39.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:25:39.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:25:39.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:25:39.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:25:39.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:25:42.334 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21399/300s
[INFO ] 2026-06-01 23:25:47.046 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 23:25:47.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427686,ok=427686,error=0, records=41
[INFO ] 2026-06-01 23:25:47.046 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21386/300s
[INFO ] 2026-06-01 23:25:52.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:25:52.979 [20762] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:26:02.050 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 23:26:02.050 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427687,ok=427687,error=0, records=41
[INFO ] 2026-06-01 23:26:07.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:26:07.984 [20748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:26:17.057 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 23:26:17.057 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427688,ok=427688,error=0, records=41
[INFO ] 2026-06-01 23:26:22.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:26:22.990 [20748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:26:32.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 23:26:32.063 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427689,ok=427689,error=0, records=41
[INFO ] 2026-06-01 23:26:37.832 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:26:37.994 [20819] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:26:39.425 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21386/300s
[INFO ] 2026-06-01 23:26:44.468 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21395/300s
[INFO ] 2026-06-01 23:26:47.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-01 23:26:47.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427690,ok=427690,error=0, records=41
[INFO ] 2026-06-01 23:26:52.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:26:52.999 [20833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:27:02.074 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 23:27:02.075 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427691,ok=427691,error=0, records=41
[INFO ] 2026-06-01 23:27:07.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:27:07.833 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21398/300s
[WARN ] 2026-06-01 23:27:08.005 [20847] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:27:17.082 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-01 23:27:17.082 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427692,ok=427692,error=0, records=41
[INFO ] 2026-06-01 23:27:22.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:27:23.009 [20833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:27:32.087 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-01 23:27:32.087 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427693,ok=427693,error=0, records=41
[INFO ] 2026-06-01 23:27:37.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:27:38.014 [20861] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:27:47.092 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 23:27:47.092 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427694,ok=427694,error=0, records=41
[INFO ] 2026-06-01 23:27:51.027 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21396/300s
[INFO ] 2026-06-01 23:27:52.835 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:27:52.929 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21396/300s
[WARN ] 2026-06-01 23:27:53.019 [20889] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:28:00.136 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21396/300s
[INFO ] 2026-06-01 23:28:02.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-01 23:28:02.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427695,ok=427695,error=0, records=41
[INFO ] 2026-06-01 23:28:07.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:28:08.024 [20903] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:28:17.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 23:28:17.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427696,ok=427696,error=0, records=41
[INFO ] 2026-06-01 23:28:22.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:28:23.029 [20805] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:28:32.115 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 23:28:32.115 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427697,ok=427697,error=0, records=41
[INFO ] 2026-06-01 23:28:37.837 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:28:38.033 [20805] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:28:39.785 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17816/300s
[INFO ] 2026-06-01 23:28:39.786 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862320},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:28:39.933 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:28:39.933 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 23:28:39.934 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:28:39.934 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:28:39.934 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:28:39.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:28:39.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:28:39.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:28:39.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:28:39.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:28:39.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:28:47.188 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 23:28:47.188 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427698,ok=427698,error=0, records=41
[INFO ] 2026-06-01 23:28:52.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:28:53.039 [20917] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:29:02.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 23:29:02.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427699,ok=427699,error=0, records=41
[INFO ] 2026-06-01 23:29:07.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:29:08.044 [20966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:29:17.200 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 23:29:17.200 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427700,ok=427700,error=0, records=41
[INFO ] 2026-06-01 23:29:22.839 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:29:23.050 [20974] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:29:32.208 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 23:29:32.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427701,ok=427701,error=0, records=41
[WARN ] 2026-06-01 23:29:37.555 [21002] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:29:37.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:29:47.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 23:29:47.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427702,ok=427702,error=0, records=41
[WARN ] 2026-06-01 23:29:52.561 [21008] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:29:52.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:30:01.513 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21400/300s
[INFO ] 2026-06-01 23:30:02.221 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 23:30:02.221 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427703,ok=427703,error=0, records=41
[WARN ] 2026-06-01 23:30:07.565 [21037] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:30:07.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:30:17.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 23:30:17.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427704,ok=427704,error=0, records=41
[INFO ] 2026-06-01 23:30:17.569 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21391/300s
[WARN ] 2026-06-01 23:30:22.571 [21059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:30:22.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:30:32.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 23:30:32.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427705,ok=427705,error=0, records=41
[WARN ] 2026-06-01 23:30:37.575 [21077] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:30:37.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:30:42.340 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21400/300s
[INFO ] 2026-06-01 23:30:47.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 23:30:47.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427706,ok=427706,error=0, records=41
[INFO ] 2026-06-01 23:30:47.238 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21387/300s
[WARN ] 2026-06-01 23:30:52.580 [21090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:30:52.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:31:02.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-01 23:31:02.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427707,ok=427707,error=0, records=41
[WARN ] 2026-06-01 23:31:07.585 [21096] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:31:07.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:31:17.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-01 23:31:17.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427708,ok=427708,error=0, records=41
[WARN ] 2026-06-01 23:31:22.592 [21107] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:31:22.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:31:32.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-01 23:31:32.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427709,ok=427709,error=0, records=41
[WARN ] 2026-06-01 23:31:37.597 [21096] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:31:37.845 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:31:39.604 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21387/300s
[INFO ] 2026-06-01 23:31:39.935 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862240},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:31:40.100 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:31:40.100 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-01 23:31:40.100 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:31:40.100 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:31:40.100 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:31:40.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:31:40.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:31:40.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:31:40.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:31:40.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:31:40.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:31:44.524 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21396/300s
[INFO ] 2026-06-01 23:31:47.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 23:31:47.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427710,ok=427710,error=0, records=41
[WARN ] 2026-06-01 23:31:52.602 [21090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:31:52.845 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:32:02.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-01 23:32:02.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427711,ok=427711,error=0, records=41
[WARN ] 2026-06-01 23:32:07.608 [21138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:32:07.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:32:07.846 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21399/300s
[INFO ] 2026-06-01 23:32:17.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 23:32:17.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427712,ok=427712,error=0, records=41
[WARN ] 2026-06-01 23:32:22.614 [21090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:32:22.847 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:32:32.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 23:32:32.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427713,ok=427713,error=0, records=41
[WARN ] 2026-06-01 23:32:37.620 [21107] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:32:37.847 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:32:47.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 23:32:47.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427714,ok=427714,error=0, records=41
[INFO ] 2026-06-01 23:32:51.091 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21397/300s
[WARN ] 2026-06-01 23:32:52.626 [21128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:32:52.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:32:52.993 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21397/300s
[INFO ] 2026-06-01 23:33:00.200 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21397/300s
[INFO ] 2026-06-01 23:33:02.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-01 23:33:02.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427715,ok=427715,error=0, records=41
[WARN ] 2026-06-01 23:33:07.631 [21096] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:33:07.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:33:17.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 23:33:17.309 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427716,ok=427716,error=0, records=41
[WARN ] 2026-06-01 23:33:22.637 [21107] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:33:22.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:33:32.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-01 23:33:32.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427717,ok=427717,error=0, records=41
[WARN ] 2026-06-01 23:33:37.642 [21128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:33:37.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 23:33:37.850 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 23:33:47.384 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 23:33:47.384 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427718,ok=427718,error=0, records=41
[WARN ] 2026-06-01 23:33:52.647 [21107] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:33:52.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:34:02.397 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 23:34:02.397 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427719,ok=427719,error=0, records=41
[WARN ] 2026-06-01 23:34:07.652 [21128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:34:07.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:34:17.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-01 23:34:17.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427720,ok=427720,error=0, records=41
[WARN ] 2026-06-01 23:34:22.658 [21096] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:34:22.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:34:32.411 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-01 23:34:32.411 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427721,ok=427721,error=0, records=41
[WARN ] 2026-06-01 23:34:37.663 [21107] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:34:37.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:34:40.100 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17817/300s
[INFO ] 2026-06-01 23:34:40.102 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862156},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:34:40.257 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:34:40.257 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 23:34:40.257 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:34:40.257 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:34:40.257 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:34:40.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:34:40.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:34:40.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:34:40.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:34:40.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:34:40.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:34:47.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-01 23:34:47.416 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427722,ok=427722,error=0, records=41
[WARN ] 2026-06-01 23:34:52.668 [21138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:34:52.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:35:01.517 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21401/300s
[INFO ] 2026-06-01 23:35:02.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-01 23:35:02.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427723,ok=427723,error=0, records=41
[WARN ] 2026-06-01 23:35:07.673 [21138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:35:07.854 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:35:17.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 23:35:17.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427724,ok=427724,error=0, records=41
[INFO ] 2026-06-01 23:35:17.675 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21392/300s
[WARN ] 2026-06-01 23:35:22.677 [21107] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:35:22.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:35:32.435 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 23:35:32.435 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427725,ok=427725,error=0, records=41
[WARN ] 2026-06-01 23:35:37.683 [21090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:35:37.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:35:42.347 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21401/300s
[INFO ] 2026-06-01 23:35:47.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 23:35:47.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427726,ok=427726,error=0, records=41
[INFO ] 2026-06-01 23:35:47.440 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21388/300s
[WARN ] 2026-06-01 23:35:47.687 [21107] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19272/stat), No such file or directory
[WARN ] 2026-06-01 23:35:52.688 [21138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:35:52.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:36:02.466 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-01 23:36:02.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427727,ok=427727,error=0, records=41
[WARN ] 2026-06-01 23:36:07.694 [21090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:36:07.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:36:17.473 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-01 23:36:17.473 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427728,ok=427728,error=0, records=41
[WARN ] 2026-06-01 23:36:22.698 [21107] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:36:22.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:36:32.485 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-01 23:36:32.485 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427729,ok=427729,error=0, records=41
[WARN ] 2026-06-01 23:36:37.703 [21128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:36:37.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:36:39.784 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21388/300s
[INFO ] 2026-06-01 23:36:44.579 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21397/300s
[INFO ] 2026-06-01 23:36:47.489 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-01 23:36:47.489 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427730,ok=427730,error=0, records=41
[WARN ] 2026-06-01 23:36:52.708 [21138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:36:52.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:37:02.494 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 23:37:02.494 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427731,ok=427731,error=0, records=41
[WARN ] 2026-06-01 23:37:07.713 [21138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:37:07.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:37:07.859 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21400/300s
[INFO ] 2026-06-01 23:37:17.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 23:37:17.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427732,ok=427732,error=0, records=41
[WARN ] 2026-06-01 23:37:22.718 [21096] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:37:22.860 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:37:32.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 23:37:32.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427733,ok=427733,error=0, records=41
[WARN ] 2026-06-01 23:37:37.723 [21096] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:37:37.860 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:37:40.258 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20862068},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:37:40.413 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:37:40.413 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 23:37:40.413 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:37:40.414 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:37:40.414 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:37:40.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:37:40.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:37:40.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:37:40.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:37:40.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:37:40.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:37:47.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 23:37:47.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427734,ok=427734,error=0, records=41
[INFO ] 2026-06-01 23:37:51.143 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21398/300s
[WARN ] 2026-06-01 23:37:52.728 [21138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:37:52.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:37:53.045 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21398/300s
[INFO ] 2026-06-01 23:38:00.251 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21398/300s
[INFO ] 2026-06-01 23:38:02.517 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 23:38:02.518 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427735,ok=427735,error=0, records=41
[WARN ] 2026-06-01 23:38:07.734 [21090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:38:07.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:38:17.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 23:38:17.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427736,ok=427736,error=0, records=41
[WARN ] 2026-06-01 23:38:22.740 [21096] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:38:22.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:38:32.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 23:38:32.532 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427737,ok=427737,error=0, records=41
[WARN ] 2026-06-01 23:38:37.747 [21090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:38:37.863 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:38:47.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 23:38:47.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427738,ok=427738,error=0, records=41
[WARN ] 2026-06-01 23:38:52.752 [21090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:38:52.863 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:38:52.863 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 23:39:02.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 23:39:02.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427739,ok=427739,error=0, records=41
[WARN ] 2026-06-01 23:39:07.756 [21107] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:39:07.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:39:17.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 23:39:17.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427740,ok=427740,error=0, records=41
[WARN ] 2026-06-01 23:39:22.761 [21128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:39:22.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:39:32.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 23:39:32.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427741,ok=427741,error=0, records=41
[WARN ] 2026-06-01 23:39:37.766 [21090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:39:37.866 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:39:47.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-01 23:39:47.558 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427742,ok=427742,error=0, records=41
[WARN ] 2026-06-01 23:39:52.771 [21107] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:39:52.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:40:01.520 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21402/300s
[INFO ] 2026-06-01 23:40:02.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-01 23:40:02.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427743,ok=427743,error=0, records=41
[WARN ] 2026-06-01 23:40:07.776 [21128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:40:07.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:40:17.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 23:40:17.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427744,ok=427744,error=0, records=41
[INFO ] 2026-06-01 23:40:17.779 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21393/300s
[WARN ] 2026-06-01 23:40:22.781 [21128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:40:22.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:40:32.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-01 23:40:32.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427745,ok=427745,error=0, records=41
[WARN ] 2026-06-01 23:40:37.787 [21107] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:40:37.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:40:40.414 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17818/300s
[INFO ] 2026-06-01 23:40:40.415 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861988},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:40:40.587 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:40:40.587 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-01 23:40:42.354 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21402/300s
[INFO ] 2026-06-01 23:40:47.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 23:40:47.581 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427746,ok=427746,error=0, records=41
[INFO ] 2026-06-01 23:40:47.581 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21389/300s
[WARN ] 2026-06-01 23:40:52.793 [21090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:40:52.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:41:02.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-01 23:41:02.586 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427747,ok=427747,error=0, records=41
[WARN ] 2026-06-01 23:41:07.798 [21090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:41:07.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:41:17.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-01 23:41:17.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427748,ok=427748,error=0, records=41
[WARN ] 2026-06-01 23:41:22.803 [21138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:41:22.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:41:32.596 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-01 23:41:32.596 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427749,ok=427749,error=0, records=41
[WARN ] 2026-06-01 23:41:37.807 [21663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:41:37.871 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:41:39.969 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21389/300s
[INFO ] 2026-06-01 23:41:44.638 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21398/300s
[INFO ] 2026-06-01 23:41:47.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-01 23:41:47.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427750,ok=427750,error=0, records=41
[WARN ] 2026-06-01 23:41:52.813 [21692] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:41:52.871 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:42:02.610 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 23:42:02.610 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427751,ok=427751,error=0, records=41
[WARN ] 2026-06-01 23:42:07.817 [21128] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:42:07.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:42:07.872 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21401/300s
[INFO ] 2026-06-01 23:42:17.664 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=13302, records=49
[INFO ] 2026-06-01 23:42:17.664 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427752,ok=427752,error=0, records=49
[WARN ] 2026-06-01 23:42:22.822 [21138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:42:22.873 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:42:32.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 23:42:32.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427753,ok=427753,error=0, records=41
[WARN ] 2026-06-01 23:42:37.827 [21138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:42:37.873 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:42:47.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 23:42:47.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427754,ok=427754,error=0, records=41
[INFO ] 2026-06-01 23:42:51.209 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21399/300s
[WARN ] 2026-06-01 23:42:52.832 [21749] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:42:52.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:42:53.111 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21399/300s
[INFO ] 2026-06-01 23:43:00.317 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21399/300s
[INFO ] 2026-06-01 23:43:02.682 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-01 23:43:02.682 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427755,ok=427755,error=0, records=41
[WARN ] 2026-06-01 23:43:07.837 [21763] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:43:07.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:43:17.687 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-01 23:43:17.687 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427756,ok=427756,error=0, records=41
[WARN ] 2026-06-01 23:43:22.841 [21763] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:43:22.875 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:43:32.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-01 23:43:32.693 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427757,ok=427757,error=0, records=41
[WARN ] 2026-06-01 23:43:37.846 [21749] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:43:37.875 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 23:43:37.875 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-01 23:43:40.589 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861912},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:43:40.762 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:43:40.762 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 23:43:40.762 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:43:40.762 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:43:40.762 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:43:40.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:43:40.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:43:40.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:43:40.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:43:40.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:43:40.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:43:47.712 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-01 23:43:47.712 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427758,ok=427758,error=0, records=41
[WARN ] 2026-06-01 23:43:52.851 [21763] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:43:52.876 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:44:02.718 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 23:44:02.718 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427759,ok=427759,error=0, records=41
[WARN ] 2026-06-01 23:44:07.855 [21801] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:44:07.877 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:44:17.757 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 23:44:17.757 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427760,ok=427760,error=0, records=41
[WARN ] 2026-06-01 23:44:22.861 [21801] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:44:22.877 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:44:32.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-01 23:44:32.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427761,ok=427761,error=0, records=41
[WARN ] 2026-06-01 23:44:37.867 [21801] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:44:37.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:44:47.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-01 23:44:47.767 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427762,ok=427762,error=0, records=41
[WARN ] 2026-06-01 23:44:52.874 [21861] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:44:52.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:45:01.523 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21403/300s
[INFO ] 2026-06-01 23:45:02.772 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-01 23:45:02.773 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427763,ok=427763,error=0, records=41
[INFO ] 2026-06-01 23:45:07.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:45:07.879 [21877] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:45:17.778 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-01 23:45:17.778 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427764,ok=427764,error=0, records=41
[INFO ] 2026-06-01 23:45:17.882 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21394/300s
[INFO ] 2026-06-01 23:45:22.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:45:22.884 [21871] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:45:32.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 23:45:32.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427765,ok=427765,error=0, records=41
[INFO ] 2026-06-01 23:45:37.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:45:37.891 [21861] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:45:42.360 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21403/300s
[INFO ] 2026-06-01 23:45:47.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-01 23:45:47.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427766,ok=427766,error=0, records=41
[INFO ] 2026-06-01 23:45:47.790 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21390/300s
[INFO ] 2026-06-01 23:45:52.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:45:52.895 [21815] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:46:02.795 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 23:46:02.795 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427767,ok=427767,error=0, records=41
[INFO ] 2026-06-01 23:46:07.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:46:07.900 [21928] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:46:17.801 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-01 23:46:17.801 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427768,ok=427768,error=0, records=41
[INFO ] 2026-06-01 23:46:22.882 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:46:22.905 [21950] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:46:32.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 23:46:32.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427769,ok=427769,error=0, records=41
[INFO ] 2026-06-01 23:46:37.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:46:37.910 [21972] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:46:40.160 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21390/300s
[INFO ] 2026-06-01 23:46:40.762 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17819/300s
[INFO ] 2026-06-01 23:46:40.764 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861836},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:46:40.941 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:46:40.941 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 23:46:40.942 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:46:40.942 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:46:40.942 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:46:40.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:46:40.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:46:40.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:46:40.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:46:40.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:46:40.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:46:44.692 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21399/300s
[INFO ] 2026-06-01 23:46:47.817 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-01 23:46:47.817 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427770,ok=427770,error=0, records=41
[INFO ] 2026-06-01 23:46:52.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:46:52.915 [21966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:47:02.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-01 23:47:02.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427771,ok=427771,error=0, records=41
[INFO ] 2026-06-01 23:47:07.884 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:47:07.884 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21402/300s
[WARN ] 2026-06-01 23:47:07.920 [22007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:47:17.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-01 23:47:17.830 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427772,ok=427772,error=0, records=41
[INFO ] 2026-06-01 23:47:22.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:47:22.926 [22024] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:47:32.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-01 23:47:32.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427773,ok=427773,error=0, records=41
[INFO ] 2026-06-01 23:47:37.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:47:37.932 [22034] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:47:47.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 23:47:47.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427774,ok=427774,error=0, records=41
[INFO ] 2026-06-01 23:47:51.274 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21400/300s
[INFO ] 2026-06-01 23:47:52.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:47:52.937 [21966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:47:53.176 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21400/300s
[INFO ] 2026-06-01 23:48:00.379 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21400/300s
[INFO ] 2026-06-01 23:48:02.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11277, records=44
[INFO ] 2026-06-01 23:48:02.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427775,ok=427775,error=0, records=44
[INFO ] 2026-06-01 23:48:07.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:48:07.944 [22072] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:48:17.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 23:48:17.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427776,ok=427776,error=0, records=41
[INFO ] 2026-06-01 23:48:22.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:48:22.949 [22083] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:48:32.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-01 23:48:32.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427777,ok=427777,error=0, records=41
[INFO ] 2026-06-01 23:48:37.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:48:37.954 [22066] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:48:48.013 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-01 23:48:48.013 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427778,ok=427778,error=0, records=41
[INFO ] 2026-06-01 23:48:52.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:48:52.959 [22082] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:49:03.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-01 23:49:03.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427779,ok=427779,error=0, records=41
[INFO ] 2026-06-01 23:49:07.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:49:07.963 [22082] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:49:18.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 23:49:18.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427780,ok=427780,error=0, records=41
[INFO ] 2026-06-01 23:49:22.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:49:22.968 [22097] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:49:33.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 23:49:33.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427781,ok=427781,error=0, records=41
[INFO ] 2026-06-01 23:49:37.890 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:49:37.973 [22082] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:49:40.943 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861756},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:49:41.118 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:49:41.119 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-01 23:49:41.119 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:49:41.119 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:49:41.119 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:49:41.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:49:41.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:49:41.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:49:41.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:49:41.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:49:41.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:49:48.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-01 23:49:48.090 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427782,ok=427782,error=0, records=41
[INFO ] 2026-06-01 23:49:52.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:49:52.977 [22125] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:50:01.527 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21404/300s
[INFO ] 2026-06-01 23:50:03.095 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-01 23:50:03.095 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427783,ok=427783,error=0, records=41
[INFO ] 2026-06-01 23:50:07.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:50:07.982 [22139] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:50:17.985 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21395/300s
[INFO ] 2026-06-01 23:50:18.102 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-01 23:50:18.102 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427784,ok=427784,error=0, records=41
[INFO ] 2026-06-01 23:50:22.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:50:22.987 [22139] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:50:33.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-01 23:50:33.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427785,ok=427785,error=0, records=41
[INFO ] 2026-06-01 23:50:37.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:50:37.991 [22169] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:50:42.366 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21404/300s
[INFO ] 2026-06-01 23:50:48.113 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-01 23:50:48.113 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427786,ok=427786,error=0, records=41
[INFO ] 2026-06-01 23:50:48.113 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21391/300s
[INFO ] 2026-06-01 23:50:52.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:50:52.996 [22230] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:51:03.118 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-01 23:51:03.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427787,ok=427787,error=0, records=41
[INFO ] 2026-06-01 23:51:07.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:51:08.001 [22244] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:51:18.124 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-01 23:51:18.124 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427788,ok=427788,error=0, records=41
[INFO ] 2026-06-01 23:51:22.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:51:23.006 [22230] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:51:33.129 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 23:51:33.129 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427789,ok=427789,error=0, records=41
[INFO ] 2026-06-01 23:51:37.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:51:38.011 [22244] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:51:40.342 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21391/300s
[INFO ] 2026-06-01 23:51:44.749 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21400/300s
[INFO ] 2026-06-01 23:51:48.134 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-01 23:51:48.134 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427790,ok=427790,error=0, records=41
[INFO ] 2026-06-01 23:51:52.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:51:53.017 [22258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:52:03.139 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-01 23:52:03.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427791,ok=427791,error=0, records=41
[INFO ] 2026-06-01 23:52:07.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:52:07.896 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21403/300s
[WARN ] 2026-06-01 23:52:08.023 [22202] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:52:18.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 23:52:18.144 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427792,ok=427792,error=0, records=41
[INFO ] 2026-06-01 23:52:22.897 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:52:23.028 [22313] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:52:33.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 23:52:33.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427793,ok=427793,error=0, records=41
[INFO ] 2026-06-01 23:52:37.897 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:52:38.033 [22125] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:52:41.119 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17820/300s
[INFO ] 2026-06-01 23:52:41.120 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861688},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:52:41.304 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:52:41.304 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-01 23:52:41.304 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:52:41.304 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:52:41.304 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:52:41.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:52:41.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:52:41.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:52:41.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:52:41.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:52:41.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:52:48.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-01 23:52:48.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427794,ok=427794,error=0, records=41
[INFO ] 2026-06-01 23:52:51.336 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21401/300s
[INFO ] 2026-06-01 23:52:52.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:52:53.038 [22125] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:52:53.239 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21401/300s
[INFO ] 2026-06-01 23:53:00.445 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21401/300s
[INFO ] 2026-06-01 23:53:03.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-01 23:53:03.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427795,ok=427795,error=0, records=41
[INFO ] 2026-06-01 23:53:07.899 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:53:08.044 [22371] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:53:18.168 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-01 23:53:18.168 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427796,ok=427796,error=0, records=41
[INFO ] 2026-06-01 23:53:22.899 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-01 23:53:23.049 [22384] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:53:33.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-01 23:53:33.175 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427797,ok=427797,error=0, records=41
[INFO ] 2026-06-01 23:53:37.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-01 23:53:37.900 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-01 23:53:38.054 [22391] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:53:48.182 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 23:53:48.182 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427798,ok=427798,error=0, records=41
[WARN ] 2026-06-01 23:53:52.558 [22425] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:53:52.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:53:52.900 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-01 23:54:03.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-01 23:54:03.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427799,ok=427799,error=0, records=41
[WARN ] 2026-06-01 23:54:07.564 [22408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:54:07.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:54:18.196 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-01 23:54:18.196 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427800,ok=427800,error=0, records=41
[WARN ] 2026-06-01 23:54:22.570 [22453] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:54:22.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:54:33.203 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-01 23:54:33.203 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427801,ok=427801,error=0, records=41
[WARN ] 2026-06-01 23:54:37.575 [22408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:54:37.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:54:48.208 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-01 23:54:48.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427802,ok=427802,error=0, records=41
[WARN ] 2026-06-01 23:54:52.579 [22490] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:54:52.904 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:55:01.530 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21405/300s
[INFO ] 2026-06-01 23:55:03.213 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-01 23:55:03.213 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427803,ok=427803,error=0, records=41
[WARN ] 2026-06-01 23:55:07.585 [22490] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:55:07.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:55:18.088 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21396/300s
[INFO ] 2026-06-01 23:55:18.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-01 23:55:18.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427804,ok=427804,error=0, records=41
[WARN ] 2026-06-01 23:55:22.591 [22531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:55:22.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:55:33.225 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-01 23:55:33.225 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427805,ok=427805,error=0, records=41
[WARN ] 2026-06-01 23:55:37.596 [22494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:55:37.906 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:55:41.306 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861976},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:55:41.470 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:55:41.470 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-01 23:55:41.470 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:55:41.470 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:55:41.470 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:55:41.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:55:41.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:55:41.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:55:41.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:55:41.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:55:41.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:55:42.373 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21405/300s
[INFO ] 2026-06-01 23:55:48.252 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-01 23:55:48.252 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427806,ok=427806,error=0, records=41
[INFO ] 2026-06-01 23:55:48.253 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21392/300s
[WARN ] 2026-06-01 23:55:52.600 [22507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:55:52.906 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:56:03.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-01 23:56:03.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427807,ok=427807,error=0, records=41
[WARN ] 2026-06-01 23:56:07.606 [22525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:56:07.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:56:18.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-01 23:56:18.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427808,ok=427808,error=0, records=41
[WARN ] 2026-06-01 23:56:22.610 [22507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:56:22.908 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:56:33.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-01 23:56:33.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427809,ok=427809,error=0, records=41
[WARN ] 2026-06-01 23:56:37.616 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:56:37.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:56:40.528 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21392/300s
[INFO ] 2026-06-01 23:56:44.805 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21401/300s
[INFO ] 2026-06-01 23:56:48.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-01 23:56:48.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427810,ok=427810,error=0, records=41
[WARN ] 2026-06-01 23:56:52.622 [22525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:56:52.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:57:03.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-01 23:57:03.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427811,ok=427811,error=0, records=41
[WARN ] 2026-06-01 23:57:07.627 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:57:07.910 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:57:07.910 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21404/300s
[INFO ] 2026-06-01 23:57:18.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-01 23:57:18.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427812,ok=427812,error=0, records=41
[WARN ] 2026-06-01 23:57:22.632 [22507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:57:22.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:57:33.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-01 23:57:33.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427813,ok=427813,error=0, records=41
[WARN ] 2026-06-01 23:57:37.637 [22525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:57:37.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:57:48.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-01 23:57:48.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427814,ok=427814,error=0, records=41
[INFO ] 2026-06-01 23:57:51.403 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21402/300s
[WARN ] 2026-06-01 23:57:52.643 [22494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:57:52.912 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:57:53.305 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21402/300s
[INFO ] 2026-06-01 23:58:00.512 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21402/300s
[INFO ] 2026-06-01 23:58:03.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-01 23:58:03.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427815,ok=427815,error=0, records=41
[WARN ] 2026-06-01 23:58:07.649 [22531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:58:07.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:58:18.326 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-01 23:58:18.326 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427816,ok=427816,error=0, records=41
[WARN ] 2026-06-01 23:58:22.654 [22494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:58:22.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:58:33.331 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-01 23:58:33.331 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427817,ok=427817,error=0, records=41
[WARN ] 2026-06-01 23:58:37.659 [22494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:58:37.914 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:58:41.470 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17821/300s
[INFO ] 2026-06-01 23:58:41.472 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861908},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-01 23:58:41.637 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-01 23:58:41.638 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-01 23:58:41.638 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-01 23:58:41.638 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-01 23:58:41.638 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-01 23:58:41.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:58:41.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-01 23:58:41.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:58:41.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-01 23:58:41.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:58:41.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-01 23:58:48.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-01 23:58:48.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427818,ok=427818,error=0, records=41
[WARN ] 2026-06-01 23:58:52.663 [22525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:58:52.914 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:59:03.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-01 23:59:03.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427819,ok=427819,error=0, records=41
[WARN ] 2026-06-01 23:59:07.668 [22494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:59:07.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:59:18.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-01 23:59:18.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427820,ok=427820,error=0, records=41
[WARN ] 2026-06-01 23:59:22.673 [22494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:59:22.916 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:59:33.443 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-01 23:59:33.443 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427821,ok=427821,error=0, records=41
[WARN ] 2026-06-01 23:59:37.678 [22507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:59:37.916 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-01 23:59:48.525 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-01 23:59:48.525 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427822,ok=427822,error=0, records=41
[WARN ] 2026-06-01 23:59:52.684 [22531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-01 23:59:52.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:00:01.538 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21406/300s
[INFO ] 2026-06-02 00:00:03.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 00:00:03.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427823,ok=427823,error=0, records=41
[WARN ] 2026-06-02 00:00:07.710 [22525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:00:07.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:00:18.213 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21397/300s
[INFO ] 2026-06-02 00:00:18.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-02 00:00:18.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427824,ok=427824,error=0, records=41
[WARN ] 2026-06-02 00:00:22.716 [22507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:00:22.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:00:33.562 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 00:00:33.562 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427825,ok=427825,error=0, records=41
[WARN ] 2026-06-02 00:00:37.721 [22525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:00:37.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:00:42.389 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21406/300s
[INFO ] 2026-06-02 00:00:48.570 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-02 00:00:48.570 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427826,ok=427826,error=0, records=41
[INFO ] 2026-06-02 00:00:48.570 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21393/300s
[WARN ] 2026-06-02 00:00:52.726 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:00:52.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:01:03.576 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 00:01:03.576 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427827,ok=427827,error=0, records=41
[WARN ] 2026-06-02 00:01:07.731 [22525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:01:07.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:01:18.582 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 00:01:18.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427828,ok=427828,error=0, records=41
[WARN ] 2026-06-02 00:01:22.735 [22531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:01:22.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:01:33.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 00:01:33.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427829,ok=427829,error=0, records=41
[WARN ] 2026-06-02 00:01:37.739 [22494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:01:37.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:01:40.736 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21393/300s
[INFO ] 2026-06-02 00:01:41.639 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861828},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:01:41.796 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:01:41.796 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 00:01:41.796 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:01:41.796 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:01:41.796 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:01:41.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:01:41.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:01:41.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:01:41.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:01:41.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:01:41.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:01:44.888 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21402/300s
[INFO ] 2026-06-02 00:01:48.596 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 00:01:48.596 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427830,ok=427830,error=0, records=41
[WARN ] 2026-06-02 00:01:52.743 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:01:52.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:02:03.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-02 00:02:03.601 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427831,ok=427831,error=0, records=41
[WARN ] 2026-06-02 00:02:07.749 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:02:07.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:02:07.922 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21405/300s
[INFO ] 2026-06-02 00:02:18.606 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 00:02:18.606 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427832,ok=427832,error=0, records=41
[WARN ] 2026-06-02 00:02:22.755 [22525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:02:22.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:02:33.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 00:02:33.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427833,ok=427833,error=0, records=41
[WARN ] 2026-06-02 00:02:37.761 [22507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:02:37.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:02:48.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 00:02:48.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427834,ok=427834,error=0, records=41
[INFO ] 2026-06-02 00:02:51.489 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21403/300s
[WARN ] 2026-06-02 00:02:52.767 [22507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:02:52.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:02:53.391 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21403/300s
[INFO ] 2026-06-02 00:03:00.556 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21403/300s
[INFO ] 2026-06-02 00:03:03.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-02 00:03:03.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427835,ok=427835,error=0, records=41
[WARN ] 2026-06-02 00:03:07.772 [22494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:03:07.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:03:18.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-02 00:03:18.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427836,ok=427836,error=0, records=41
[WARN ] 2026-06-02 00:03:22.776 [22494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:03:22.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:03:33.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-02 00:03:33.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427837,ok=427837,error=0, records=41
[WARN ] 2026-06-02 00:03:37.781 [22507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:03:37.926 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 00:03:37.926 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 00:03:48.648 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-02 00:03:48.648 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427838,ok=427838,error=0, records=41
[WARN ] 2026-06-02 00:03:52.785 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:03:52.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:04:03.654 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 00:04:03.654 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427839,ok=427839,error=0, records=41
[WARN ] 2026-06-02 00:04:07.791 [22531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:04:07.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:04:18.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 00:04:18.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427840,ok=427840,error=0, records=41
[WARN ] 2026-06-02 00:04:22.796 [22507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:04:22.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:04:33.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 00:04:33.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427841,ok=427841,error=0, records=41
[WARN ] 2026-06-02 00:04:37.800 [22525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:04:37.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:04:41.796 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17822/300s
[INFO ] 2026-06-02 00:04:41.798 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861752},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:04:41.951 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:04:41.951 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 00:04:41.951 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:04:41.951 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:04:41.951 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:04:41.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:04:41.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:04:41.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:04:41.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:04:41.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:04:41.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:04:48.673 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 00:04:48.673 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427842,ok=427842,error=0, records=41
[WARN ] 2026-06-02 00:04:52.806 [22494] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:04:52.929 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:05:01.541 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21407/300s
[INFO ] 2026-06-02 00:05:03.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 00:05:03.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427843,ok=427843,error=0, records=41
[WARN ] 2026-06-02 00:05:07.811 [22531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:05:07.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:05:18.315 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21398/300s
[INFO ] 2026-06-02 00:05:18.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 00:05:18.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427844,ok=427844,error=0, records=41
[WARN ] 2026-06-02 00:05:22.817 [22531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:05:22.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:05:33.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 00:05:33.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427845,ok=427845,error=0, records=41
[WARN ] 2026-06-02 00:05:37.822 [23109] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:05:37.931 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:05:42.395 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21407/300s
[INFO ] 2026-06-02 00:05:48.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 00:05:48.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427846,ok=427846,error=0, records=41
[INFO ] 2026-06-02 00:05:48.696 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21394/300s
[WARN ] 2026-06-02 00:05:52.827 [23123] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:05:52.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:06:03.702 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 00:06:03.702 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427847,ok=427847,error=0, records=41
[WARN ] 2026-06-02 00:06:07.832 [23123] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:06:07.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:06:18.707 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 00:06:18.707 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427848,ok=427848,error=0, records=41
[WARN ] 2026-06-02 00:06:22.837 [23095] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:06:22.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:06:33.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 00:06:33.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427849,ok=427849,error=0, records=41
[WARN ] 2026-06-02 00:06:37.842 [23080] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:06:37.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:06:40.917 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21394/300s
[INFO ] 2026-06-02 00:06:44.944 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21403/300s
[INFO ] 2026-06-02 00:06:48.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 00:06:48.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427850,ok=427850,error=0, records=41
[WARN ] 2026-06-02 00:06:52.847 [23137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:06:52.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:07:03.724 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 00:07:03.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427851,ok=427851,error=0, records=41
[WARN ] 2026-06-02 00:07:07.854 [23151] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:07:07.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:07:07.935 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21406/300s
[INFO ] 2026-06-02 00:07:18.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 00:07:18.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427852,ok=427852,error=0, records=41
[WARN ] 2026-06-02 00:07:22.859 [23080] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:07:22.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:07:33.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 00:07:33.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427853,ok=427853,error=0, records=41
[WARN ] 2026-06-02 00:07:37.863 [23201] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:07:37.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:07:41.953 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861672},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:07:42.116 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:07:42.116 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 00:07:42.116 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:07:42.116 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:07:42.116 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:07:42.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:07:42.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:07:42.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:07:42.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:07:42.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:07:42.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:07:48.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 00:07:48.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427854,ok=427854,error=0, records=41
[INFO ] 2026-06-02 00:07:51.566 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21404/300s
[WARN ] 2026-06-02 00:07:52.867 [23230] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:07:52.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:07:53.467 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21404/300s
[INFO ] 2026-06-02 00:08:00.623 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21404/300s
[INFO ] 2026-06-02 00:08:03.752 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 00:08:03.752 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427855,ok=427855,error=0, records=41
[WARN ] 2026-06-02 00:08:07.871 [23123] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:08:07.937 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:08:18.757 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 00:08:18.757 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427856,ok=427856,error=0, records=41
[WARN ] 2026-06-02 00:08:22.877 [23254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:08:22.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:08:33.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 00:08:33.763 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427857,ok=427857,error=0, records=41
[WARN ] 2026-06-02 00:08:37.884 [23282] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:08:37.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:08:48.768 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 00:08:48.768 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427858,ok=427858,error=0, records=41
[WARN ] 2026-06-02 00:08:52.889 [23293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:08:52.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:08:52.939 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 00:09:03.773 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 00:09:03.773 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427859,ok=427859,error=0, records=41
[WARN ] 2026-06-02 00:09:07.893 [23307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:09:07.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:09:18.778 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 00:09:18.778 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427860,ok=427860,error=0, records=41
[WARN ] 2026-06-02 00:09:22.898 [23292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:09:22.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:09:33.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 00:09:33.783 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427861,ok=427861,error=0, records=41
[WARN ] 2026-06-02 00:09:37.904 [23335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:09:37.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:09:48.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 00:09:48.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427862,ok=427862,error=0, records=41
[WARN ] 2026-06-02 00:09:52.910 [23340] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:09:52.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:10:01.544 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21408/300s
[INFO ] 2026-06-02 00:10:03.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10384, records=41
[INFO ] 2026-06-02 00:10:03.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427863,ok=427863,error=0, records=41
[WARN ] 2026-06-02 00:10:07.916 [23323] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:10:07.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:10:18.419 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21399/300s
[INFO ] 2026-06-02 00:10:18.799 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-02 00:10:18.800 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427864,ok=427864,error=0, records=41
[WARN ] 2026-06-02 00:10:22.921 [23396] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:10:22.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:10:33.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-02 00:10:33.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427865,ok=427865,error=0, records=41
[WARN ] 2026-06-02 00:10:37.926 [23396] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:10:37.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:10:42.116 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17823/300s
[INFO ] 2026-06-02 00:10:42.118 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861592},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:10:42.278 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:10:42.278 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 00:10:42.278 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:10:42.278 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:10:42.278 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:10:42.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:10:42.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:10:42.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:10:42.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:10:42.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:10:42.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:10:42.401 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21408/300s
[INFO ] 2026-06-02 00:10:48.811 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-02 00:10:48.811 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427866,ok=427866,error=0, records=41
[INFO ] 2026-06-02 00:10:48.811 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21395/300s
[WARN ] 2026-06-02 00:10:52.932 [23390] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:10:52.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:11:03.816 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-02 00:11:03.816 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427867,ok=427867,error=0, records=41
[WARN ] 2026-06-02 00:11:07.937 [23446] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:11:07.945 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:11:18.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 00:11:18.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427868,ok=427868,error=0, records=41
[INFO ] 2026-06-02 00:11:22.945 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:11:22.945 [23441] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:11:33.833 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-02 00:11:33.833 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427869,ok=427869,error=0, records=41
[INFO ] 2026-06-02 00:11:37.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:11:37.951 [23478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:11:41.089 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21395/300s
[INFO ] 2026-06-02 00:11:44.991 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21404/300s
[INFO ] 2026-06-02 00:11:48.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-02 00:11:48.839 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427870,ok=427870,error=0, records=41
[INFO ] 2026-06-02 00:11:52.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:11:52.956 [23492] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:12:03.917 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 00:12:03.917 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427871,ok=427871,error=0, records=41
[INFO ] 2026-06-02 00:12:07.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:12:07.947 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21407/300s
[WARN ] 2026-06-02 00:12:07.960 [23468] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:12:18.922 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 00:12:18.922 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427872,ok=427872,error=0, records=41
[INFO ] 2026-06-02 00:12:22.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:12:22.965 [23478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:12:33.927 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 00:12:33.928 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427873,ok=427873,error=0, records=41
[INFO ] 2026-06-02 00:12:37.948 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:12:37.970 [23478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:12:48.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 00:12:48.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427874,ok=427874,error=0, records=41
[INFO ] 2026-06-02 00:12:51.595 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21405/300s
[INFO ] 2026-06-02 00:12:52.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:12:52.975 [23519] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:12:53.497 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21405/300s
[INFO ] 2026-06-02 00:13:00.633 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21405/300s
[INFO ] 2026-06-02 00:13:03.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10409, records=41
[INFO ] 2026-06-02 00:13:03.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427875,ok=427875,error=0, records=41
[INFO ] 2026-06-02 00:13:07.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:13:07.979 [23519] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:13:18.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-02 00:13:18.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427876,ok=427876,error=0, records=41
[INFO ] 2026-06-02 00:13:22.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:13:22.984 [23561] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:13:33.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-02 00:13:33.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427877,ok=427877,error=0, records=41
[INFO ] 2026-06-02 00:13:37.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 00:13:37.950 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 00:13:37.990 [23588] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:13:42.279 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861516},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:13:42.427 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:13:42.428 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 00:13:42.428 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:13:42.428 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:13:42.428 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:13:42.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:13:42.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:13:42.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:13:42.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:13:42.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:13:42.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:13:48.953 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10394, records=41
[INFO ] 2026-06-02 00:13:48.953 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427878,ok=427878,error=0, records=41
[INFO ] 2026-06-02 00:13:52.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:13:52.995 [23588] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:14:03.959 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 00:14:03.959 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427879,ok=427879,error=0, records=41
[INFO ] 2026-06-02 00:14:07.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:14:08.000 [23547] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:14:18.967 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 00:14:18.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427880,ok=427880,error=0, records=41
[INFO ] 2026-06-02 00:14:22.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:14:23.006 [23631] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:14:33.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 00:14:33.973 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427881,ok=427881,error=0, records=41
[INFO ] 2026-06-02 00:14:37.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:14:38.011 [23478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:14:48.980 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 00:14:48.981 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427882,ok=427882,error=0, records=41
[INFO ] 2026-06-02 00:14:52.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:14:53.016 [23659] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:15:01.547 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21409/300s
[INFO ] 2026-06-02 00:15:03.986 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10383, records=41
[INFO ] 2026-06-02 00:15:03.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427883,ok=427883,error=0, records=41
[INFO ] 2026-06-02 00:15:07.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:15:08.021 [23478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:15:18.525 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21400/300s
[INFO ] 2026-06-02 00:15:18.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 00:15:18.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427884,ok=427884,error=0, records=41
[INFO ] 2026-06-02 00:15:22.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:15:23.026 [23687] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:15:33.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-02 00:15:33.996 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427885,ok=427885,error=0, records=41
[INFO ] 2026-06-02 00:15:37.955 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:15:38.031 [23687] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:15:42.406 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21409/300s
[INFO ] 2026-06-02 00:15:49.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10372, records=41
[INFO ] 2026-06-02 00:15:49.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427886,ok=427886,error=0, records=41
[INFO ] 2026-06-02 00:15:49.000 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21396/300s
[INFO ] 2026-06-02 00:15:52.955 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:15:53.036 [23673] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:16:04.009 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-02 00:16:04.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427887,ok=427887,error=0, records=41
[INFO ] 2026-06-02 00:16:07.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:16:08.041 [23715] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:16:19.013 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-02 00:16:19.013 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427888,ok=427888,error=0, records=41
[INFO ] 2026-06-02 00:16:22.957 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:16:23.046 [23748] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:16:34.030 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-02 00:16:34.030 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427889,ok=427889,error=0, records=41
[INFO ] 2026-06-02 00:16:37.957 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:16:38.050 [23743] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:16:41.263 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21396/300s
[INFO ] 2026-06-02 00:16:42.428 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17824/300s
[INFO ] 2026-06-02 00:16:42.429 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861428},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:16:42.596 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:16:42.596 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 00:16:42.597 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:16:42.597 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:16:42.597 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:16:42.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:16:42.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:16:42.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:16:42.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:16:42.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:16:42.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:16:45.044 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21405/300s
[INFO ] 2026-06-02 00:16:49.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 00:16:49.036 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427890,ok=427890,error=0, records=41
[WARN ] 2026-06-02 00:16:52.555 [23766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:16:52.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:17:04.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 00:17:04.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427891,ok=427891,error=0, records=41
[WARN ] 2026-06-02 00:17:07.559 [23807] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:17:07.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:17:07.959 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21408/300s
[INFO ] 2026-06-02 00:17:19.122 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 00:17:19.122 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427892,ok=427892,error=0, records=41
[WARN ] 2026-06-02 00:17:22.564 [23818] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:17:22.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:17:34.128 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 00:17:34.128 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427893,ok=427893,error=0, records=41
[WARN ] 2026-06-02 00:17:37.571 [23807] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:17:37.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:17:49.133 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 00:17:49.133 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427894,ok=427894,error=0, records=41
[INFO ] 2026-06-02 00:17:51.638 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21406/300s
[WARN ] 2026-06-02 00:17:52.576 [23795] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:17:52.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:17:53.540 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21406/300s
[INFO ] 2026-06-02 00:18:00.660 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21406/300s
[INFO ] 2026-06-02 00:18:04.139 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 00:18:04.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427895,ok=427895,error=0, records=41
[WARN ] 2026-06-02 00:18:07.582 [23877] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:18:07.961 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:18:19.146 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 00:18:19.146 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427896,ok=427896,error=0, records=41
[WARN ] 2026-06-02 00:18:22.587 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:18:22.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:18:34.151 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 00:18:34.151 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427897,ok=427897,error=0, records=41
[WARN ] 2026-06-02 00:18:37.591 [23887] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:18:37.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:18:49.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 00:18:49.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427898,ok=427898,error=0, records=41
[WARN ] 2026-06-02 00:18:52.596 [23910] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:18:52.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:19:04.162 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 00:19:04.162 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427899,ok=427899,error=0, records=41
[WARN ] 2026-06-02 00:19:07.601 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:19:07.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:19:19.177 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 00:19:19.177 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427900,ok=427900,error=0, records=41
[WARN ] 2026-06-02 00:19:22.606 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:19:22.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:19:34.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 00:19:34.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427901,ok=427901,error=0, records=41
[WARN ] 2026-06-02 00:19:37.612 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:19:37.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:19:42.598 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861356},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:19:42.769 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:19:42.769 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 00:19:42.769 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:19:42.769 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:19:42.769 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:19:42.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:19:42.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:19:42.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:19:42.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:19:42.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:19:42.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:19:49.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 00:19:49.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427902,ok=427902,error=0, records=41
[WARN ] 2026-06-02 00:19:52.617 [23877] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:19:52.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:20:01.550 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21410/300s
[INFO ] 2026-06-02 00:20:04.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 00:20:04.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427903,ok=427903,error=0, records=41
[WARN ] 2026-06-02 00:20:07.622 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:20:07.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:20:18.627 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21401/300s
[INFO ] 2026-06-02 00:20:19.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 00:20:19.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427904,ok=427904,error=0, records=41
[WARN ] 2026-06-02 00:20:22.629 [23919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:20:22.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:20:34.209 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 00:20:34.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427905,ok=427905,error=0, records=41
[WARN ] 2026-06-02 00:20:37.636 [23919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:20:37.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:20:42.413 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21410/300s
[INFO ] 2026-06-02 00:20:49.216 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 00:20:49.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427906,ok=427906,error=0, records=41
[INFO ] 2026-06-02 00:20:49.216 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21397/300s
[WARN ] 2026-06-02 00:20:52.641 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:20:52.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:21:04.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 00:21:04.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427907,ok=427907,error=0, records=41
[WARN ] 2026-06-02 00:21:07.646 [23910] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:21:07.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:21:19.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 00:21:19.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427908,ok=427908,error=0, records=41
[WARN ] 2026-06-02 00:21:22.651 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:21:22.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:21:34.236 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 00:21:34.236 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427909,ok=427909,error=0, records=41
[WARN ] 2026-06-02 00:21:37.656 [23919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:21:37.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:21:41.450 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21397/300s
[INFO ] 2026-06-02 00:21:45.099 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21406/300s
[INFO ] 2026-06-02 00:21:49.242 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 00:21:49.242 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427910,ok=427910,error=0, records=41
[WARN ] 2026-06-02 00:21:52.661 [23919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:21:52.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:22:04.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 00:22:04.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427911,ok=427911,error=0, records=41
[WARN ] 2026-06-02 00:22:07.666 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:22:07.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:22:07.971 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21409/300s
[INFO ] 2026-06-02 00:22:19.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 00:22:19.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427912,ok=427912,error=0, records=41
[WARN ] 2026-06-02 00:22:22.671 [23877] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:22:22.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:22:34.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 00:22:34.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427913,ok=427913,error=0, records=41
[WARN ] 2026-06-02 00:22:37.675 [23877] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:22:37.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:22:42.769 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17825/300s
[INFO ] 2026-06-02 00:22:42.771 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861276},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:22:42.936 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:22:42.936 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 00:22:42.936 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:22:42.936 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:22:42.936 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:22:42.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:22:42.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:22:42.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:22:42.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:22:42.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:22:42.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:22:49.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 00:22:49.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427914,ok=427914,error=0, records=41
[INFO ] 2026-06-02 00:22:51.715 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21407/300s
[WARN ] 2026-06-02 00:22:52.681 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:22:52.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:22:53.617 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21407/300s
[INFO ] 2026-06-02 00:23:00.724 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21407/300s
[INFO ] 2026-06-02 00:23:04.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 00:23:04.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427915,ok=427915,error=0, records=41
[WARN ] 2026-06-02 00:23:07.686 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:23:07.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:23:19.276 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 00:23:19.276 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427916,ok=427916,error=0, records=41
[WARN ] 2026-06-02 00:23:22.691 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:23:22.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:23:34.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 00:23:34.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427917,ok=427917,error=0, records=41
[WARN ] 2026-06-02 00:23:37.696 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:23:37.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 00:23:37.975 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 00:23:49.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 00:23:49.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427918,ok=427918,error=0, records=41
[WARN ] 2026-06-02 00:23:52.703 [23919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:23:52.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:23:52.978 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 00:24:04.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 00:24:04.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427919,ok=427919,error=0, records=41
[WARN ] 2026-06-02 00:24:07.710 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:24:07.979 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:24:19.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 00:24:19.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427920,ok=427920,error=0, records=41
[WARN ] 2026-06-02 00:24:22.714 [23919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:24:22.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:24:34.304 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 00:24:34.304 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427921,ok=427921,error=0, records=41
[WARN ] 2026-06-02 00:24:37.719 [23919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:24:37.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:24:49.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-02 00:24:49.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427922,ok=427922,error=0, records=41
[WARN ] 2026-06-02 00:24:52.724 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:24:52.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:25:01.554 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21411/300s
[INFO ] 2026-06-02 00:25:04.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 00:25:04.314 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427923,ok=427923,error=0, records=41
[WARN ] 2026-06-02 00:25:07.729 [23919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:25:07.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:25:18.732 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21402/300s
[INFO ] 2026-06-02 00:25:19.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 00:25:19.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427924,ok=427924,error=0, records=41
[WARN ] 2026-06-02 00:25:22.734 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:25:22.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:25:34.327 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-02 00:25:34.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427925,ok=427925,error=0, records=41
[WARN ] 2026-06-02 00:25:37.740 [23919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:25:37.983 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:25:42.420 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21411/300s
[INFO ] 2026-06-02 00:25:42.938 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861196},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:25:43.123 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:25:43.123 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 00:25:43.123 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:25:43.123 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:25:43.123 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:25:43.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:25:43.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:25:43.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:25:43.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:25:43.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:25:43.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:25:49.334 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-02 00:25:49.334 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427926,ok=427926,error=0, records=41
[INFO ] 2026-06-02 00:25:49.334 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21398/300s
[WARN ] 2026-06-02 00:25:52.745 [23877] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:25:52.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:26:04.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-02 00:26:04.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427927,ok=427927,error=0, records=41
[WARN ] 2026-06-02 00:26:07.750 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:26:07.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:26:19.346 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-02 00:26:19.346 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427928,ok=427928,error=0, records=41
[WARN ] 2026-06-02 00:26:22.756 [23919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:26:22.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:26:34.351 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-02 00:26:34.351 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427929,ok=427929,error=0, records=41
[WARN ] 2026-06-02 00:26:37.760 [23910] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:26:37.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:26:41.637 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21398/300s
[INFO ] 2026-06-02 00:26:45.161 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21407/300s
[INFO ] 2026-06-02 00:26:49.356 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-02 00:26:49.356 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427930,ok=427930,error=0, records=41
[WARN ] 2026-06-02 00:26:52.767 [23877] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:26:52.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:27:04.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 00:27:04.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427931,ok=427931,error=0, records=41
[WARN ] 2026-06-02 00:27:07.772 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:27:07.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:27:07.987 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21410/300s
[INFO ] 2026-06-02 00:27:19.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 00:27:19.368 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427932,ok=427932,error=0, records=41
[WARN ] 2026-06-02 00:27:22.777 [23919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:27:22.988 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:27:34.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 00:27:34.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427933,ok=427933,error=0, records=41
[WARN ] 2026-06-02 00:27:37.782 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:27:37.988 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:27:49.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 00:27:49.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427934,ok=427934,error=0, records=41
[INFO ] 2026-06-02 00:27:51.793 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21408/300s
[WARN ] 2026-06-02 00:27:52.788 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:27:52.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:27:53.694 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21408/300s
[INFO ] 2026-06-02 00:28:00.800 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21408/300s
[INFO ] 2026-06-02 00:28:04.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 00:28:04.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427935,ok=427935,error=0, records=41
[WARN ] 2026-06-02 00:28:07.793 [23910] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:28:07.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:28:19.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 00:28:19.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427936,ok=427936,error=0, records=41
[WARN ] 2026-06-02 00:28:22.799 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:28:22.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:28:34.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 00:28:34.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427937,ok=427937,error=0, records=41
[WARN ] 2026-06-02 00:28:37.804 [24443] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:28:37.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:28:43.123 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17826/300s
[INFO ] 2026-06-02 00:28:43.125 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861120},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:28:43.293 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:28:43.293 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 00:28:43.294 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:28:43.294 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:28:43.294 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:28:43.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:28:43.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:28:43.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:28:43.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:28:43.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:28:43.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:28:49.409 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 00:28:49.409 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427938,ok=427938,error=0, records=41
[WARN ] 2026-06-02 00:28:52.810 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:28:52.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:29:04.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 00:29:04.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427939,ok=427939,error=0, records=41
[WARN ] 2026-06-02 00:29:07.816 [24453] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:29:07.992 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:29:19.421 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 00:29:19.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427940,ok=427940,error=0, records=41
[WARN ] 2026-06-02 00:29:22.821 [24453] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:29:22.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:29:34.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 00:29:34.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427941,ok=427941,error=0, records=41
[WARN ] 2026-06-02 00:29:37.826 [24487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:29:37.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:29:49.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 00:29:49.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427942,ok=427942,error=0, records=41
[WARN ] 2026-06-02 00:29:52.831 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:29:52.994 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:30:01.558 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21412/300s
[INFO ] 2026-06-02 00:30:04.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 00:30:04.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427943,ok=427943,error=0, records=41
[WARN ] 2026-06-02 00:30:07.837 [24534] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:30:07.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:30:18.841 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21403/300s
[INFO ] 2026-06-02 00:30:19.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 00:30:19.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427944,ok=427944,error=0, records=41
[WARN ] 2026-06-02 00:30:22.842 [23894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:30:22.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:30:34.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 00:30:34.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427945,ok=427945,error=0, records=41
[WARN ] 2026-06-02 00:30:37.847 [24557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:30:37.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:30:42.427 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21412/300s
[INFO ] 2026-06-02 00:30:49.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 00:30:49.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427946,ok=427946,error=0, records=41
[INFO ] 2026-06-02 00:30:49.459 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21399/300s
[WARN ] 2026-06-02 00:30:52.853 [24543] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:30:52.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:31:04.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 00:31:04.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427947,ok=427947,error=0, records=41
[WARN ] 2026-06-02 00:31:07.859 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:31:07.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:31:19.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 00:31:19.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427948,ok=427948,error=0, records=41
[WARN ] 2026-06-02 00:31:22.865 [24501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:31:22.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:31:34.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 00:31:34.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427949,ok=427949,error=0, records=41
[WARN ] 2026-06-02 00:31:37.870 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:31:37.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:31:41.816 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21399/300s
[INFO ] 2026-06-02 00:31:43.295 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20861044},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:31:43.442 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:31:43.442 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 00:31:43.442 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:31:43.442 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:31:43.442 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:31:43.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:31:43.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:31:43.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:31:43.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:31:43.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:31:43.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:31:45.219 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21408/300s
[INFO ] 2026-06-02 00:31:49.480 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 00:31:49.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427950,ok=427950,error=0, records=41
[WARN ] 2026-06-02 00:31:52.875 [24571] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:31:52.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:32:04.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 00:32:04.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427951,ok=427951,error=0, records=41
[WARN ] 2026-06-02 00:32:07.880 [24645] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:32:08.000 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:32:08.000 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21411/300s
[INFO ] 2026-06-02 00:32:19.491 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 00:32:19.491 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427952,ok=427952,error=0, records=41
[WARN ] 2026-06-02 00:32:22.885 [24631] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:32:23.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:32:34.497 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 00:32:34.497 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427953,ok=427953,error=0, records=41
[WARN ] 2026-06-02 00:32:37.891 [24678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:32:38.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:32:49.502 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 00:32:49.502 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427954,ok=427954,error=0, records=41
[INFO ] 2026-06-02 00:32:51.867 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21409/300s
[WARN ] 2026-06-02 00:32:52.896 [24695] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:32:53.002 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:32:53.768 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21409/300s
[INFO ] 2026-06-02 00:33:00.873 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21409/300s
[INFO ] 2026-06-02 00:33:04.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 00:33:04.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427955,ok=427955,error=0, records=41
[WARN ] 2026-06-02 00:33:07.900 [24689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:33:08.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:33:19.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10145, records=41
[INFO ] 2026-06-02 00:33:19.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427956,ok=427956,error=0, records=41
[WARN ] 2026-06-02 00:33:22.906 [24706] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:33:23.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:33:34.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-02 00:33:34.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427957,ok=427957,error=0, records=41
[WARN ] 2026-06-02 00:33:37.912 [24733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:33:38.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 00:33:38.004 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 00:33:49.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-02 00:33:49.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427958,ok=427958,error=0, records=41
[WARN ] 2026-06-02 00:33:52.916 [24738] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:33:53.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:34:04.534 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 00:34:04.534 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427959,ok=427959,error=0, records=41
[WARN ] 2026-06-02 00:34:07.921 [24738] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:34:08.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:34:19.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 00:34:19.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427960,ok=427960,error=0, records=41
[WARN ] 2026-06-02 00:34:22.926 [24750] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:34:23.006 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:34:34.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 00:34:34.545 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427961,ok=427961,error=0, records=41
[WARN ] 2026-06-02 00:34:37.931 [24812] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:34:38.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:34:43.442 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17827/300s
[INFO ] 2026-06-02 00:34:43.444 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860956},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:34:43.613 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:34:43.613 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 00:34:43.613 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:34:43.613 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:34:43.613 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:34:43.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:34:43.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:34:43.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:34:43.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:34:43.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:34:43.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:34:49.550 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 00:34:49.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427962,ok=427962,error=0, records=41
[WARN ] 2026-06-02 00:34:52.936 [24812] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:34:53.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:35:01.561 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21413/300s
[INFO ] 2026-06-02 00:35:04.556 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-02 00:35:04.556 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427963,ok=427963,error=0, records=41
[WARN ] 2026-06-02 00:35:07.942 [24782] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:35:08.008 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:35:18.946 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21404/300s
[INFO ] 2026-06-02 00:35:19.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 00:35:19.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427964,ok=427964,error=0, records=41
[WARN ] 2026-06-02 00:35:22.948 [24850] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:35:23.008 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:35:34.566 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 00:35:34.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427965,ok=427965,error=0, records=41
[WARN ] 2026-06-02 00:35:37.953 [24845] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:35:38.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:35:42.433 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21413/300s
[INFO ] 2026-06-02 00:35:49.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 00:35:49.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427966,ok=427966,error=0, records=41
[INFO ] 2026-06-02 00:35:49.634 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21400/300s
[WARN ] 2026-06-02 00:35:52.958 [24845] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:35:53.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:36:04.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-02 00:36:04.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427967,ok=427967,error=0, records=41
[WARN ] 2026-06-02 00:36:07.964 [24850] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:36:08.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:36:19.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-02 00:36:19.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427968,ok=427968,error=0, records=41
[WARN ] 2026-06-02 00:36:22.969 [24845] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:36:23.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:36:34.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-02 00:36:34.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427969,ok=427969,error=0, records=41
[WARN ] 2026-06-02 00:36:37.974 [24845] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:36:38.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:36:41.994 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21400/300s
[INFO ] 2026-06-02 00:36:45.273 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21409/300s
[INFO ] 2026-06-02 00:36:49.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 00:36:49.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427970,ok=427970,error=0, records=41
[WARN ] 2026-06-02 00:36:52.979 [24850] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:36:53.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:37:04.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 00:37:04.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427971,ok=427971,error=0, records=41
[WARN ] 2026-06-02 00:37:07.984 [24855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:37:08.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:37:08.012 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21412/300s
[INFO ] 2026-06-02 00:37:19.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 00:37:19.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427972,ok=427972,error=0, records=41
[WARN ] 2026-06-02 00:37:22.989 [24850] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:37:23.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:37:34.673 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10723, records=44
[INFO ] 2026-06-02 00:37:34.673 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427973,ok=427973,error=0, records=44
[WARN ] 2026-06-02 00:37:37.994 [24923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:37:38.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:37:43.615 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860884},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:37:43.774 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:37:43.774 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-02 00:37:43.774 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:37:43.774 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:37:43.774 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:37:43.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:37:43.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:37:43.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:37:43.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:37:43.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:37:43.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:37:49.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 00:37:49.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427974,ok=427974,error=0, records=41
[INFO ] 2026-06-02 00:37:51.909 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21410/300s
[WARN ] 2026-06-02 00:37:52.999 [24845] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:37:53.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:37:53.810 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21410/300s
[INFO ] 2026-06-02 00:38:00.917 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21410/300s
[INFO ] 2026-06-02 00:38:04.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 00:38:04.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427975,ok=427975,error=0, records=41
[WARN ] 2026-06-02 00:38:08.004 [24951] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:38:08.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:38:19.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 00:38:19.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427976,ok=427976,error=0, records=41
[WARN ] 2026-06-02 00:38:23.009 [25020] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:38:23.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:38:34.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 00:38:34.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427977,ok=427977,error=0, records=41
[WARN ] 2026-06-02 00:38:38.013 [25034] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:38:38.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:38:49.759 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 00:38:49.759 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427978,ok=427978,error=0, records=41
[INFO ] 2026-06-02 00:38:53.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:38:53.016 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-02 00:38:53.019 [24951] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:39:04.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 00:39:04.765 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427979,ok=427979,error=0, records=41
[INFO ] 2026-06-02 00:39:08.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:39:08.023 [25020] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:39:19.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 00:39:19.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427980,ok=427980,error=0, records=41
[INFO ] 2026-06-02 00:39:23.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:39:23.028 [25062] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:39:34.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 00:39:34.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427981,ok=427981,error=0, records=41
[INFO ] 2026-06-02 00:39:38.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:39:38.033 [25090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:39:49.782 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 00:39:49.782 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427982,ok=427982,error=0, records=41
[INFO ] 2026-06-02 00:39:53.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:39:53.039 [24993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:40:01.564 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21414/300s
[INFO ] 2026-06-02 00:40:04.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 00:40:04.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427983,ok=427983,error=0, records=41
[INFO ] 2026-06-02 00:40:08.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:40:08.046 [25100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:40:19.050 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21405/300s
[INFO ] 2026-06-02 00:40:19.796 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 00:40:19.796 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427984,ok=427984,error=0, records=41
[INFO ] 2026-06-02 00:40:23.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 00:40:23.053 [25145] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:40:34.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 00:40:34.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427985,ok=427985,error=0, records=41
[WARN ] 2026-06-02 00:40:37.558 [25139] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:40:38.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:40:42.440 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21414/300s
[INFO ] 2026-06-02 00:40:43.774 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17828/300s
[INFO ] 2026-06-02 00:40:43.776 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860800},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:40:43.937 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:40:43.937 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 00:40:43.937 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:40:43.937 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:40:43.937 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:40:43.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:40:43.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:40:43.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:40:43.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:40:43.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:40:43.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:40:49.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 00:40:49.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427986,ok=427986,error=0, records=41
[INFO ] 2026-06-02 00:40:49.809 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21401/300s
[WARN ] 2026-06-02 00:40:52.563 [25181] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:40:53.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:41:04.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 00:41:04.814 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427987,ok=427987,error=0, records=41
[WARN ] 2026-06-02 00:41:07.568 [25161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:41:08.023 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:41:19.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 00:41:19.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427988,ok=427988,error=0, records=41
[WARN ] 2026-06-02 00:41:22.573 [25208] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:41:23.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:41:34.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 00:41:34.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427989,ok=427989,error=0, records=41
[WARN ] 2026-06-02 00:41:37.578 [25238] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:41:38.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:41:42.180 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21401/300s
[INFO ] 2026-06-02 00:41:45.331 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21410/300s
[INFO ] 2026-06-02 00:41:49.832 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-02 00:41:49.832 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427990,ok=427990,error=0, records=41
[WARN ] 2026-06-02 00:41:52.583 [25254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:41:53.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:42:04.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 00:42:04.839 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427991,ok=427991,error=0, records=41
[WARN ] 2026-06-02 00:42:07.590 [25254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:42:08.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:42:08.026 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21413/300s
[INFO ] 2026-06-02 00:42:19.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 00:42:19.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427992,ok=427992,error=0, records=41
[WARN ] 2026-06-02 00:42:22.595 [25250] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:42:23.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:42:34.851 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 00:42:34.851 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427993,ok=427993,error=0, records=41
[WARN ] 2026-06-02 00:42:37.601 [25180] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:42:38.027 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:42:49.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 00:42:49.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427994,ok=427994,error=0, records=41
[INFO ] 2026-06-02 00:42:51.978 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21411/300s
[WARN ] 2026-06-02 00:42:52.605 [25180] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:42:53.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:42:53.880 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21411/300s
[INFO ] 2026-06-02 00:43:00.987 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21411/300s
[INFO ] 2026-06-02 00:43:04.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 00:43:04.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427995,ok=427995,error=0, records=41
[WARN ] 2026-06-02 00:43:07.610 [25254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:43:08.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:43:19.871 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 00:43:19.871 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427996,ok=427996,error=0, records=41
[WARN ] 2026-06-02 00:43:22.616 [25231] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:43:23.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:43:34.878 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 00:43:34.878 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427997,ok=427997,error=0, records=41
[WARN ] 2026-06-02 00:43:37.621 [25277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:43:38.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 00:43:38.030 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 00:43:43.939 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860728},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:43:44.103 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:43:44.103 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 00:43:44.103 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:43:44.103 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:43:44.103 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:43:44.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:43:44.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:43:44.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:43:44.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:43:44.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:43:44.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:43:49.884 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 00:43:49.884 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427998,ok=427998,error=0, records=41
[WARN ] 2026-06-02 00:43:52.626 [25254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:43:53.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:44:04.888 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 00:44:04.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=427999,ok=427999,error=0, records=41
[WARN ] 2026-06-02 00:44:07.631 [25283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:44:08.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:44:19.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 00:44:19.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428000,ok=428000,error=0, records=41
[WARN ] 2026-06-02 00:44:22.636 [25283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:44:23.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:44:34.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 00:44:34.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428001,ok=428001,error=0, records=41
[WARN ] 2026-06-02 00:44:37.641 [25283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:44:38.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:44:49.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 00:44:49.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428002,ok=428002,error=0, records=41
[WARN ] 2026-06-02 00:44:52.646 [25277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:44:53.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:45:01.567 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21415/300s
[INFO ] 2026-06-02 00:45:04.912 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 00:45:04.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428003,ok=428003,error=0, records=41
[WARN ] 2026-06-02 00:45:07.652 [25283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:45:08.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:45:19.155 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21406/300s
[INFO ] 2026-06-02 00:45:19.918 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 00:45:19.918 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428004,ok=428004,error=0, records=41
[WARN ] 2026-06-02 00:45:22.657 [25254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:45:23.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:45:34.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 00:45:34.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428005,ok=428005,error=0, records=41
[WARN ] 2026-06-02 00:45:37.662 [25231] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:45:38.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:45:42.447 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21415/300s
[INFO ] 2026-06-02 00:45:49.929 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 00:45:49.929 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428006,ok=428006,error=0, records=41
[INFO ] 2026-06-02 00:45:49.929 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21402/300s
[WARN ] 2026-06-02 00:45:52.665 [25283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:45:53.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:46:04.934 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 00:46:04.934 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428007,ok=428007,error=0, records=41
[WARN ] 2026-06-02 00:46:07.669 [25231] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:46:08.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:46:19.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 00:46:19.950 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428008,ok=428008,error=0, records=41
[WARN ] 2026-06-02 00:46:22.676 [25180] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:46:23.037 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:46:34.956 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 00:46:34.956 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428009,ok=428009,error=0, records=41
[WARN ] 2026-06-02 00:46:37.680 [25277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:46:38.037 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:46:42.365 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21402/300s
[INFO ] 2026-06-02 00:46:44.103 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17829/300s
[INFO ] 2026-06-02 00:46:44.105 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860652},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:46:44.280 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:46:44.280 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 00:46:44.281 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:46:44.281 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:46:44.281 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:46:44.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:46:44.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:46:44.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:46:44.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:46:44.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:46:44.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:46:45.387 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21411/300s
[INFO ] 2026-06-02 00:46:49.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 00:46:49.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428010,ok=428010,error=0, records=41
[WARN ] 2026-06-02 00:46:52.686 [25231] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:46:53.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:47:04.967 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 00:47:04.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428011,ok=428011,error=0, records=41
[WARN ] 2026-06-02 00:47:07.692 [25254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:47:08.039 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:47:08.039 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21414/300s
[INFO ] 2026-06-02 00:47:19.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 00:47:19.973 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428012,ok=428012,error=0, records=41
[WARN ] 2026-06-02 00:47:22.698 [25277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:47:23.039 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:47:34.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 00:47:34.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428013,ok=428013,error=0, records=41
[WARN ] 2026-06-02 00:47:37.704 [25283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:47:38.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:47:49.982 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 00:47:49.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428014,ok=428014,error=0, records=41
[INFO ] 2026-06-02 00:47:52.048 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21412/300s
[WARN ] 2026-06-02 00:47:52.709 [25231] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:47:53.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:47:53.950 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21412/300s
[INFO ] 2026-06-02 00:48:01.056 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21412/300s
[INFO ] 2026-06-02 00:48:04.987 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 00:48:04.987 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428015,ok=428015,error=0, records=41
[WARN ] 2026-06-02 00:48:07.714 [25277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:48:08.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:48:19.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 00:48:19.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428016,ok=428016,error=0, records=41
[WARN ] 2026-06-02 00:48:22.719 [25231] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:48:23.042 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:48:34.999 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 00:48:34.999 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428017,ok=428017,error=0, records=41
[WARN ] 2026-06-02 00:48:37.726 [25283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:48:38.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:48:50.005 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 00:48:50.005 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428018,ok=428018,error=0, records=41
[WARN ] 2026-06-02 00:48:52.731 [25254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:48:53.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:49:05.011 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-06-02 00:49:05.011 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428019,ok=428019,error=0, records=41
[WARN ] 2026-06-02 00:49:07.735 [25180] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:49:08.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:49:20.018 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-02 00:49:20.018 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428020,ok=428020,error=0, records=41
[WARN ] 2026-06-02 00:49:22.741 [25231] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:49:23.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:49:35.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 00:49:35.023 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428021,ok=428021,error=0, records=41
[WARN ] 2026-06-02 00:49:37.746 [25277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:49:38.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:49:44.282 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860568},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:49:44.443 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:49:44.443 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 00:49:44.443 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:49:44.443 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:49:44.443 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:49:44.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:49:44.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:49:44.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:49:44.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:49:44.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:49:44.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:49:50.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-02 00:49:50.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428022,ok=428022,error=0, records=41
[WARN ] 2026-06-02 00:49:52.752 [25254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:49:53.046 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:50:01.571 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21416/300s
[INFO ] 2026-06-02 00:50:05.034 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 00:50:05.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428023,ok=428023,error=0, records=41
[WARN ] 2026-06-02 00:50:07.758 [25277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:50:08.046 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:50:19.262 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21407/300s
[INFO ] 2026-06-02 00:50:20.042 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 00:50:20.042 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428024,ok=428024,error=0, records=41
[WARN ] 2026-06-02 00:50:22.763 [25254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:50:23.047 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:50:35.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 00:50:35.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428025,ok=428025,error=0, records=41
[WARN ] 2026-06-02 00:50:37.769 [25277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:50:38.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:50:42.453 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21416/300s
[INFO ] 2026-06-02 00:50:50.140 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 00:50:50.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428026,ok=428026,error=0, records=41
[INFO ] 2026-06-02 00:50:50.140 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21403/300s
[WARN ] 2026-06-02 00:50:52.773 [25277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:50:53.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:51:05.145 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 00:51:05.145 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428027,ok=428027,error=0, records=41
[WARN ] 2026-06-02 00:51:07.778 [25231] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:51:08.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:51:20.151 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 00:51:20.151 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428028,ok=428028,error=0, records=41
[WARN ] 2026-06-02 00:51:22.785 [25283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:51:23.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:51:35.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 00:51:35.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428029,ok=428029,error=0, records=41
[WARN ] 2026-06-02 00:51:37.790 [25254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:51:38.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:51:42.546 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21403/300s
[INFO ] 2026-06-02 00:51:45.445 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21412/300s
[INFO ] 2026-06-02 00:51:50.163 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 00:51:50.163 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428030,ok=428030,error=0, records=41
[WARN ] 2026-06-02 00:51:52.795 [25180] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:51:53.051 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:52:05.168 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-02 00:52:05.168 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428031,ok=428031,error=0, records=41
[WARN ] 2026-06-02 00:52:07.799 [25180] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:52:08.052 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:52:08.052 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21415/300s
[INFO ] 2026-06-02 00:52:20.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11134, records=46
[INFO ] 2026-06-02 00:52:20.175 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428032,ok=428032,error=0, records=46
[WARN ] 2026-06-02 00:52:22.805 [25283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:52:23.052 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:52:35.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-02 00:52:35.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428033,ok=428033,error=0, records=41
[WARN ] 2026-06-02 00:52:37.810 [25828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:52:38.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:52:44.443 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17830/300s
[INFO ] 2026-06-02 00:52:44.445 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860492},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:52:44.599 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:52:44.599 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 00:52:44.599 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:52:44.599 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:52:44.599 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:52:44.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:52:44.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:52:44.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:52:44.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:52:44.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:52:44.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:52:50.200 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-02 00:52:50.200 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428034,ok=428034,error=0, records=41
[INFO ] 2026-06-02 00:52:52.115 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21413/300s
[WARN ] 2026-06-02 00:52:52.815 [25828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:52:53.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:52:54.017 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21413/300s
[INFO ] 2026-06-02 00:53:01.123 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21413/300s
[INFO ] 2026-06-02 00:53:05.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 00:53:05.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428035,ok=428035,error=0, records=41
[WARN ] 2026-06-02 00:53:07.820 [25858] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:53:08.054 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:53:20.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 00:53:20.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428036,ok=428036,error=0, records=41
[WARN ] 2026-06-02 00:53:22.826 [25844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:53:23.055 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:53:35.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 00:53:35.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428037,ok=428037,error=0, records=41
[WARN ] 2026-06-02 00:53:37.831 [25858] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:53:38.055 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 00:53:38.055 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 00:53:50.225 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 00:53:50.225 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428038,ok=428038,error=0, records=41
[WARN ] 2026-06-02 00:53:52.837 [25858] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:53:53.056 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:53:53.056 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 00:54:05.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-02 00:54:05.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428039,ok=428039,error=0, records=41
[WARN ] 2026-06-02 00:54:07.843 [25886] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:54:08.058 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:54:20.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-02 00:54:20.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428040,ok=428040,error=0, records=41
[WARN ] 2026-06-02 00:54:22.849 [25828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:54:23.058 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:54:35.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-02 00:54:35.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428041,ok=428041,error=0, records=41
[WARN ] 2026-06-02 00:54:37.854 [25923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:54:38.059 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:54:50.250 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-02 00:54:50.250 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428042,ok=428042,error=0, records=41
[WARN ] 2026-06-02 00:54:52.859 [25828] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:54:53.059 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:55:01.574 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21417/300s
[INFO ] 2026-06-02 00:55:05.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 00:55:05.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428043,ok=428043,error=0, records=41
[WARN ] 2026-06-02 00:55:07.864 [25872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:55:08.060 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:55:19.368 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21408/300s
[INFO ] 2026-06-02 00:55:20.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 00:55:20.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428044,ok=428044,error=0, records=41
[WARN ] 2026-06-02 00:55:22.869 [25979] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:55:23.061 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:55:35.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 00:55:35.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428045,ok=428045,error=0, records=41
[WARN ] 2026-06-02 00:55:37.876 [25872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:55:38.061 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:55:42.460 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21417/300s
[INFO ] 2026-06-02 00:55:44.600 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860416},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:55:44.767 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:55:44.768 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 00:55:44.768 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:55:44.768 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:55:44.768 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:55:44.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:55:44.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:55:44.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:55:44.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:55:44.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:55:44.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:55:50.320 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 00:55:50.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428046,ok=428046,error=0, records=41
[INFO ] 2026-06-02 00:55:50.320 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21404/300s
[WARN ] 2026-06-02 00:55:52.882 [26003] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:55:53.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:56:05.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-02 00:56:05.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428047,ok=428047,error=0, records=41
[WARN ] 2026-06-02 00:56:07.889 [26026] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:56:08.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:56:20.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10379, records=41
[INFO ] 2026-06-02 00:56:20.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428048,ok=428048,error=0, records=41
[WARN ] 2026-06-02 00:56:22.893 [26025] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:56:23.063 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:56:35.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-02 00:56:35.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428049,ok=428049,error=0, records=41
[WARN ] 2026-06-02 00:56:37.898 [26041] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:56:38.063 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:56:42.729 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21404/300s
[INFO ] 2026-06-02 00:56:45.497 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21413/300s
[INFO ] 2026-06-02 00:56:50.341 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-02 00:56:50.341 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428050,ok=428050,error=0, records=41
[WARN ] 2026-06-02 00:56:52.903 [26059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:56:53.064 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:57:05.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 00:57:05.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428051,ok=428051,error=0, records=41
[WARN ] 2026-06-02 00:57:07.910 [26059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:57:08.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:57:08.065 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21416/300s
[INFO ] 2026-06-02 00:57:20.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 00:57:20.355 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428052,ok=428052,error=0, records=41
[WARN ] 2026-06-02 00:57:22.915 [26105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:57:23.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:57:35.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-02 00:57:35.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428053,ok=428053,error=0, records=41
[WARN ] 2026-06-02 00:57:37.921 [26121] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:57:38.066 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:57:50.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-02 00:57:50.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428054,ok=428054,error=0, records=41
[INFO ] 2026-06-02 00:57:52.159 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21414/300s
[WARN ] 2026-06-02 00:57:52.926 [26106] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:57:53.067 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:57:54.060 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21414/300s
[INFO ] 2026-06-02 00:58:01.166 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21414/300s
[INFO ] 2026-06-02 00:58:05.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 00:58:05.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428055,ok=428055,error=0, records=41
[WARN ] 2026-06-02 00:58:07.932 [26106] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:58:08.067 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:58:20.377 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 00:58:20.377 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428056,ok=428056,error=0, records=41
[WARN ] 2026-06-02 00:58:22.937 [26177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:58:23.068 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:58:35.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 00:58:35.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428057,ok=428057,error=0, records=41
[WARN ] 2026-06-02 00:58:37.943 [26147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:58:38.068 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:58:44.768 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17831/300s
[INFO ] 2026-06-02 00:58:44.769 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860344},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 00:58:44.922 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 00:58:44.922 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 00:58:44.922 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 00:58:44.922 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 00:58:44.922 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 00:58:44.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:58:44.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 00:58:44.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:58:44.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 00:58:44.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:58:44.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 00:58:50.388 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 00:58:50.388 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428058,ok=428058,error=0, records=41
[WARN ] 2026-06-02 00:58:52.948 [26177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:58:53.069 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:59:05.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=12149, records=49
[INFO ] 2026-06-02 00:59:05.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428059,ok=428059,error=0, records=49
[WARN ] 2026-06-02 00:59:07.953 [26177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:59:08.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:59:20.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 00:59:20.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428060,ok=428060,error=0, records=41
[WARN ] 2026-06-02 00:59:22.958 [26147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:59:23.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:59:35.405 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 00:59:35.405 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428061,ok=428061,error=0, records=41
[WARN ] 2026-06-02 00:59:37.963 [26220] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:59:38.071 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 00:59:50.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 00:59:50.413 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428062,ok=428062,error=0, records=41
[WARN ] 2026-06-02 00:59:52.967 [26262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 00:59:53.071 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:00:01.577 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21418/300s
[INFO ] 2026-06-02 01:00:05.418 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-02 01:00:05.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428063,ok=428063,error=0, records=41
[WARN ] 2026-06-02 01:00:07.972 [26262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:00:08.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:00:19.476 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21409/300s
[INFO ] 2026-06-02 01:00:20.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-02 01:00:20.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428064,ok=428064,error=0, records=41
[WARN ] 2026-06-02 01:00:22.977 [26262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:00:23.073 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:00:35.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-02 01:00:35.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428065,ok=428065,error=0, records=41
[WARN ] 2026-06-02 01:00:37.982 [26262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:00:38.073 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:00:42.466 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21418/300s
[INFO ] 2026-06-02 01:00:50.435 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-02 01:00:50.435 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428066,ok=428066,error=0, records=41
[INFO ] 2026-06-02 01:00:50.435 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21405/300s
[WARN ] 2026-06-02 01:00:52.987 [26262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:00:53.074 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:01:05.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 01:01:05.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428067,ok=428067,error=0, records=41
[WARN ] 2026-06-02 01:01:07.993 [26234] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:01:08.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:01:20.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 01:01:20.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428068,ok=428068,error=0, records=41
[WARN ] 2026-06-02 01:01:22.999 [26177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:01:23.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:01:35.457 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 01:01:35.457 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428069,ok=428069,error=0, records=41
[WARN ] 2026-06-02 01:01:38.004 [26234] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:01:38.076 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:01:42.912 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21405/300s
[INFO ] 2026-06-02 01:01:44.924 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860252},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:01:45.108 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:01:45.108 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 01:01:45.109 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:01:45.109 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:01:45.109 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:01:45.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:01:45.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:01:45.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:01:45.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:01:45.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:01:45.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:01:45.553 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21414/300s
[INFO ] 2026-06-02 01:01:50.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 01:01:50.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428070,ok=428070,error=0, records=41
[WARN ] 2026-06-02 01:01:53.008 [26234] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:01:53.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:02:05.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10406, records=41
[INFO ] 2026-06-02 01:02:05.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428071,ok=428071,error=0, records=41
[WARN ] 2026-06-02 01:02:08.013 [26177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:02:08.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:02:08.077 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21417/300s
[INFO ] 2026-06-02 01:02:20.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 01:02:20.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428072,ok=428072,error=0, records=41
[WARN ] 2026-06-02 01:02:23.017 [26177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:02:23.078 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:02:35.482 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-02 01:02:35.482 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428073,ok=428073,error=0, records=41
[WARN ] 2026-06-02 01:02:38.022 [26399] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:02:38.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:02:50.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10379, records=41
[INFO ] 2026-06-02 01:02:50.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428074,ok=428074,error=0, records=41
[INFO ] 2026-06-02 01:02:52.215 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21415/300s
[WARN ] 2026-06-02 01:02:53.026 [26234] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:02:53.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:02:54.117 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21415/300s
[INFO ] 2026-06-02 01:03:01.223 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21415/300s
[INFO ] 2026-06-02 01:03:05.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 01:03:05.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428075,ok=428075,error=0, records=41
[WARN ] 2026-06-02 01:03:08.031 [26456] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:03:08.080 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:03:20.501 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-02 01:03:20.501 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428076,ok=428076,error=0, records=41
[WARN ] 2026-06-02 01:03:23.036 [26483] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:03:23.080 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:03:35.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 01:03:35.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428077,ok=428077,error=0, records=41
[WARN ] 2026-06-02 01:03:38.042 [26234] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:03:38.081 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 01:03:38.081 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 01:03:50.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 01:03:50.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428078,ok=428078,error=0, records=41
[WARN ] 2026-06-02 01:03:53.047 [26518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:03:53.082 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:04:05.585 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-02 01:04:05.585 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428079,ok=428079,error=0, records=41
[WARN ] 2026-06-02 01:04:08.051 [26512] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:04:08.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:04:20.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 01:04:20.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428080,ok=428080,error=0, records=41
[WARN ] 2026-06-02 01:04:22.557 [26552] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:04:23.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:04:35.596 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-02 01:04:35.596 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428081,ok=428081,error=0, records=41
[WARN ] 2026-06-02 01:04:37.561 [26569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:04:38.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:04:45.109 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17832/300s
[INFO ] 2026-06-02 01:04:45.110 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860172},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:04:45.263 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:04:45.263 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 01:04:45.264 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:04:45.264 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:04:45.264 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:04:45.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:04:45.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:04:45.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:04:45.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:04:45.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:04:45.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:04:50.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 01:04:50.601 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428082,ok=428082,error=0, records=41
[WARN ] 2026-06-02 01:04:52.565 [26517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:04:53.084 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:05:01.581 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21419/300s
[INFO ] 2026-06-02 01:05:05.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-02 01:05:05.609 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428083,ok=428083,error=0, records=41
[WARN ] 2026-06-02 01:05:07.572 [26612] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:05:08.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:05:19.576 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21410/300s
[INFO ] 2026-06-02 01:05:20.614 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-02 01:05:20.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428084,ok=428084,error=0, records=41
[WARN ] 2026-06-02 01:05:22.578 [26595] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:05:23.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:05:35.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-02 01:05:35.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428085,ok=428085,error=0, records=41
[WARN ] 2026-06-02 01:05:37.583 [26640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:05:38.086 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:05:42.472 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21419/300s
[INFO ] 2026-06-02 01:05:50.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-02 01:05:50.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428086,ok=428086,error=0, records=41
[INFO ] 2026-06-02 01:05:50.625 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21406/300s
[WARN ] 2026-06-02 01:05:52.589 [26661] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:05:53.086 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:06:05.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 01:06:05.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428087,ok=428087,error=0, records=41
[WARN ] 2026-06-02 01:06:07.596 [26647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:06:08.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:06:20.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 01:06:20.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428088,ok=428088,error=0, records=41
[WARN ] 2026-06-02 01:06:22.600 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:06:23.088 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:06:35.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 01:06:35.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428089,ok=428089,error=0, records=41
[WARN ] 2026-06-02 01:06:37.605 [26640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:06:38.088 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:06:43.094 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21406/300s
[INFO ] 2026-06-02 01:06:45.607 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21415/300s
[INFO ] 2026-06-02 01:06:50.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 01:06:50.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428090,ok=428090,error=0, records=41
[WARN ] 2026-06-02 01:06:52.609 [26658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:06:53.089 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:07:05.655 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-02 01:07:05.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428091,ok=428091,error=0, records=41
[WARN ] 2026-06-02 01:07:07.614 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:07:08.090 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:07:08.090 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21418/300s
[INFO ] 2026-06-02 01:07:20.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 01:07:20.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428092,ok=428092,error=0, records=41
[WARN ] 2026-06-02 01:07:22.619 [26666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:07:23.090 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:07:35.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 01:07:35.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428093,ok=428093,error=0, records=41
[WARN ] 2026-06-02 01:07:37.624 [26640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:07:38.091 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:07:45.265 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860096},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:07:45.434 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:07:45.435 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 01:07:45.435 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:07:45.435 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:07:45.435 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:07:45.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:07:45.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:07:45.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:07:45.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:07:45.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:07:45.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:07:50.674 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 01:07:50.674 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428094,ok=428094,error=0, records=41
[INFO ] 2026-06-02 01:07:52.270 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21416/300s
[WARN ] 2026-06-02 01:07:52.628 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:07:53.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:07:54.171 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21416/300s
[INFO ] 2026-06-02 01:08:01.277 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21416/300s
[INFO ] 2026-06-02 01:08:05.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-02 01:08:05.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428095,ok=428095,error=0, records=41
[WARN ] 2026-06-02 01:08:07.632 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:08:08.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:08:20.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 01:08:20.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428096,ok=428096,error=0, records=41
[WARN ] 2026-06-02 01:08:22.637 [26658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:08:23.093 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:08:35.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 01:08:35.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428097,ok=428097,error=0, records=41
[WARN ] 2026-06-02 01:08:37.641 [26647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:08:38.093 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:08:50.698 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 01:08:50.698 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428098,ok=428098,error=0, records=41
[WARN ] 2026-06-02 01:08:52.645 [26640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:08:53.094 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:08:53.094 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 01:09:05.703 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-02 01:09:05.703 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428099,ok=428099,error=0, records=41
[WARN ] 2026-06-02 01:09:07.649 [26647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:09:08.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:09:20.710 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 01:09:20.710 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428100,ok=428100,error=0, records=41
[WARN ] 2026-06-02 01:09:22.656 [26666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:09:23.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:09:35.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 01:09:35.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428101,ok=428101,error=0, records=41
[WARN ] 2026-06-02 01:09:37.661 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:09:38.097 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:09:50.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 01:09:50.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428102,ok=428102,error=0, records=41
[WARN ] 2026-06-02 01:09:52.667 [26640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:09:53.097 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:10:01.584 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21420/300s
[INFO ] 2026-06-02 01:10:05.726 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 01:10:05.726 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428103,ok=428103,error=0, records=41
[WARN ] 2026-06-02 01:10:07.671 [26658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:10:08.098 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:10:19.674 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21411/300s
[INFO ] 2026-06-02 01:10:20.731 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 01:10:20.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428104,ok=428104,error=0, records=41
[WARN ] 2026-06-02 01:10:22.675 [26640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:10:23.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:10:35.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 01:10:35.737 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428105,ok=428105,error=0, records=41
[WARN ] 2026-06-02 01:10:37.680 [26666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:10:38.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:10:42.478 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21420/300s
[INFO ] 2026-06-02 01:10:45.435 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17833/300s
[INFO ] 2026-06-02 01:10:45.436 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20860020},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:10:45.606 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:10:45.606 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 01:10:45.606 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:10:45.606 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:10:45.606 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:10:45.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:10:45.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:10:45.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:10:45.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:10:45.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:10:45.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:10:50.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 01:10:50.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428106,ok=428106,error=0, records=41
[INFO ] 2026-06-02 01:10:50.743 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21407/300s
[WARN ] 2026-06-02 01:10:52.685 [26647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:10:53.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:11:05.755 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 01:11:05.755 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428107,ok=428107,error=0, records=41
[WARN ] 2026-06-02 01:11:07.690 [26647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:11:08.101 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:11:20.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-02 01:11:20.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428108,ok=428108,error=0, records=41
[WARN ] 2026-06-02 01:11:22.694 [26640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:11:23.101 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:11:35.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 01:11:35.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428109,ok=428109,error=0, records=41
[WARN ] 2026-06-02 01:11:37.701 [26666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:11:38.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:11:43.272 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21407/300s
[INFO ] 2026-06-02 01:11:45.665 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21416/300s
[INFO ] 2026-06-02 01:11:50.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-02 01:11:50.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428110,ok=428110,error=0, records=41
[WARN ] 2026-06-02 01:11:52.706 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:11:53.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:12:05.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-02 01:12:05.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428111,ok=428111,error=0, records=41
[WARN ] 2026-06-02 01:12:07.712 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:12:08.103 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:12:08.103 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21419/300s
[INFO ] 2026-06-02 01:12:20.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-02 01:12:20.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428112,ok=428112,error=0, records=41
[WARN ] 2026-06-02 01:12:22.717 [26658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:12:23.104 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:12:35.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-02 01:12:35.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428113,ok=428113,error=0, records=41
[WARN ] 2026-06-02 01:12:37.723 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:12:38.104 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:12:50.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-02 01:12:50.809 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428114,ok=428114,error=0, records=41
[INFO ] 2026-06-02 01:12:52.321 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21417/300s
[WARN ] 2026-06-02 01:12:52.727 [26647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:12:53.105 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:12:54.223 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21417/300s
[INFO ] 2026-06-02 01:13:01.330 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21417/300s
[INFO ] 2026-06-02 01:13:05.817 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-02 01:13:05.817 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428115,ok=428115,error=0, records=41
[WARN ] 2026-06-02 01:13:07.731 [26647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:13:08.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:13:20.826 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10380, records=41
[INFO ] 2026-06-02 01:13:20.826 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428116,ok=428116,error=0, records=41
[WARN ] 2026-06-02 01:13:22.736 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:13:23.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:13:35.833 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-02 01:13:35.833 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428117,ok=428117,error=0, records=41
[WARN ] 2026-06-02 01:13:37.741 [26647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:13:38.107 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 01:13:38.107 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 01:13:45.608 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859924},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:13:45.760 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:13:45.760 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 01:13:45.760 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:13:45.760 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:13:45.760 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:13:45.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:13:45.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:13:45.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:13:45.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:13:45.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:13:45.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:13:50.838 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-02 01:13:50.838 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428118,ok=428118,error=0, records=41
[WARN ] 2026-06-02 01:13:52.747 [26647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:13:53.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:14:05.843 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-02 01:14:05.843 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428119,ok=428119,error=0, records=41
[WARN ] 2026-06-02 01:14:07.752 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:14:08.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:14:20.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 01:14:20.850 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428120,ok=428120,error=0, records=41
[WARN ] 2026-06-02 01:14:22.758 [26666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:14:23.109 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:14:35.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 01:14:35.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428121,ok=428121,error=0, records=41
[WARN ] 2026-06-02 01:14:37.762 [26647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:14:38.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:14:50.860 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 01:14:50.860 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428122,ok=428122,error=0, records=41
[WARN ] 2026-06-02 01:14:52.767 [26640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:14:53.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:15:01.588 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21421/300s
[INFO ] 2026-06-02 01:15:05.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 01:15:05.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428123,ok=428123,error=0, records=41
[WARN ] 2026-06-02 01:15:07.771 [26658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:15:08.111 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:15:19.775 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21412/300s
[INFO ] 2026-06-02 01:15:20.873 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 01:15:20.873 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428124,ok=428124,error=0, records=41
[WARN ] 2026-06-02 01:15:22.777 [26640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:15:23.112 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:15:35.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 01:15:35.878 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428125,ok=428125,error=0, records=41
[WARN ] 2026-06-02 01:15:37.782 [26658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:15:38.112 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:15:42.484 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21421/300s
[INFO ] 2026-06-02 01:15:50.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 01:15:50.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428126,ok=428126,error=0, records=41
[INFO ] 2026-06-02 01:15:50.969 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21408/300s
[WARN ] 2026-06-02 01:15:52.787 [26666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:15:53.113 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:16:05.975 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 01:16:05.975 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428127,ok=428127,error=0, records=41
[WARN ] 2026-06-02 01:16:07.792 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:16:08.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:16:20.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 01:16:20.981 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428128,ok=428128,error=0, records=41
[WARN ] 2026-06-02 01:16:22.797 [26666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:16:23.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:16:35.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 01:16:35.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428129,ok=428129,error=0, records=41
[WARN ] 2026-06-02 01:16:37.804 [26666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:16:38.115 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:16:43.456 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21408/300s
[INFO ] 2026-06-02 01:16:45.720 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21417/300s
[INFO ] 2026-06-02 01:16:45.760 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17834/300s
[INFO ] 2026-06-02 01:16:45.762 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859848},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:16:45.934 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:16:45.934 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 01:16:45.934 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:16:45.934 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:16:45.935 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:16:45.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:16:45.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:16:45.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:16:45.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:16:45.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:16:45.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:16:51.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 01:16:51.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428130,ok=428130,error=0, records=41
[WARN ] 2026-06-02 01:16:52.809 [27256] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:16:53.116 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:17:06.034 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 01:17:06.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428131,ok=428131,error=0, records=41
[WARN ] 2026-06-02 01:17:07.814 [27265] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:17:08.116 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:17:08.116 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21420/300s
[INFO ] 2026-06-02 01:17:21.041 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 01:17:21.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428132,ok=428132,error=0, records=41
[WARN ] 2026-06-02 01:17:22.820 [27271] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:17:23.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:17:36.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 01:17:36.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428133,ok=428133,error=0, records=41
[WARN ] 2026-06-02 01:17:37.825 [27271] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:17:38.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:17:51.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 01:17:51.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428134,ok=428134,error=0, records=41
[INFO ] 2026-06-02 01:17:52.380 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21418/300s
[WARN ] 2026-06-02 01:17:52.831 [27265] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:17:53.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:17:54.282 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21418/300s
[INFO ] 2026-06-02 01:18:01.388 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21418/300s
[INFO ] 2026-06-02 01:18:06.058 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 01:18:06.058 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428135,ok=428135,error=0, records=41
[WARN ] 2026-06-02 01:18:07.837 [27326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:18:08.119 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:18:21.064 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 01:18:21.064 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428136,ok=428136,error=0, records=41
[WARN ] 2026-06-02 01:18:22.843 [27335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:18:23.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:18:36.070 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 01:18:36.070 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428137,ok=428137,error=0, records=41
[WARN ] 2026-06-02 01:18:37.848 [27299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:18:38.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:18:51.075 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 01:18:51.075 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428138,ok=428138,error=0, records=41
[WARN ] 2026-06-02 01:18:52.852 [27335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:18:53.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:19:06.177 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 01:19:06.177 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428139,ok=428139,error=0, records=41
[WARN ] 2026-06-02 01:19:07.858 [27335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:19:08.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:19:21.183 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 01:19:21.183 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428140,ok=428140,error=0, records=41
[WARN ] 2026-06-02 01:19:22.863 [27390] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:19:23.122 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:19:36.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 01:19:36.189 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428141,ok=428141,error=0, records=41
[WARN ] 2026-06-02 01:19:37.868 [27335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:19:38.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:19:45.936 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859772},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:19:46.089 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:19:46.089 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 01:19:46.089 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:19:46.089 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:19:46.089 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:19:46.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:19:46.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:19:46.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:19:46.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:19:46.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:19:46.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:19:51.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 01:19:51.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428142,ok=428142,error=0, records=41
[WARN ] 2026-06-02 01:19:52.872 [27299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:19:53.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:20:01.591 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21422/300s
[INFO ] 2026-06-02 01:20:06.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-02 01:20:06.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428143,ok=428143,error=0, records=41
[WARN ] 2026-06-02 01:20:07.877 [27446] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:20:08.124 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:20:19.880 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21413/300s
[INFO ] 2026-06-02 01:20:21.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-02 01:20:21.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428144,ok=428144,error=0, records=41
[WARN ] 2026-06-02 01:20:22.882 [27446] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:20:23.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:20:36.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-02 01:20:36.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428145,ok=428145,error=0, records=41
[WARN ] 2026-06-02 01:20:37.888 [27390] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:20:38.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:20:42.491 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21422/300s
[INFO ] 2026-06-02 01:20:51.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 01:20:51.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428146,ok=428146,error=0, records=41
[INFO ] 2026-06-02 01:20:51.222 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21409/300s
[WARN ] 2026-06-02 01:20:52.893 [27490] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:20:53.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:21:06.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 01:21:06.226 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428147,ok=428147,error=0, records=41
[WARN ] 2026-06-02 01:21:07.899 [27515] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:21:08.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 01:21:17.403 [27498] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19275/stat), No such file or directory
[WARN ] 2026-06-02 01:21:17.403 [27498] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19264/stat), No such file or directory
[WARN ] 2026-06-02 01:21:17.403 [27498] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19420/stat), No such file or directory
[WARN ] 2026-06-02 01:21:17.404 [27498] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21349/stat), No such file or directory
[WARN ] 2026-06-02 01:21:17.404 [27498] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19203/stat), No such file or directory
[INFO ] 2026-06-02 01:21:21.232 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 01:21:21.232 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428148,ok=428148,error=0, records=41
[WARN ] 2026-06-02 01:21:22.906 [27539] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:21:23.127 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 01:21:32.411 [27540] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19275/stat), No such file or directory
[WARN ] 2026-06-02 01:21:32.412 [27540] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19264/stat), No such file or directory
[WARN ] 2026-06-02 01:21:32.412 [27540] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19420/stat), No such file or directory
[WARN ] 2026-06-02 01:21:32.412 [27540] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21349/stat), No such file or directory
[WARN ] 2026-06-02 01:21:32.412 [27540] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19203/stat), No such file or directory
[INFO ] 2026-06-02 01:21:36.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 01:21:36.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428149,ok=428149,error=0, records=41
[WARN ] 2026-06-02 01:21:37.912 [27540] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:21:38.127 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:21:43.639 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21409/300s
[INFO ] 2026-06-02 01:21:45.776 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21418/300s
[WARN ] 2026-06-02 01:21:47.417 [27571] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19275/stat), No such file or directory
[WARN ] 2026-06-02 01:21:47.417 [27571] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19264/stat), No such file or directory
[WARN ] 2026-06-02 01:21:47.417 [27571] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19420/stat), No such file or directory
[WARN ] 2026-06-02 01:21:47.417 [27571] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/21349/stat), No such file or directory
[WARN ] 2026-06-02 01:21:47.417 [27571] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/19203/stat), No such file or directory
[INFO ] 2026-06-02 01:21:51.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 01:21:51.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428150,ok=428150,error=0, records=41
[WARN ] 2026-06-02 01:21:52.918 [27571] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:21:53.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:22:06.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 01:22:06.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428151,ok=428151,error=0, records=41
[WARN ] 2026-06-02 01:22:07.923 [27594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:22:08.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:22:08.129 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21421/300s
[INFO ] 2026-06-02 01:22:21.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 01:22:21.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428152,ok=428152,error=0, records=41
[WARN ] 2026-06-02 01:22:22.929 [27605] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:22:23.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:22:36.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-02 01:22:36.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428153,ok=428153,error=0, records=41
[WARN ] 2026-06-02 01:22:37.934 [27571] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:22:38.130 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:22:46.089 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17835/300s
[INFO ] 2026-06-02 01:22:46.091 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859700},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:22:46.266 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:22:46.266 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 01:22:46.266 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:22:46.266 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:22:46.266 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:22:46.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:22:46.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:22:46.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:22:46.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:22:46.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:22:46.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:22:51.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 01:22:51.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428154,ok=428154,error=0, records=41
[INFO ] 2026-06-02 01:22:52.423 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21419/300s
[WARN ] 2026-06-02 01:22:52.940 [27615] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:22:53.130 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:22:54.323 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21419/300s
[INFO ] 2026-06-02 01:23:01.427 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21419/300s
[INFO ] 2026-06-02 01:23:06.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 01:23:06.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428155,ok=428155,error=0, records=41
[WARN ] 2026-06-02 01:23:07.945 [27556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:23:08.131 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:23:21.278 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 01:23:21.278 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428156,ok=428156,error=0, records=41
[WARN ] 2026-06-02 01:23:22.953 [27657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:23:23.131 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:23:36.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 01:23:36.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428157,ok=428157,error=0, records=41
[WARN ] 2026-06-02 01:23:37.958 [27662] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:23:38.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 01:23:38.132 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 01:23:51.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 01:23:51.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428158,ok=428158,error=0, records=41
[WARN ] 2026-06-02 01:23:52.964 [27645] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:23:53.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:23:53.133 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 01:24:06.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 01:24:06.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428159,ok=428159,error=0, records=41
[WARN ] 2026-06-02 01:24:07.968 [27556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:24:08.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:24:21.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 01:24:21.304 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428160,ok=428160,error=0, records=41
[WARN ] 2026-06-02 01:24:22.973 [27604] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:24:23.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:24:36.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 01:24:36.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428161,ok=428161,error=0, records=41
[WARN ] 2026-06-02 01:24:37.978 [27556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:24:38.135 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:24:51.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 01:24:51.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428162,ok=428162,error=0, records=41
[WARN ] 2026-06-02 01:24:52.983 [27556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:24:53.135 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:25:01.594 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21423/300s
[INFO ] 2026-06-02 01:25:06.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-02 01:25:06.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428163,ok=428163,error=0, records=41
[WARN ] 2026-06-02 01:25:07.988 [27734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:25:08.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:25:19.993 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21414/300s
[INFO ] 2026-06-02 01:25:21.392 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-02 01:25:21.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428164,ok=428164,error=0, records=41
[WARN ] 2026-06-02 01:25:22.994 [27734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:25:23.137 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:25:36.397 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 01:25:36.397 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428165,ok=428165,error=0, records=41
[WARN ] 2026-06-02 01:25:38.000 [27775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:25:38.137 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:25:42.496 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21423/300s
[INFO ] 2026-06-02 01:25:46.268 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859628},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:25:46.425 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:25:46.425 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 01:25:46.425 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:25:46.425 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:25:46.425 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:25:46.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:25:46.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:25:46.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:25:46.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:25:46.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:25:46.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:25:51.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-02 01:25:51.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428166,ok=428166,error=0, records=41
[INFO ] 2026-06-02 01:25:51.402 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21410/300s
[WARN ] 2026-06-02 01:25:53.004 [27804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:25:53.138 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:26:06.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 01:26:06.407 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428167,ok=428167,error=0, records=41
[WARN ] 2026-06-02 01:26:08.009 [27761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:26:08.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:26:21.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 01:26:21.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428168,ok=428168,error=0, records=41
[WARN ] 2026-06-02 01:26:23.013 [27832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:26:23.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:26:36.419 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 01:26:36.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428169,ok=428169,error=0, records=41
[WARN ] 2026-06-02 01:26:38.019 [27775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:26:38.140 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:26:43.816 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21410/300s
[INFO ] 2026-06-02 01:26:45.824 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21419/300s
[INFO ] 2026-06-02 01:26:51.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 01:26:51.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428170,ok=428170,error=0, records=41
[WARN ] 2026-06-02 01:26:53.024 [27761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:26:53.141 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:27:06.429 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 01:27:06.429 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428171,ok=428171,error=0, records=41
[WARN ] 2026-06-02 01:27:08.028 [27761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:27:08.141 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:27:08.141 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21422/300s
[INFO ] 2026-06-02 01:27:21.435 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 01:27:21.435 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428172,ok=428172,error=0, records=41
[WARN ] 2026-06-02 01:27:23.034 [27888] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:27:23.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:27:36.441 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 01:27:36.441 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428173,ok=428173,error=0, records=41
[WARN ] 2026-06-02 01:27:38.038 [27761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:27:38.143 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:27:51.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 01:27:51.448 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428174,ok=428174,error=0, records=41
[INFO ] 2026-06-02 01:27:52.452 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21420/300s
[WARN ] 2026-06-02 01:27:53.043 [27921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:27:53.143 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:27:54.354 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21420/300s
[INFO ] 2026-06-02 01:28:01.460 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21420/300s
[INFO ] 2026-06-02 01:28:06.453 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 01:28:06.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428175,ok=428175,error=0, records=41
[WARN ] 2026-06-02 01:28:08.050 [27938] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:28:08.144 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:28:21.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 01:28:21.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428176,ok=428176,error=0, records=41
[WARN ] 2026-06-02 01:28:22.556 [27920] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:28:23.144 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:28:36.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 01:28:36.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428177,ok=428177,error=0, records=41
[WARN ] 2026-06-02 01:28:37.562 [27961] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:28:38.145 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:28:46.425 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17836/300s
[INFO ] 2026-06-02 01:28:46.427 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859568},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:28:46.598 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:28:46.598 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 01:28:46.598 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:28:46.598 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:28:46.598 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:28:46.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:28:46.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:28:46.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:28:46.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:28:46.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:28:46.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:28:51.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 01:28:51.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428178,ok=428178,error=0, records=41
[WARN ] 2026-06-02 01:28:52.567 [27989] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:28:53.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:29:06.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 01:29:06.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428179,ok=428179,error=0, records=41
[WARN ] 2026-06-02 01:29:07.572 [27960] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:29:08.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:29:21.485 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 01:29:21.485 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428180,ok=428180,error=0, records=41
[WARN ] 2026-06-02 01:29:22.577 [28017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:29:23.147 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:29:36.576 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 01:29:36.576 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428181,ok=428181,error=0, records=41
[WARN ] 2026-06-02 01:29:37.583 [27988] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:29:38.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:29:51.583 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 01:29:51.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428182,ok=428182,error=0, records=41
[WARN ] 2026-06-02 01:29:52.588 [28059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:29:53.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:30:01.598 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21424/300s
[INFO ] 2026-06-02 01:30:06.588 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-02 01:30:06.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428183,ok=428183,error=0, records=41
[WARN ] 2026-06-02 01:30:07.595 [28084] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:30:08.149 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:30:20.099 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21415/300s
[INFO ] 2026-06-02 01:30:21.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11430, records=45
[INFO ] 2026-06-02 01:30:21.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428184,ok=428184,error=0, records=45
[WARN ] 2026-06-02 01:30:22.600 [28056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:30:23.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:30:36.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-02 01:30:36.601 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428185,ok=428185,error=0, records=41
[WARN ] 2026-06-02 01:30:37.606 [28059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:30:38.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:30:42.503 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21424/300s
[INFO ] 2026-06-02 01:30:51.607 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 01:30:51.607 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428186,ok=428186,error=0, records=41
[INFO ] 2026-06-02 01:30:51.607 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21411/300s
[WARN ] 2026-06-02 01:30:52.611 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:30:53.151 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:31:06.612 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 01:31:06.612 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428187,ok=428187,error=0, records=41
[WARN ] 2026-06-02 01:31:07.616 [28069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:31:08.151 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:31:21.704 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 01:31:21.704 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428188,ok=428188,error=0, records=41
[WARN ] 2026-06-02 01:31:22.621 [28069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:31:23.152 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:31:36.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 01:31:36.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428189,ok=428189,error=0, records=41
[WARN ] 2026-06-02 01:31:37.626 [28069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:31:38.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:31:43.999 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21411/300s
[INFO ] 2026-06-02 01:31:45.880 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21420/300s
[INFO ] 2026-06-02 01:31:46.600 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859492},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:31:46.773 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:31:46.773 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 01:31:46.773 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:31:46.773 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:31:46.773 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:31:46.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:31:46.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:31:46.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:31:46.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:31:46.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:31:46.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:31:51.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 01:31:51.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428190,ok=428190,error=0, records=41
[WARN ] 2026-06-02 01:31:52.631 [28059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:31:53.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:32:06.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 01:32:06.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428191,ok=428191,error=0, records=41
[WARN ] 2026-06-02 01:32:07.637 [28069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:32:08.154 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:32:08.154 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21423/300s
[INFO ] 2026-06-02 01:32:21.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 01:32:21.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428192,ok=428192,error=0, records=41
[WARN ] 2026-06-02 01:32:22.641 [28084] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:32:23.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:32:36.796 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 01:32:36.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428193,ok=428193,error=0, records=41
[WARN ] 2026-06-02 01:32:37.647 [28059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:32:38.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:32:51.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 01:32:51.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428194,ok=428194,error=0, records=41
[INFO ] 2026-06-02 01:32:52.509 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21421/300s
[WARN ] 2026-06-02 01:32:52.653 [28069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:32:53.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:32:54.410 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21421/300s
[INFO ] 2026-06-02 01:33:01.513 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21421/300s
[INFO ] 2026-06-02 01:33:06.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 01:33:06.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428195,ok=428195,error=0, records=41
[WARN ] 2026-06-02 01:33:07.659 [28084] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:33:08.156 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:33:21.813 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-02 01:33:21.813 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428196,ok=428196,error=0, records=41
[WARN ] 2026-06-02 01:33:22.663 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:33:23.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:33:36.818 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-02 01:33:36.818 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428197,ok=428197,error=0, records=41
[WARN ] 2026-06-02 01:33:37.667 [28069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:33:38.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 01:33:38.158 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 01:33:51.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-02 01:33:51.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428198,ok=428198,error=0, records=41
[WARN ] 2026-06-02 01:33:52.671 [28069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:33:53.158 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:34:06.830 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-02 01:34:06.830 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428199,ok=428199,error=0, records=41
[WARN ] 2026-06-02 01:34:07.675 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:34:08.159 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:34:21.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-02 01:34:21.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428200,ok=428200,error=0, records=41
[WARN ] 2026-06-02 01:34:22.682 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:34:23.159 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:34:36.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-02 01:34:36.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428201,ok=428201,error=0, records=41
[WARN ] 2026-06-02 01:34:37.686 [28056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:34:38.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:34:46.773 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17837/300s
[INFO ] 2026-06-02 01:34:46.775 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859408},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:34:46.954 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:34:46.954 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 01:34:46.954 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:34:46.954 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:34:46.954 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:34:46.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:34:46.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:34:46.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:34:46.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:34:46.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:34:46.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:34:51.849 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-02 01:34:51.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428202,ok=428202,error=0, records=41
[WARN ] 2026-06-02 01:34:52.691 [28056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:34:53.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:35:01.601 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21425/300s
[INFO ] 2026-06-02 01:35:06.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-02 01:35:06.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428203,ok=428203,error=0, records=41
[WARN ] 2026-06-02 01:35:07.696 [28084] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:35:08.162 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:35:20.200 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21416/300s
[INFO ] 2026-06-02 01:35:21.859 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 01:35:21.859 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428204,ok=428204,error=0, records=41
[WARN ] 2026-06-02 01:35:22.701 [28069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:35:23.162 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:35:36.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 01:35:36.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428205,ok=428205,error=0, records=41
[WARN ] 2026-06-02 01:35:37.706 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:35:38.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:35:42.509 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21425/300s
[INFO ] 2026-06-02 01:35:51.873 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-02 01:35:51.873 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428206,ok=428206,error=0, records=41
[INFO ] 2026-06-02 01:35:51.873 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21412/300s
[WARN ] 2026-06-02 01:35:52.712 [28069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:35:53.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:36:06.879 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 01:36:06.879 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428207,ok=428207,error=0, records=41
[WARN ] 2026-06-02 01:36:07.716 [28084] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:36:08.164 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:36:21.885 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 01:36:21.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428208,ok=428208,error=0, records=41
[WARN ] 2026-06-02 01:36:22.721 [28056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:36:23.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:36:36.890 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 01:36:36.890 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428209,ok=428209,error=0, records=41
[WARN ] 2026-06-02 01:36:37.726 [28056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:36:38.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:36:44.182 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21412/300s
[INFO ] 2026-06-02 01:36:45.938 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21421/300s
[INFO ] 2026-06-02 01:36:51.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 01:36:51.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428210,ok=428210,error=0, records=41
[WARN ] 2026-06-02 01:36:52.732 [28084] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:36:53.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:37:06.901 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 01:37:06.901 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428211,ok=428211,error=0, records=41
[WARN ] 2026-06-02 01:37:07.737 [28084] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:37:08.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:37:08.166 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21424/300s
[INFO ] 2026-06-02 01:37:21.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 01:37:21.905 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428212,ok=428212,error=0, records=41
[WARN ] 2026-06-02 01:37:22.743 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:37:23.167 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:37:36.911 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 01:37:36.911 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428213,ok=428213,error=0, records=41
[WARN ] 2026-06-02 01:37:37.748 [28069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:37:38.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:37:46.955 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859328},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:37:47.131 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:37:47.131 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 01:37:47.132 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:37:47.132 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:37:47.132 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:37:47.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:37:47.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:37:47.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:37:47.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:37:47.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:37:47.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:37:51.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 01:37:51.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428214,ok=428214,error=0, records=41
[INFO ] 2026-06-02 01:37:52.580 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21422/300s
[WARN ] 2026-06-02 01:37:52.753 [28084] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:37:53.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:37:54.482 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21422/300s
[INFO ] 2026-06-02 01:38:01.588 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21422/300s
[INFO ] 2026-06-02 01:38:06.935 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-02 01:38:06.935 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428215,ok=428215,error=0, records=41
[WARN ] 2026-06-02 01:38:07.760 [28084] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:38:08.169 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:38:21.941 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-02 01:38:21.941 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428216,ok=428216,error=0, records=41
[WARN ] 2026-06-02 01:38:22.764 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:38:23.169 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:38:36.946 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-02 01:38:36.946 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428217,ok=428217,error=0, records=41
[WARN ] 2026-06-02 01:38:37.768 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:38:38.170 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:38:51.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 01:38:51.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428218,ok=428218,error=0, records=41
[WARN ] 2026-06-02 01:38:52.773 [28069] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:38:53.171 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:38:53.171 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 01:39:06.957 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 01:39:06.957 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428219,ok=428219,error=0, records=41
[WARN ] 2026-06-02 01:39:07.777 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:39:08.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:39:21.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 01:39:21.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428220,ok=428220,error=0, records=41
[WARN ] 2026-06-02 01:39:22.782 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:39:23.173 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:39:36.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 01:39:36.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428221,ok=428221,error=0, records=41
[WARN ] 2026-06-02 01:39:37.788 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:39:38.173 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:39:51.978 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 01:39:51.978 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428222,ok=428222,error=0, records=41
[WARN ] 2026-06-02 01:39:52.792 [28055] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:39:53.174 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:40:01.605 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21426/300s
[INFO ] 2026-06-02 01:40:06.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 01:40:06.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428223,ok=428223,error=0, records=41
[WARN ] 2026-06-02 01:40:07.798 [28056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:40:08.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:40:20.302 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21417/300s
[INFO ] 2026-06-02 01:40:21.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 01:40:21.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428224,ok=428224,error=0, records=41
[WARN ] 2026-06-02 01:40:22.803 [28056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:40:23.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:40:36.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 01:40:36.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428225,ok=428225,error=0, records=41
[WARN ] 2026-06-02 01:40:37.807 [28640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:40:38.176 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:40:42.517 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21426/300s
[INFO ] 2026-06-02 01:40:47.132 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17838/300s
[INFO ] 2026-06-02 01:40:47.133 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859248},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:40:47.299 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:40:47.299 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 01:40:47.299 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:40:47.299 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:40:47.299 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:40:47.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:40:47.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:40:47.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:40:47.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:40:47.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:40:47.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:40:52.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 01:40:52.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428226,ok=428226,error=0, records=41
[INFO ] 2026-06-02 01:40:52.002 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21413/300s
[WARN ] 2026-06-02 01:40:52.812 [28059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:40:53.177 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:41:07.012 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 01:41:07.012 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428227,ok=428227,error=0, records=41
[WARN ] 2026-06-02 01:41:07.817 [28676] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:41:08.177 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:41:22.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 01:41:22.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428228,ok=428228,error=0, records=41
[WARN ] 2026-06-02 01:41:22.822 [28690] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:41:23.178 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:41:37.086 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 01:41:37.086 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428229,ok=428229,error=0, records=41
[WARN ] 2026-06-02 01:41:37.827 [28056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:41:38.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:41:44.364 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21413/300s
[INFO ] 2026-06-02 01:41:45.996 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21422/300s
[INFO ] 2026-06-02 01:41:52.092 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 01:41:52.092 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428230,ok=428230,error=0, records=41
[WARN ] 2026-06-02 01:41:52.834 [28056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:41:53.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:42:07.097 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 01:42:07.097 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428231,ok=428231,error=0, records=41
[WARN ] 2026-06-02 01:42:07.838 [28690] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:42:08.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:42:08.180 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21425/300s
[INFO ] 2026-06-02 01:42:22.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 01:42:22.103 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428232,ok=428232,error=0, records=41
[WARN ] 2026-06-02 01:42:22.842 [28740] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:42:23.181 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:42:37.110 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 01:42:37.110 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428233,ok=428233,error=0, records=41
[WARN ] 2026-06-02 01:42:37.847 [28690] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:42:38.181 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:42:52.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 01:42:52.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428234,ok=428234,error=0, records=41
[INFO ] 2026-06-02 01:42:52.664 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21423/300s
[WARN ] 2026-06-02 01:42:52.852 [28754] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:42:53.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:42:54.565 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21423/300s
[INFO ] 2026-06-02 01:43:01.671 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21423/300s
[INFO ] 2026-06-02 01:43:07.123 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 01:43:07.123 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428235,ok=428235,error=0, records=41
[WARN ] 2026-06-02 01:43:07.857 [28754] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:43:08.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:43:22.133 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 01:43:22.133 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428236,ok=428236,error=0, records=41
[WARN ] 2026-06-02 01:43:22.862 [28768] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:43:23.183 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:43:37.139 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 01:43:37.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428237,ok=428237,error=0, records=41
[WARN ] 2026-06-02 01:43:37.867 [28810] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:43:38.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 01:43:38.184 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 01:43:47.301 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859168},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:43:47.461 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:43:47.461 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 01:43:47.461 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:43:47.461 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:43:47.461 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:43:47.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:43:47.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:43:47.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:43:47.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:43:47.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:43:47.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:43:52.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 01:43:52.144 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428238,ok=428238,error=0, records=41
[WARN ] 2026-06-02 01:43:52.871 [28754] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:43:53.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:44:07.150 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 01:44:07.150 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428239,ok=428239,error=0, records=41
[WARN ] 2026-06-02 01:44:07.875 [28754] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:44:08.185 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:44:22.156 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 01:44:22.156 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428240,ok=428240,error=0, records=41
[WARN ] 2026-06-02 01:44:22.882 [28754] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:44:23.186 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:44:37.162 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 01:44:37.162 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428241,ok=428241,error=0, records=41
[WARN ] 2026-06-02 01:44:37.886 [28870] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:44:38.186 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:44:52.167 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 01:44:52.167 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428242,ok=428242,error=0, records=41
[WARN ] 2026-06-02 01:44:52.892 [28886] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:44:53.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:45:01.608 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21427/300s
[INFO ] 2026-06-02 01:45:07.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 01:45:07.172 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428243,ok=428243,error=0, records=41
[WARN ] 2026-06-02 01:45:07.898 [28870] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:45:08.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:45:20.401 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21418/300s
[INFO ] 2026-06-02 01:45:22.177 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 01:45:22.177 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428244,ok=428244,error=0, records=41
[WARN ] 2026-06-02 01:45:22.903 [28919] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:45:23.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:45:37.182 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 01:45:37.182 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428245,ok=428245,error=0, records=41
[WARN ] 2026-06-02 01:45:37.908 [28920] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:45:38.189 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:45:42.524 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21427/300s
[INFO ] 2026-06-02 01:45:52.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 01:45:52.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428246,ok=428246,error=0, records=41
[INFO ] 2026-06-02 01:45:52.187 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21414/300s
[WARN ] 2026-06-02 01:45:52.914 [28926] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:45:53.189 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:46:07.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 01:46:07.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428247,ok=428247,error=0, records=41
[WARN ] 2026-06-02 01:46:07.919 [28958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:46:08.190 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:46:22.198 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 01:46:22.198 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428248,ok=428248,error=0, records=41
[WARN ] 2026-06-02 01:46:22.924 [28991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:46:23.191 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:46:37.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 01:46:37.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428249,ok=428249,error=0, records=41
[WARN ] 2026-06-02 01:46:37.931 [28958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:46:38.191 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:46:44.539 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21414/300s
[INFO ] 2026-06-02 01:46:46.054 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21423/300s
[INFO ] 2026-06-02 01:46:47.461 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17839/300s
[INFO ] 2026-06-02 01:46:47.463 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859096},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:46:47.615 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:46:47.615 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 01:46:47.616 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:46:47.616 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:46:47.616 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:46:47.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:46:47.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:46:47.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:46:47.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:46:47.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:46:47.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:46:52.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10142, records=41
[INFO ] 2026-06-02 01:46:52.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428250,ok=428250,error=0, records=41
[WARN ] 2026-06-02 01:46:52.937 [28926] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:46:53.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:47:07.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-02 01:47:07.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428251,ok=428251,error=0, records=41
[WARN ] 2026-06-02 01:47:07.942 [29030] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:47:08.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:47:08.192 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21426/300s
[INFO ] 2026-06-02 01:47:22.220 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-02 01:47:22.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428252,ok=428252,error=0, records=41
[WARN ] 2026-06-02 01:47:22.947 [29012] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:47:23.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:47:37.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-02 01:47:37.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428253,ok=428253,error=0, records=41
[WARN ] 2026-06-02 01:47:37.952 [29012] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:47:38.194 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:47:52.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-02 01:47:52.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428254,ok=428254,error=0, records=41
[INFO ] 2026-06-02 01:47:52.723 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21424/300s
[WARN ] 2026-06-02 01:47:52.959 [29018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:47:53.194 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:47:54.625 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21424/300s
[INFO ] 2026-06-02 01:48:01.732 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21424/300s
[INFO ] 2026-06-02 01:48:07.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 01:48:07.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428255,ok=428255,error=0, records=41
[WARN ] 2026-06-02 01:48:07.963 [29036] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:48:08.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:48:22.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 01:48:22.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428256,ok=428256,error=0, records=41
[WARN ] 2026-06-02 01:48:22.969 [29036] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:48:23.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:48:37.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 01:48:37.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428257,ok=428257,error=0, records=41
[WARN ] 2026-06-02 01:48:37.974 [29017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:48:38.196 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:48:52.312 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 01:48:52.312 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428258,ok=428258,error=0, records=41
[WARN ] 2026-06-02 01:48:52.980 [29123] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:48:53.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:49:07.317 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 01:49:07.317 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428259,ok=428259,error=0, records=41
[WARN ] 2026-06-02 01:49:07.985 [29137] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:49:08.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:49:22.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 01:49:22.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428260,ok=428260,error=0, records=41
[WARN ] 2026-06-02 01:49:22.990 [29165] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:49:23.198 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:49:37.327 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 01:49:37.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428261,ok=428261,error=0, records=41
[WARN ] 2026-06-02 01:49:37.996 [29096] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:49:38.199 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:49:47.618 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20859016},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:49:47.768 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:49:47.769 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 01:49:47.769 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:49:47.769 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:49:47.769 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:49:47.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:49:47.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:49:47.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:49:47.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:49:47.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:49:47.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:49:52.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 01:49:52.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428262,ok=428262,error=0, records=41
[WARN ] 2026-06-02 01:49:53.001 [29179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:49:53.199 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:50:01.612 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21428/300s
[INFO ] 2026-06-02 01:50:07.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 01:50:07.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428263,ok=428263,error=0, records=41
[WARN ] 2026-06-02 01:50:08.007 [29151] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:50:08.200 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:50:20.512 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21419/300s
[INFO ] 2026-06-02 01:50:22.346 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 01:50:22.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428264,ok=428264,error=0, records=41
[WARN ] 2026-06-02 01:50:23.013 [29193] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:50:23.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:50:37.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 01:50:37.445 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428265,ok=428265,error=0, records=41
[WARN ] 2026-06-02 01:50:38.019 [29165] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:50:38.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:50:42.530 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21428/300s
[INFO ] 2026-06-02 01:50:52.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 01:50:52.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428266,ok=428266,error=0, records=41
[INFO ] 2026-06-02 01:50:52.450 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21415/300s
[WARN ] 2026-06-02 01:50:53.024 [29240] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:50:53.202 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:51:07.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 01:51:07.456 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428267,ok=428267,error=0, records=41
[WARN ] 2026-06-02 01:51:08.032 [29165] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:51:08.202 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:51:22.461 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 01:51:22.461 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428268,ok=428268,error=0, records=41
[WARN ] 2026-06-02 01:51:23.036 [29268] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:51:23.203 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:51:37.467 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 01:51:37.467 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428269,ok=428269,error=0, records=41
[WARN ] 2026-06-02 01:51:38.041 [29289] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:51:38.204 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:51:44.722 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21415/300s
[INFO ] 2026-06-02 01:51:46.111 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21424/300s
[INFO ] 2026-06-02 01:51:52.473 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 01:51:52.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428270,ok=428270,error=0, records=41
[WARN ] 2026-06-02 01:51:53.046 [29289] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:51:53.204 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:52:07.480 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-02 01:52:07.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428271,ok=428271,error=0, records=41
[WARN ] 2026-06-02 01:52:08.052 [29323] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:52:08.205 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:52:08.205 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21427/300s
[INFO ] 2026-06-02 01:52:22.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 01:52:22.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428272,ok=428272,error=0, records=41
[WARN ] 2026-06-02 01:52:22.557 [29328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:52:23.206 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:52:37.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-02 01:52:37.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428273,ok=428273,error=0, records=41
[WARN ] 2026-06-02 01:52:37.561 [29328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:52:38.206 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:52:47.769 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17840/300s
[INFO ] 2026-06-02 01:52:47.770 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858936},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:52:47.947 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:52:47.947 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 01:52:47.947 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:52:47.947 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:52:47.947 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:52:47.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:52:47.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:52:47.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:52:47.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:52:47.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:52:47.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:52:52.497 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-02 01:52:52.497 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428274,ok=428274,error=0, records=41
[WARN ] 2026-06-02 01:52:52.566 [29374] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:52:52.799 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21425/300s
[INFO ] 2026-06-02 01:52:53.207 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:52:54.700 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21425/300s
[INFO ] 2026-06-02 01:53:01.804 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21425/300s
[WARN ] 2026-06-02 01:53:07.571 [29401] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:53:07.573 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 01:53:07.573 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428275,ok=428275,error=0, records=41
[INFO ] 2026-06-02 01:53:08.207 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 01:53:22.575 [29401] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:53:22.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 01:53:22.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428276,ok=428276,error=0, records=41
[INFO ] 2026-06-02 01:53:23.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 01:53:37.581 [29426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:53:37.583 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 01:53:37.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428277,ok=428277,error=0, records=41
[INFO ] 2026-06-02 01:53:38.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 01:53:38.208 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 01:53:52.587 [29452] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:53:52.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 01:53:52.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428278,ok=428278,error=0, records=41
[INFO ] 2026-06-02 01:53:53.209 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:53:53.209 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-02 01:54:07.593 [29461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:54:07.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 01:54:07.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428279,ok=428279,error=0, records=41
[INFO ] 2026-06-02 01:54:08.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 01:54:22.599 [29476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:54:22.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 01:54:22.601 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428280,ok=428280,error=0, records=41
[INFO ] 2026-06-02 01:54:23.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 01:54:37.605 [29461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:54:37.606 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 01:54:37.606 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428281,ok=428281,error=0, records=41
[INFO ] 2026-06-02 01:54:38.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 01:54:52.610 [29485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:54:52.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 01:54:52.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428282,ok=428282,error=0, records=41
[INFO ] 2026-06-02 01:54:53.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:55:01.615 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21429/300s
[WARN ] 2026-06-02 01:55:07.616 [29476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:55:07.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 01:55:07.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428283,ok=428283,error=0, records=41
[INFO ] 2026-06-02 01:55:08.213 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:55:20.621 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21420/300s
[WARN ] 2026-06-02 01:55:22.621 [29476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:55:22.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 01:55:22.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428284,ok=428284,error=0, records=41
[INFO ] 2026-06-02 01:55:23.214 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 01:55:37.627 [29470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:55:37.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 01:55:37.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428285,ok=428285,error=0, records=41
[INFO ] 2026-06-02 01:55:38.214 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:55:42.536 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21429/300s
[INFO ] 2026-06-02 01:55:47.949 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858860},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:55:48.104 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:55:48.104 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 01:55:48.105 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:55:48.105 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:55:48.105 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:55:48.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:55:48.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:55:48.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:55:48.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:55:48.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:55:48.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 01:55:52.633 [29476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:55:52.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 01:55:52.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428286,ok=428286,error=0, records=41
[INFO ] 2026-06-02 01:55:52.637 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21416/300s
[INFO ] 2026-06-02 01:55:53.215 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 01:56:07.637 [29476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:56:08.216 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-02 01:56:08.684 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 01:56:08.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428287,ok=428287,error=0, records=41
[WARN ] 2026-06-02 01:56:22.642 [29460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:56:23.216 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:56:23.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11865, records=51
[INFO ] 2026-06-02 01:56:23.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428288,ok=428288,error=0, records=51
[WARN ] 2026-06-02 01:56:37.646 [29461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:56:38.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:56:38.702 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-02 01:56:38.702 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428289,ok=428289,error=0, records=41
[INFO ] 2026-06-02 01:56:44.907 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21416/300s
[INFO ] 2026-06-02 01:56:46.165 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21425/300s
[WARN ] 2026-06-02 01:56:52.652 [29470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:56:53.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:56:53.708 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10147, records=41
[INFO ] 2026-06-02 01:56:53.708 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428290,ok=428290,error=0, records=41
[WARN ] 2026-06-02 01:57:07.659 [29485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:57:08.218 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:57:08.218 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21428/300s
[INFO ] 2026-06-02 01:57:08.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10379, records=41
[INFO ] 2026-06-02 01:57:08.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428291,ok=428291,error=0, records=41
[WARN ] 2026-06-02 01:57:22.663 [29470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:57:23.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:57:23.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-02 01:57:23.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428292,ok=428292,error=0, records=41
[WARN ] 2026-06-02 01:57:37.668 [29476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:57:38.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:57:38.724 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-02 01:57:38.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428293,ok=428293,error=0, records=41
[WARN ] 2026-06-02 01:57:52.674 [29485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:57:52.854 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21426/300s
[INFO ] 2026-06-02 01:57:53.220 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:57:53.730 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 01:57:53.730 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428294,ok=428294,error=0, records=41
[INFO ] 2026-06-02 01:57:54.756 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21426/300s
[INFO ] 2026-06-02 01:58:01.862 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21426/300s
[WARN ] 2026-06-02 01:58:07.678 [29470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:58:08.221 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:58:08.740 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 01:58:08.740 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428295,ok=428295,error=0, records=41
[WARN ] 2026-06-02 01:58:22.683 [29485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:58:23.222 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:58:23.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 01:58:23.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428296,ok=428296,error=0, records=41
[WARN ] 2026-06-02 01:58:37.689 [29476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:58:38.222 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:58:38.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 01:58:38.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428297,ok=428297,error=0, records=41
[INFO ] 2026-06-02 01:58:48.105 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17841/300s
[INFO ] 2026-06-02 01:58:48.106 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858780},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 01:58:48.359 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 01:58:48.359 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 01:58:48.359 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 01:58:48.359 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 01:58:48.359 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 01:58:48.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:58:48.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 01:58:48.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:58:48.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 01:58:48.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 01:58:48.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 01:58:52.693 [29476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:58:53.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:58:53.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 01:58:53.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428298,ok=428298,error=0, records=41
[WARN ] 2026-06-02 01:59:07.698 [29485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:59:08.224 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:59:08.763 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 01:59:08.763 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428299,ok=428299,error=0, records=41
[WARN ] 2026-06-02 01:59:22.703 [29470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:59:23.224 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:59:23.768 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 01:59:23.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428300,ok=428300,error=0, records=41
[WARN ] 2026-06-02 01:59:37.707 [29485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:59:38.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:59:38.774 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 01:59:38.774 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428301,ok=428301,error=0, records=41
[WARN ] 2026-06-02 01:59:52.712 [29476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 01:59:53.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 01:59:53.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 01:59:53.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428302,ok=428302,error=0, records=41
[INFO ] 2026-06-02 02:00:01.619 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21430/300s
[WARN ] 2026-06-02 02:00:07.717 [29485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:00:08.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:00:08.787 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 02:00:08.787 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428303,ok=428303,error=0, records=41
[INFO ] 2026-06-02 02:00:20.722 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21421/300s
[WARN ] 2026-06-02 02:00:22.723 [29461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:00:23.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:00:23.792 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-02 02:00:23.792 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428304,ok=428304,error=0, records=41
[WARN ] 2026-06-02 02:00:37.728 [29476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:00:38.227 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:00:38.798 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 02:00:38.799 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428305,ok=428305,error=0, records=41
[INFO ] 2026-06-02 02:00:42.543 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21430/300s
[WARN ] 2026-06-02 02:00:52.732 [29476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:00:53.228 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:00:53.804 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 02:00:53.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428306,ok=428306,error=0, records=41
[INFO ] 2026-06-02 02:00:53.804 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21417/300s
[WARN ] 2026-06-02 02:01:07.737 [29461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:01:08.229 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:01:08.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 02:01:08.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428307,ok=428307,error=0, records=41
[WARN ] 2026-06-02 02:01:22.741 [29461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:01:23.229 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:01:23.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-02 02:01:23.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428308,ok=428308,error=0, records=41
[WARN ] 2026-06-02 02:01:37.747 [29485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:01:38.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:01:38.820 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-02 02:01:38.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428309,ok=428309,error=0, records=41
[INFO ] 2026-06-02 02:01:45.095 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21417/300s
[INFO ] 2026-06-02 02:01:46.222 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21426/300s
[INFO ] 2026-06-02 02:01:48.361 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858700},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:01:48.534 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:01:48.534 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 02:01:48.534 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:01:48.534 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:01:48.534 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:01:48.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:01:48.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:01:48.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:01:48.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:01:48.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:01:48.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:01:52.753 [29470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:01:53.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:01:53.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 02:01:53.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428310,ok=428310,error=0, records=41
[WARN ] 2026-06-02 02:02:07.759 [29470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:02:08.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:02:08.231 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21429/300s
[INFO ] 2026-06-02 02:02:08.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 02:02:08.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428311,ok=428311,error=0, records=41
[WARN ] 2026-06-02 02:02:22.763 [29460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:02:23.232 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:02:23.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 02:02:23.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428312,ok=428312,error=0, records=41
[WARN ] 2026-06-02 02:02:37.767 [29485] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:02:38.232 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:02:38.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 02:02:38.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428313,ok=428313,error=0, records=41
[WARN ] 2026-06-02 02:02:52.771 [29470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:02:52.919 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21427/300s
[INFO ] 2026-06-02 02:02:53.233 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:02:53.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 02:02:53.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428314,ok=428314,error=0, records=41
[INFO ] 2026-06-02 02:02:54.820 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21427/300s
[INFO ] 2026-06-02 02:03:01.928 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21427/300s
[WARN ] 2026-06-02 02:03:07.776 [29470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:03:08.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:03:08.858 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-02 02:03:08.858 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428315,ok=428315,error=0, records=41
[WARN ] 2026-06-02 02:03:22.782 [29470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:03:23.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:03:23.864 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 02:03:23.864 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428316,ok=428316,error=0, records=41
[WARN ] 2026-06-02 02:03:37.786 [29470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:03:38.235 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 02:03:38.235 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 02:03:38.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 02:03:38.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428317,ok=428317,error=0, records=41
[WARN ] 2026-06-02 02:03:52.793 [29461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:03:53.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:03:53.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-02 02:03:53.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428318,ok=428318,error=0, records=41
[WARN ] 2026-06-02 02:04:07.800 [29461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:04:08.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:04:08.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-02 02:04:08.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428319,ok=428319,error=0, records=41
[WARN ] 2026-06-02 02:04:22.806 [29460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:04:23.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:04:23.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 02:04:23.981 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428320,ok=428320,error=0, records=41
[WARN ] 2026-06-02 02:04:37.811 [30061] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:04:38.238 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:04:38.987 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-02 02:04:38.987 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428321,ok=428321,error=0, records=41
[INFO ] 2026-06-02 02:04:48.534 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17842/300s
[INFO ] 2026-06-02 02:04:48.536 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858616},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:04:48.712 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:04:48.712 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 02:04:48.712 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:04:48.712 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:04:48.712 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:04:48.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:04:48.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:04:48.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:04:48.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:04:48.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:04:48.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:04:52.816 [29461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:04:53.238 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:04:53.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 02:04:53.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428322,ok=428322,error=0, records=41
[INFO ] 2026-06-02 02:05:01.622 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21431/300s
[WARN ] 2026-06-02 02:05:07.822 [30042] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:05:08.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:05:09.013 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-02 02:05:09.013 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428323,ok=428323,error=0, records=41
[INFO ] 2026-06-02 02:05:20.827 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21422/300s
[WARN ] 2026-06-02 02:05:22.828 [30071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:05:23.240 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:05:24.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 02:05:24.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428324,ok=428324,error=0, records=41
[WARN ] 2026-06-02 02:05:37.832 [29460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:05:38.240 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:05:39.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 02:05:39.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428325,ok=428325,error=0, records=41
[INFO ] 2026-06-02 02:05:42.550 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21431/300s
[WARN ] 2026-06-02 02:05:52.838 [30117] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:05:53.241 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:05:54.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-02 02:05:54.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428326,ok=428326,error=0, records=41
[INFO ] 2026-06-02 02:05:54.085 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21418/300s
[WARN ] 2026-06-02 02:06:07.844 [30131] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:06:08.241 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:06:09.090 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 02:06:09.090 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428327,ok=428327,error=0, records=41
[WARN ] 2026-06-02 02:06:22.849 [30155] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:06:23.242 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:06:24.095 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 02:06:24.095 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428328,ok=428328,error=0, records=41
[WARN ] 2026-06-02 02:06:37.855 [30103] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:06:38.243 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:06:39.100 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 02:06:39.100 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428329,ok=428329,error=0, records=41
[INFO ] 2026-06-02 02:06:45.273 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21418/300s
[INFO ] 2026-06-02 02:06:46.277 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21427/300s
[WARN ] 2026-06-02 02:06:52.861 [30131] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:06:53.243 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:06:54.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 02:06:54.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428330,ok=428330,error=0, records=41
[WARN ] 2026-06-02 02:07:07.867 [30195] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:07:08.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:07:08.244 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21430/300s
[INFO ] 2026-06-02 02:07:09.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 02:07:09.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428331,ok=428331,error=0, records=41
[WARN ] 2026-06-02 02:07:22.871 [30210] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:07:23.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:07:24.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-02 02:07:24.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428332,ok=428332,error=0, records=41
[WARN ] 2026-06-02 02:07:37.876 [30230] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:07:38.245 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:07:39.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 02:07:39.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428333,ok=428333,error=0, records=41
[INFO ] 2026-06-02 02:07:48.714 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858544},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:07:48.886 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:07:48.886 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 02:07:48.886 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:07:48.886 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:07:48.886 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:07:48.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:07:48.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:07:48.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:07:48.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:07:48.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:07:48.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:07:52.880 [30246] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:07:52.975 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21428/300s
[INFO ] 2026-06-02 02:07:53.246 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:07:54.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-02 02:07:54.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428334,ok=428334,error=0, records=41
[INFO ] 2026-06-02 02:07:54.876 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21428/300s
[INFO ] 2026-06-02 02:08:01.982 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21428/300s
[WARN ] 2026-06-02 02:08:07.884 [30155] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:08:08.246 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:08:09.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 02:08:09.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428335,ok=428335,error=0, records=41
[WARN ] 2026-06-02 02:08:22.891 [30258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:08:23.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:08:24.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 02:08:24.276 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428336,ok=428336,error=0, records=41
[WARN ] 2026-06-02 02:08:37.896 [30258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:08:38.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:08:39.282 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 02:08:39.282 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428337,ok=428337,error=0, records=41
[WARN ] 2026-06-02 02:08:52.901 [30307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:08:53.248 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:08:53.248 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 02:08:54.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 02:08:54.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428338,ok=428338,error=0, records=41
[WARN ] 2026-06-02 02:09:07.906 [30301] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:09:08.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:09:09.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 02:09:09.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428339,ok=428339,error=0, records=41
[WARN ] 2026-06-02 02:09:22.912 [30258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:09:23.250 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:09:24.299 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 02:09:24.299 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428340,ok=428340,error=0, records=41
[WARN ] 2026-06-02 02:09:37.918 [30258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:09:38.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:09:39.305 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 02:09:39.305 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428341,ok=428341,error=0, records=41
[WARN ] 2026-06-02 02:09:52.924 [30301] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:09:53.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:09:54.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 02:09:54.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428342,ok=428342,error=0, records=41
[INFO ] 2026-06-02 02:10:01.626 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21432/300s
[WARN ] 2026-06-02 02:10:07.929 [30390] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:10:08.252 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:10:09.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 02:10:09.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428343,ok=428343,error=0, records=41
[INFO ] 2026-06-02 02:10:20.935 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21423/300s
[WARN ] 2026-06-02 02:10:22.936 [30380] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:10:23.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:10:24.323 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 02:10:24.323 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428344,ok=428344,error=0, records=41
[WARN ] 2026-06-02 02:10:37.940 [30391] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:10:38.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:10:39.329 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 02:10:39.329 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428345,ok=428345,error=0, records=41
[INFO ] 2026-06-02 02:10:42.556 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21432/300s
[INFO ] 2026-06-02 02:10:48.887 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17843/300s
[INFO ] 2026-06-02 02:10:48.888 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858468},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:10:49.052 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:10:49.053 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 02:10:49.053 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:10:49.053 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:10:49.053 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:10:49.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:10:49.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:10:49.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:10:49.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:10:49.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:10:49.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:10:52.945 [30425] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:10:53.254 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:10:54.334 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 02:10:54.334 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428346,ok=428346,error=0, records=41
[INFO ] 2026-06-02 02:10:54.334 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21419/300s
[WARN ] 2026-06-02 02:11:07.951 [30301] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:11:08.254 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:11:09.340 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 02:11:09.340 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428347,ok=428347,error=0, records=41
[WARN ] 2026-06-02 02:11:22.955 [30401] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:11:23.255 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:11:24.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-02 02:11:24.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428348,ok=428348,error=0, records=41
[WARN ] 2026-06-02 02:11:37.960 [30401] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:11:38.256 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:11:39.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-02 02:11:39.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428349,ok=428349,error=0, records=41
[INFO ] 2026-06-02 02:11:45.454 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21419/300s
[INFO ] 2026-06-02 02:11:46.334 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21428/300s
[WARN ] 2026-06-02 02:11:52.966 [30391] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:11:53.256 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:11:54.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 02:11:54.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428350,ok=428350,error=0, records=41
[WARN ] 2026-06-02 02:12:07.970 [30301] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:12:08.257 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:12:08.257 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21431/300s
[INFO ] 2026-06-02 02:12:09.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 02:12:09.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428351,ok=428351,error=0, records=41
[WARN ] 2026-06-02 02:12:22.975 [30522] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:12:23.258 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:12:24.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 02:12:24.368 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428352,ok=428352,error=0, records=41
[WARN ] 2026-06-02 02:12:37.980 [30391] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:12:38.258 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:12:39.374 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 02:12:39.374 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428353,ok=428353,error=0, records=41
[WARN ] 2026-06-02 02:12:52.985 [30401] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:12:53.033 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21429/300s
[INFO ] 2026-06-02 02:12:53.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:12:54.379 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 02:12:54.379 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428354,ok=428354,error=0, records=41
[INFO ] 2026-06-02 02:12:54.935 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21429/300s
[INFO ] 2026-06-02 02:13:02.042 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21429/300s
[WARN ] 2026-06-02 02:13:07.991 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:13:08.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:13:09.384 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-02 02:13:09.384 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428355,ok=428355,error=0, records=41
[WARN ] 2026-06-02 02:13:22.997 [30577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:13:23.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:13:24.392 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-02 02:13:24.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428356,ok=428356,error=0, records=41
[WARN ] 2026-06-02 02:13:38.002 [30508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:13:38.261 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 02:13:38.261 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 02:13:39.397 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-02 02:13:39.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428357,ok=428357,error=0, records=41
[INFO ] 2026-06-02 02:13:49.054 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858396},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:13:49.223 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:13:49.223 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 02:13:49.223 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:13:49.223 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:13:49.223 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:13:49.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:13:49.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:13:49.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:13:49.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:13:49.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:13:49.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:13:53.007 [30401] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:13:53.261 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:13:54.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-02 02:13:54.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428358,ok=428358,error=0, records=41
[WARN ] 2026-06-02 02:14:08.012 [30522] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:14:08.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:14:09.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-02 02:14:09.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428359,ok=428359,error=0, records=41
[WARN ] 2026-06-02 02:14:23.018 [30620] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:14:23.263 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:14:24.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-02 02:14:24.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428360,ok=428360,error=0, records=41
[WARN ] 2026-06-02 02:14:38.022 [30577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:14:38.263 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:14:39.421 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-02 02:14:39.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428361,ok=428361,error=0, records=41
[WARN ] 2026-06-02 02:14:53.026 [30620] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:14:53.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:14:54.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-02 02:14:54.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428362,ok=428362,error=0, records=41
[INFO ] 2026-06-02 02:15:01.630 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21433/300s
[WARN ] 2026-06-02 02:15:08.033 [30634] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:15:08.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:15:09.434 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 02:15:09.434 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428363,ok=428363,error=0, records=41
[INFO ] 2026-06-02 02:15:21.036 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21424/300s
[WARN ] 2026-06-02 02:15:23.037 [30675] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:15:23.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:15:24.439 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 02:15:24.439 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428364,ok=428364,error=0, records=41
[WARN ] 2026-06-02 02:15:38.042 [30712] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:15:38.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:15:39.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 02:15:39.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428365,ok=428365,error=0, records=41
[INFO ] 2026-06-02 02:15:42.562 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21433/300s
[WARN ] 2026-06-02 02:15:53.047 [30732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:15:53.266 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:15:54.451 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 02:15:54.451 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428366,ok=428366,error=0, records=41
[INFO ] 2026-06-02 02:15:54.451 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21420/300s
[WARN ] 2026-06-02 02:16:08.053 [30732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:16:08.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:16:09.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 02:16:09.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428367,ok=428367,error=0, records=41
[WARN ] 2026-06-02 02:16:22.557 [30750] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:16:23.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:16:24.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 02:16:24.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428368,ok=428368,error=0, records=41
[WARN ] 2026-06-02 02:16:37.563 [30750] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:16:38.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:16:39.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 02:16:39.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428369,ok=428369,error=0, records=41
[INFO ] 2026-06-02 02:16:45.631 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21420/300s
[INFO ] 2026-06-02 02:16:46.388 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21429/300s
[INFO ] 2026-06-02 02:16:49.223 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17844/300s
[INFO ] 2026-06-02 02:16:49.225 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858316},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:16:49.363 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:16:49.363 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 02:16:49.363 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:16:49.363 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:16:49.363 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:16:49.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:16:49.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:16:49.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:16:49.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:16:49.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:16:49.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:16:52.567 [30796] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:16:53.269 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:16:54.473 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 02:16:54.473 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428370,ok=428370,error=0, records=41
[WARN ] 2026-06-02 02:17:07.572 [30818] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:17:08.269 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:17:08.269 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21432/300s
[INFO ] 2026-06-02 02:17:09.546 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-02 02:17:09.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428371,ok=428371,error=0, records=41
[WARN ] 2026-06-02 02:17:22.577 [30831] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:17:23.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:17:24.551 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-02 02:17:24.551 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428372,ok=428372,error=0, records=41
[WARN ] 2026-06-02 02:17:37.583 [30834] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:17:38.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:17:39.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-02 02:17:39.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428373,ok=428373,error=0, records=41
[WARN ] 2026-06-02 02:17:52.587 [30825] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:17:53.087 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21430/300s
[INFO ] 2026-06-02 02:17:53.271 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:17:54.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-02 02:17:54.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428374,ok=428374,error=0, records=41
[INFO ] 2026-06-02 02:17:54.989 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21430/300s
[INFO ] 2026-06-02 02:18:02.095 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21430/300s
[WARN ] 2026-06-02 02:18:07.593 [30883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:18:08.272 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:18:09.568 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 02:18:09.568 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428375,ok=428375,error=0, records=41
[WARN ] 2026-06-02 02:18:22.598 [30855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:18:23.272 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:18:24.574 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 02:18:24.574 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428376,ok=428376,error=0, records=41
[WARN ] 2026-06-02 02:18:37.602 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:18:38.273 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:18:39.583 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 02:18:39.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428377,ok=428377,error=0, records=41
[WARN ] 2026-06-02 02:18:52.607 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:18:53.273 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:18:54.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 02:18:54.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428378,ok=428378,error=0, records=41
[WARN ] 2026-06-02 02:19:07.612 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:19:08.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:19:09.597 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 02:19:09.597 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428379,ok=428379,error=0, records=41
[WARN ] 2026-06-02 02:19:22.617 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:19:23.275 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:19:24.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 02:19:24.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428380,ok=428380,error=0, records=41
[WARN ] 2026-06-02 02:19:37.623 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:19:38.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:19:39.609 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 02:19:39.609 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428381,ok=428381,error=0, records=41
[INFO ] 2026-06-02 02:19:49.365 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858244},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:19:49.534 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:19:49.534 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 02:19:49.534 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:19:49.534 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:19:49.534 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:19:49.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:19:49.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:19:49.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:19:49.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:19:49.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:19:49.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:19:52.628 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:19:53.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:19:54.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 02:19:54.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428382,ok=428382,error=0, records=41
[INFO ] 2026-06-02 02:20:01.634 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21434/300s
[WARN ] 2026-06-02 02:20:07.633 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:20:08.277 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:20:09.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 02:20:09.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428383,ok=428383,error=0, records=41
[INFO ] 2026-06-02 02:20:21.137 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21425/300s
[WARN ] 2026-06-02 02:20:22.638 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:20:23.277 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:20:24.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 02:20:24.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428384,ok=428384,error=0, records=41
[WARN ] 2026-06-02 02:20:37.643 [30855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:20:38.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:20:39.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 02:20:39.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428385,ok=428385,error=0, records=41
[INFO ] 2026-06-02 02:20:42.568 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21434/300s
[WARN ] 2026-06-02 02:20:52.648 [30898] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:20:53.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:20:54.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 02:20:54.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428386,ok=428386,error=0, records=41
[INFO ] 2026-06-02 02:20:54.695 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21421/300s
[WARN ] 2026-06-02 02:21:07.653 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:21:08.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:21:09.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 02:21:09.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428387,ok=428387,error=0, records=41
[WARN ] 2026-06-02 02:21:22.658 [30898] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:21:23.280 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:21:24.706 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 02:21:24.706 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428388,ok=428388,error=0, records=41
[WARN ] 2026-06-02 02:21:37.663 [30883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:21:38.280 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:21:39.713 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 02:21:39.713 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428389,ok=428389,error=0, records=41
[INFO ] 2026-06-02 02:21:45.815 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21421/300s
[INFO ] 2026-06-02 02:21:46.446 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21430/300s
[WARN ] 2026-06-02 02:21:52.667 [30898] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:21:53.281 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:21:54.722 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 02:21:54.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428390,ok=428390,error=0, records=41
[WARN ] 2026-06-02 02:22:07.672 [30855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:22:08.282 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:22:08.282 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21433/300s
[INFO ] 2026-06-02 02:22:09.728 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 02:22:09.728 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428391,ok=428391,error=0, records=41
[WARN ] 2026-06-02 02:22:22.676 [30898] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:22:23.282 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:22:24.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-02 02:22:24.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428392,ok=428392,error=0, records=41
[WARN ] 2026-06-02 02:22:37.682 [30855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:22:38.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:22:39.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 02:22:39.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428393,ok=428393,error=0, records=41
[INFO ] 2026-06-02 02:22:49.534 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17845/300s
[INFO ] 2026-06-02 02:22:49.536 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858164},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:22:49.688 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:22:49.688 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 02:22:49.688 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:22:49.688 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:22:49.688 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:22:49.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:22:49.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:22:49.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:22:49.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:22:49.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:22:49.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:22:52.686 [30848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:22:53.159 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21431/300s
[INFO ] 2026-06-02 02:22:53.284 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:22:54.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 02:22:54.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428394,ok=428394,error=0, records=41
[INFO ] 2026-06-02 02:22:55.061 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21431/300s
[INFO ] 2026-06-02 02:23:02.166 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21431/300s
[WARN ] 2026-06-02 02:23:07.692 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:23:08.284 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:23:09.749 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 02:23:09.749 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428395,ok=428395,error=0, records=41
[WARN ] 2026-06-02 02:23:22.697 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:23:23.285 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:23:24.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 02:23:24.754 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428396,ok=428396,error=0, records=41
[WARN ] 2026-06-02 02:23:37.702 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:23:38.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 02:23:38.286 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 02:23:39.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 02:23:39.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428397,ok=428397,error=0, records=41
[WARN ] 2026-06-02 02:23:52.707 [30855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:23:53.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:23:53.286 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 02:23:54.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 02:23:54.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428398,ok=428398,error=0, records=41
[WARN ] 2026-06-02 02:24:07.713 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:24:08.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:24:09.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-02 02:24:09.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428399,ok=428399,error=0, records=41
[WARN ] 2026-06-02 02:24:22.718 [30883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:24:23.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:24:24.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 02:24:24.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428400,ok=428400,error=0, records=41
[WARN ] 2026-06-02 02:24:37.724 [30848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:24:38.289 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:24:39.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 02:24:39.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428401,ok=428401,error=0, records=41
[WARN ] 2026-06-02 02:24:52.731 [30883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:24:53.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:24:54.860 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 02:24:54.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428402,ok=428402,error=0, records=41
[INFO ] 2026-06-02 02:25:01.638 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21435/300s
[WARN ] 2026-06-02 02:25:07.736 [30848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:25:08.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:25:09.868 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-02 02:25:09.868 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428403,ok=428403,error=0, records=41
[INFO ] 2026-06-02 02:25:21.241 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21426/300s
[WARN ] 2026-06-02 02:25:22.742 [30883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:25:23.291 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:25:24.879 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-02 02:25:24.879 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428404,ok=428404,error=0, records=41
[WARN ] 2026-06-02 02:25:37.747 [30883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:25:38.291 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:25:39.885 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-02 02:25:39.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428405,ok=428405,error=0, records=41
[INFO ] 2026-06-02 02:25:42.574 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21435/300s
[INFO ] 2026-06-02 02:25:49.690 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858084},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:25:49.842 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:25:49.843 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 02:25:49.843 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:25:49.843 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:25:49.843 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:25:49.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:25:49.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:25:49.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:25:49.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:25:49.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:25:49.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:25:52.752 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:25:53.292 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:25:54.891 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-02 02:25:54.891 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428406,ok=428406,error=0, records=41
[INFO ] 2026-06-02 02:25:54.891 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21422/300s
[WARN ] 2026-06-02 02:26:07.757 [30855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:26:08.293 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:26:09.899 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 02:26:09.899 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428407,ok=428407,error=0, records=41
[WARN ] 2026-06-02 02:26:22.762 [30883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:26:23.293 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:26:24.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 02:26:24.905 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428408,ok=428408,error=0, records=41
[WARN ] 2026-06-02 02:26:37.768 [30848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:26:38.294 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:26:39.911 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 02:26:39.911 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428409,ok=428409,error=0, records=41
[INFO ] 2026-06-02 02:26:45.999 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21422/300s
[INFO ] 2026-06-02 02:26:46.501 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21431/300s
[WARN ] 2026-06-02 02:26:52.775 [30883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:26:53.294 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:26:54.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 02:26:54.936 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428410,ok=428410,error=0, records=41
[WARN ] 2026-06-02 02:27:07.780 [30855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:27:08.295 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:27:08.295 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21434/300s
[INFO ] 2026-06-02 02:27:09.942 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 02:27:09.942 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428411,ok=428411,error=0, records=41
[WARN ] 2026-06-02 02:27:22.785 [30883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:27:23.296 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:27:24.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 02:27:24.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428412,ok=428412,error=0, records=41
[WARN ] 2026-06-02 02:27:37.790 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:27:38.296 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:27:39.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 02:27:39.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428413,ok=428413,error=0, records=41
[WARN ] 2026-06-02 02:27:52.795 [30848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:27:53.223 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21432/300s
[INFO ] 2026-06-02 02:27:53.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:27:55.051 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 02:27:55.051 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428414,ok=428414,error=0, records=41
[INFO ] 2026-06-02 02:27:55.125 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21432/300s
[INFO ] 2026-06-02 02:28:02.232 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21432/300s
[WARN ] 2026-06-02 02:28:07.801 [30848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:28:08.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:28:10.057 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 02:28:10.057 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428415,ok=428415,error=0, records=41
[WARN ] 2026-06-02 02:28:22.806 [30855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:28:23.298 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:28:25.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 02:28:25.063 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428416,ok=428416,error=0, records=41
[WARN ] 2026-06-02 02:28:37.812 [31450] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:28:38.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:28:40.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 02:28:40.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428417,ok=428417,error=0, records=41
[INFO ] 2026-06-02 02:28:49.843 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17846/300s
[INFO ] 2026-06-02 02:28:49.844 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20858012},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:28:50.014 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:28:50.014 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 02:28:50.014 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:28:50.014 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:28:50.014 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:28:50.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:28:50.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:28:50.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:28:50.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:28:50.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:28:50.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:28:52.816 [30872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:28:53.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:28:55.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 02:28:55.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428418,ok=428418,error=0, records=41
[WARN ] 2026-06-02 02:29:07.822 [31466] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:29:08.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:29:10.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 02:29:10.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428419,ok=428419,error=0, records=41
[WARN ] 2026-06-02 02:29:22.827 [31445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:29:23.301 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:29:25.087 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 02:29:25.087 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428420,ok=428420,error=0, records=41
[WARN ] 2026-06-02 02:29:37.832 [31445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:29:38.301 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:29:40.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 02:29:40.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428421,ok=428421,error=0, records=41
[WARN ] 2026-06-02 02:29:52.838 [31507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:29:53.302 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:29:55.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 02:29:55.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428422,ok=428422,error=0, records=41
[INFO ] 2026-06-02 02:30:01.641 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21436/300s
[WARN ] 2026-06-02 02:30:07.843 [31460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:30:08.302 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:30:10.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 02:30:10.105 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428423,ok=428423,error=0, records=41
[INFO ] 2026-06-02 02:30:21.347 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21427/300s
[WARN ] 2026-06-02 02:30:22.848 [31550] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:30:23.303 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:30:25.111 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 02:30:25.111 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428424,ok=428424,error=0, records=41
[WARN ] 2026-06-02 02:30:37.853 [31493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:30:38.304 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:30:40.202 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 02:30:40.202 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428425,ok=428425,error=0, records=41
[INFO ] 2026-06-02 02:30:42.581 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21436/300s
[WARN ] 2026-06-02 02:30:52.858 [31536] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:30:53.304 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:30:55.208 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 02:30:55.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428426,ok=428426,error=0, records=41
[INFO ] 2026-06-02 02:30:55.208 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21423/300s
[WARN ] 2026-06-02 02:31:07.864 [31536] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:31:08.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:31:10.212 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 02:31:10.213 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428427,ok=428427,error=0, records=41
[WARN ] 2026-06-02 02:31:22.869 [31578] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:31:23.306 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:31:25.218 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 02:31:25.218 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428428,ok=428428,error=0, records=41
[WARN ] 2026-06-02 02:31:37.874 [31624] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:31:38.306 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:31:40.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 02:31:40.226 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428429,ok=428429,error=0, records=41
[INFO ] 2026-06-02 02:31:46.183 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21423/300s
[INFO ] 2026-06-02 02:31:46.559 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21432/300s
[INFO ] 2026-06-02 02:31:50.016 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857932},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:31:50.175 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:31:50.175 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 02:31:50.175 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:31:50.175 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:31:50.175 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:31:50.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:31:50.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:31:50.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:31:50.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:31:50.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:31:50.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:31:52.878 [31635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:31:53.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:31:55.232 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 02:31:55.232 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428430,ok=428430,error=0, records=41
[WARN ] 2026-06-02 02:32:07.883 [31592] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:32:08.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:32:08.308 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21435/300s
[INFO ] 2026-06-02 02:32:10.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 02:32:10.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428431,ok=428431,error=0, records=41
[WARN ] 2026-06-02 02:32:22.888 [31564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:32:23.308 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:32:25.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 02:32:25.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428432,ok=428432,error=0, records=41
[WARN ] 2026-06-02 02:32:37.893 [31685] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:32:38.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:32:40.247 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 02:32:40.247 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428433,ok=428433,error=0, records=41
[WARN ] 2026-06-02 02:32:52.898 [31707] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:32:53.291 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21433/300s
[INFO ] 2026-06-02 02:32:53.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:32:55.193 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21433/300s
[INFO ] 2026-06-02 02:32:55.252 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 02:32:55.252 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428434,ok=428434,error=0, records=41
[INFO ] 2026-06-02 02:33:02.300 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21433/300s
[WARN ] 2026-06-02 02:33:07.904 [31724] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:33:08.310 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:33:10.257 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 02:33:10.257 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428435,ok=428435,error=0, records=41
[WARN ] 2026-06-02 02:33:22.911 [31724] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:33:23.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:33:25.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-02 02:33:25.262 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428436,ok=428436,error=0, records=41
[WARN ] 2026-06-02 02:33:37.917 [31750] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:33:38.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 02:33:38.311 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 02:33:40.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 02:33:40.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428437,ok=428437,error=0, records=41
[WARN ] 2026-06-02 02:33:52.923 [31756] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:33:53.312 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:33:55.273 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 02:33:55.273 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428438,ok=428438,error=0, records=41
[WARN ] 2026-06-02 02:34:07.928 [31784] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:34:08.312 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:34:10.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 02:34:10.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428439,ok=428439,error=0, records=41
[WARN ] 2026-06-02 02:34:22.933 [31785] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:34:23.313 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:34:25.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 02:34:25.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428440,ok=428440,error=0, records=41
[WARN ] 2026-06-02 02:34:37.938 [31813] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:34:38.314 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:34:40.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 02:34:40.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428441,ok=428441,error=0, records=41
[INFO ] 2026-06-02 02:34:50.176 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17847/300s
[INFO ] 2026-06-02 02:34:50.177 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857848},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:34:50.344 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:34:50.344 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 02:34:50.344 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:34:50.344 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:34:50.344 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:34:50.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:34:50.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:34:50.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:34:50.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:34:50.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:34:50.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:34:52.943 [31824] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:34:53.314 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:34:55.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 02:34:55.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428442,ok=428442,error=0, records=41
[INFO ] 2026-06-02 02:35:01.645 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21437/300s
[WARN ] 2026-06-02 02:35:07.950 [31835] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:35:08.315 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:35:10.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-02 02:35:10.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428443,ok=428443,error=0, records=41
[INFO ] 2026-06-02 02:35:21.454 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21428/300s
[WARN ] 2026-06-02 02:35:22.955 [31865] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:35:23.316 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:35:25.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 02:35:25.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428444,ok=428444,error=0, records=41
[WARN ] 2026-06-02 02:35:37.960 [31865] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:35:38.316 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:35:40.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 02:35:40.417 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428445,ok=428445,error=0, records=41
[INFO ] 2026-06-02 02:35:42.587 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21437/300s
[WARN ] 2026-06-02 02:35:52.964 [31879] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:35:53.317 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:35:55.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 02:35:55.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428446,ok=428446,error=0, records=41
[INFO ] 2026-06-02 02:35:55.428 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21424/300s
[WARN ] 2026-06-02 02:36:07.970 [31836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:36:08.318 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:36:10.435 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 02:36:10.435 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428447,ok=428447,error=0, records=41
[WARN ] 2026-06-02 02:36:22.974 [31921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:36:23.318 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:36:25.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 02:36:25.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428448,ok=428448,error=0, records=41
[WARN ] 2026-06-02 02:36:37.979 [31836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:36:38.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:36:40.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 02:36:40.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428449,ok=428449,error=0, records=41
[INFO ] 2026-06-02 02:36:46.365 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21424/300s
[INFO ] 2026-06-02 02:36:46.612 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21433/300s
[WARN ] 2026-06-02 02:36:52.984 [31935] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:36:53.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:36:55.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 02:36:55.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428450,ok=428450,error=0, records=41
[WARN ] 2026-06-02 02:37:07.989 [31893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:37:08.320 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:37:08.320 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21436/300s
[INFO ] 2026-06-02 02:37:10.461 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 02:37:10.461 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428451,ok=428451,error=0, records=41
[WARN ] 2026-06-02 02:37:22.994 [31893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:37:23.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:37:25.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 02:37:25.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428452,ok=428452,error=0, records=41
[WARN ] 2026-06-02 02:37:38.000 [31963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:37:38.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:37:40.473 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 02:37:40.473 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428453,ok=428453,error=0, records=41
[INFO ] 2026-06-02 02:37:50.346 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857772},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:37:50.517 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:37:50.517 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 02:37:50.517 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:37:50.518 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:37:50.518 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:37:50.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:37:50.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:37:50.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:37:50.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:37:50.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:37:50.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:37:53.004 [31836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:37:53.322 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:37:53.354 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21434/300s
[INFO ] 2026-06-02 02:37:55.256 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21434/300s
[INFO ] 2026-06-02 02:37:55.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 02:37:55.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428454,ok=428454,error=0, records=41
[INFO ] 2026-06-02 02:38:02.363 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21434/300s
[WARN ] 2026-06-02 02:38:08.009 [31949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:38:08.323 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:38:10.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-02 02:38:10.485 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428455,ok=428455,error=0, records=41
[WARN ] 2026-06-02 02:38:23.014 [32034] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:38:23.323 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:38:25.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 02:38:25.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428456,ok=428456,error=0, records=41
[WARN ] 2026-06-02 02:38:38.019 [31836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:38:38.324 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:38:40.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 02:38:40.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428457,ok=428457,error=0, records=41
[WARN ] 2026-06-02 02:38:53.023 [32062] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:38:53.324 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:38:53.325 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 02:38:55.501 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 02:38:55.501 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428458,ok=428458,error=0, records=41
[WARN ] 2026-06-02 02:39:08.029 [31963] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:39:08.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:39:10.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-02 02:39:10.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428459,ok=428459,error=0, records=41
[WARN ] 2026-06-02 02:39:23.034 [32076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:39:23.327 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:39:25.513 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 02:39:25.513 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428460,ok=428460,error=0, records=41
[WARN ] 2026-06-02 02:39:38.039 [32106] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:39:38.327 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:39:40.519 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 02:39:40.519 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428461,ok=428461,error=0, records=41
[WARN ] 2026-06-02 02:39:53.044 [32076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:39:53.328 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:39:55.525 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 02:39:55.525 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428462,ok=428462,error=0, records=41
[INFO ] 2026-06-02 02:40:01.648 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21438/300s
[WARN ] 2026-06-02 02:40:08.049 [32122] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:40:08.329 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:40:10.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 02:40:10.532 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428463,ok=428463,error=0, records=41
[INFO ] 2026-06-02 02:40:21.553 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21429/300s
[WARN ] 2026-06-02 02:40:22.554 [32168] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:40:23.329 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:40:25.538 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 02:40:25.538 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428464,ok=428464,error=0, records=41
[WARN ] 2026-06-02 02:40:37.560 [32182] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:40:38.330 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:40:40.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 02:40:40.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428465,ok=428465,error=0, records=41
[INFO ] 2026-06-02 02:40:42.593 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21438/300s
[INFO ] 2026-06-02 02:40:50.518 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17848/300s
[INFO ] 2026-06-02 02:40:50.519 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857692},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:40:50.700 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:40:50.700 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-02 02:40:50.700 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:40:50.700 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:40:50.700 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:40:50.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:40:50.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:40:50.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:40:50.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:40:50.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:40:50.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:40:52.566 [32196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:40:53.331 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:40:55.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 02:40:55.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428466,ok=428466,error=0, records=41
[INFO ] 2026-06-02 02:40:55.549 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21425/300s
[WARN ] 2026-06-02 02:41:07.571 [32196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:41:08.331 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:41:10.555 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 02:41:10.555 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428467,ok=428467,error=0, records=41
[WARN ] 2026-06-02 02:41:22.576 [32238] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:41:23.332 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:41:25.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 02:41:25.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428468,ok=428468,error=0, records=41
[WARN ] 2026-06-02 02:41:37.581 [32254] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:41:38.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:41:40.566 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 02:41:40.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428469,ok=428469,error=0, records=41
[INFO ] 2026-06-02 02:41:46.549 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21425/300s
[INFO ] 2026-06-02 02:41:46.670 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21434/300s
[WARN ] 2026-06-02 02:41:52.585 [32238] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:41:53.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:41:55.572 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 02:41:55.572 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428470,ok=428470,error=0, records=41
[WARN ] 2026-06-02 02:42:07.589 [32292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:42:08.334 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:42:08.334 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21437/300s
[INFO ] 2026-06-02 02:42:10.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 02:42:10.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428471,ok=428471,error=0, records=41
[WARN ] 2026-06-02 02:42:22.594 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:42:23.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:42:25.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 02:42:25.584 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428472,ok=428472,error=0, records=41
[WARN ] 2026-06-02 02:42:37.599 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:42:38.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:42:40.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 02:42:40.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428473,ok=428473,error=0, records=41
[WARN ] 2026-06-02 02:42:52.604 [32275] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:42:53.336 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:42:53.423 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21435/300s
[INFO ] 2026-06-02 02:42:55.325 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21435/300s
[INFO ] 2026-06-02 02:42:55.597 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 02:42:55.597 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428474,ok=428474,error=0, records=41
[INFO ] 2026-06-02 02:43:02.431 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21435/300s
[WARN ] 2026-06-02 02:43:07.610 [32292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:43:08.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:43:10.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 02:43:10.601 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428475,ok=428475,error=0, records=41
[WARN ] 2026-06-02 02:43:22.615 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:43:23.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:43:25.607 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 02:43:25.607 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428476,ok=428476,error=0, records=41
[WARN ] 2026-06-02 02:43:37.620 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:43:38.338 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 02:43:38.338 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 02:43:40.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 02:43:40.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428477,ok=428477,error=0, records=41
[INFO ] 2026-06-02 02:43:50.702 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857616},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:43:50.878 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:43:50.878 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 02:43:50.878 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:43:50.878 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:43:50.878 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:43:50.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:43:50.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:43:50.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:43:50.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:43:50.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:43:50.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:43:52.624 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:43:53.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:43:55.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 02:43:55.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428478,ok=428478,error=0, records=41
[WARN ] 2026-06-02 02:44:07.629 [32275] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:44:08.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:44:10.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 02:44:10.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428479,ok=428479,error=0, records=41
[WARN ] 2026-06-02 02:44:22.634 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:44:23.340 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:44:25.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 02:44:25.631 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428480,ok=428480,error=0, records=41
[WARN ] 2026-06-02 02:44:37.640 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:44:38.341 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:44:40.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 02:44:40.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428481,ok=428481,error=0, records=41
[WARN ] 2026-06-02 02:44:52.645 [32292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:44:53.341 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:44:55.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 02:44:55.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428482,ok=428482,error=0, records=41
[INFO ] 2026-06-02 02:45:01.652 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21439/300s
[WARN ] 2026-06-02 02:45:07.650 [32292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:45:08.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:45:10.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 02:45:10.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428483,ok=428483,error=0, records=41
[INFO ] 2026-06-02 02:45:21.654 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21430/300s
[WARN ] 2026-06-02 02:45:22.655 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:45:23.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:45:25.655 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 02:45:25.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428484,ok=428484,error=0, records=41
[WARN ] 2026-06-02 02:45:37.661 [32303] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:45:38.343 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:45:40.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 02:45:40.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428485,ok=428485,error=0, records=41
[INFO ] 2026-06-02 02:45:42.600 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21439/300s
[WARN ] 2026-06-02 02:45:52.664 [32303] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:45:53.344 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:45:55.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 02:45:55.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428486,ok=428486,error=0, records=41
[INFO ] 2026-06-02 02:45:55.667 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21426/300s
[WARN ] 2026-06-02 02:46:07.669 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:46:08.344 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:46:10.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 02:46:10.673 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428487,ok=428487,error=0, records=41
[WARN ] 2026-06-02 02:46:22.674 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:46:23.345 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:46:25.677 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 02:46:25.677 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428488,ok=428488,error=0, records=41
[WARN ] 2026-06-02 02:46:37.679 [32292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:46:38.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:46:40.682 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 02:46:40.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428489,ok=428489,error=0, records=41
[INFO ] 2026-06-02 02:46:46.728 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21435/300s
[INFO ] 2026-06-02 02:46:46.730 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21426/300s
[INFO ] 2026-06-02 02:46:50.878 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17849/300s
[INFO ] 2026-06-02 02:46:50.880 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857544},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:46:51.056 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:46:51.056 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 02:46:51.056 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:46:51.056 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:46:51.056 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:46:51.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:46:51.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:46:51.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:46:51.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:46:51.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:46:51.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:46:52.684 [32303] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:46:53.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:46:55.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 02:46:55.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428490,ok=428490,error=0, records=41
[WARN ] 2026-06-02 02:47:07.690 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:47:08.347 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:47:08.347 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21438/300s
[INFO ] 2026-06-02 02:47:10.701 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 02:47:10.701 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428491,ok=428491,error=0, records=41
[WARN ] 2026-06-02 02:47:22.695 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:47:23.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:47:25.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 02:47:25.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428492,ok=428492,error=0, records=41
[WARN ] 2026-06-02 02:47:37.699 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:47:38.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:47:40.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 02:47:40.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428493,ok=428493,error=0, records=41
[WARN ] 2026-06-02 02:47:52.704 [32303] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:47:53.349 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:47:53.493 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21436/300s
[INFO ] 2026-06-02 02:47:55.394 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21436/300s
[INFO ] 2026-06-02 02:47:55.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-02 02:47:55.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428494,ok=428494,error=0, records=41
[INFO ] 2026-06-02 02:48:02.499 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21436/300s
[WARN ] 2026-06-02 02:48:07.709 [32303] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:48:08.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:48:10.724 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 02:48:10.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428495,ok=428495,error=0, records=41
[WARN ] 2026-06-02 02:48:22.713 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:48:23.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:48:25.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 02:48:25.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428496,ok=428496,error=0, records=41
[WARN ] 2026-06-02 02:48:37.718 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:48:38.351 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:48:40.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 02:48:40.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428497,ok=428497,error=0, records=41
[WARN ] 2026-06-02 02:48:52.723 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:48:53.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:48:55.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 02:48:55.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428498,ok=428498,error=0, records=41
[WARN ] 2026-06-02 02:49:07.728 [32292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:49:08.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:49:10.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 02:49:10.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428499,ok=428499,error=0, records=41
[WARN ] 2026-06-02 02:49:22.734 [32303] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:49:23.353 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:49:25.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 02:49:25.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428500,ok=428500,error=0, records=41
[WARN ] 2026-06-02 02:49:37.739 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:49:38.353 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:49:40.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 02:49:40.765 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428501,ok=428501,error=0, records=41
[INFO ] 2026-06-02 02:49:51.058 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857460},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:49:51.198 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:49:51.198 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 02:49:51.198 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:49:51.198 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:49:51.198 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:49:51.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:49:51.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:49:51.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:49:51.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:49:51.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:49:51.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:49:52.744 [32275] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:49:53.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:49:55.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 02:49:55.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428502,ok=428502,error=0, records=41
[INFO ] 2026-06-02 02:50:01.656 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21440/300s
[WARN ] 2026-06-02 02:50:07.750 [32303] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:50:08.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:50:10.774 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10396, records=41
[INFO ] 2026-06-02 02:50:10.775 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428503,ok=428503,error=0, records=41
[INFO ] 2026-06-02 02:50:21.759 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21431/300s
[WARN ] 2026-06-02 02:50:22.760 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:50:23.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:50:25.779 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10440, records=41
[INFO ] 2026-06-02 02:50:25.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428504,ok=428504,error=0, records=41
[WARN ] 2026-06-02 02:50:37.767 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:50:38.356 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:50:40.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10397, records=41
[INFO ] 2026-06-02 02:50:40.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428505,ok=428505,error=0, records=41
[INFO ] 2026-06-02 02:50:42.606 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21440/300s
[WARN ] 2026-06-02 02:50:52.776 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:50:53.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:50:55.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10426, records=41
[INFO ] 2026-06-02 02:50:55.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428506,ok=428506,error=0, records=41
[INFO ] 2026-06-02 02:50:55.791 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21427/300s
[WARN ] 2026-06-02 02:51:07.787 [32275] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:51:08.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:51:10.799 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 02:51:10.799 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428507,ok=428507,error=0, records=41
[WARN ] 2026-06-02 02:51:22.795 [32292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:51:23.358 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:51:25.804 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-02 02:51:25.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428508,ok=428508,error=0, records=41
[WARN ] 2026-06-02 02:51:37.803 [32275] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:51:38.358 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:51:40.809 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-02 02:51:40.809 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428509,ok=428509,error=0, records=41
[INFO ] 2026-06-02 02:51:46.784 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21436/300s
[INFO ] 2026-06-02 02:51:46.910 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21427/300s
[WARN ] 2026-06-02 02:51:52.817 [32287] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:51:53.359 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:51:55.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-02 02:51:55.814 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428510,ok=428510,error=0, records=41
[WARN ] 2026-06-02 02:52:07.824 [355  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:52:08.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:52:08.360 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21439/300s
[INFO ] 2026-06-02 02:52:10.819 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-02 02:52:10.819 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428511,ok=428511,error=0, records=41
[WARN ] 2026-06-02 02:52:22.829 [355  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:52:23.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:52:25.825 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 02:52:25.825 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428512,ok=428512,error=0, records=41
[WARN ] 2026-06-02 02:52:37.835 [355  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:52:38.361 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:52:40.831 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 02:52:40.831 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428513,ok=428513,error=0, records=41
[INFO ] 2026-06-02 02:52:51.199 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17850/300s
[INFO ] 2026-06-02 02:52:51.200 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20273116},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:52:51.349 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:52:51.349 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 02:52:51.349 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:52:51.349 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:52:51.349 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:52:51.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:52:51.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:52:51.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:52:51.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:52:51.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:52:51.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:52:52.840 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:52:53.362 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:52:53.559 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21437/300s
[INFO ] 2026-06-02 02:52:55.474 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21437/300s
[INFO ] 2026-06-02 02:52:55.836 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-02 02:52:55.836 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428514,ok=428514,error=0, records=41
[INFO ] 2026-06-02 02:53:02.575 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21437/300s
[WARN ] 2026-06-02 02:53:07.846 [450  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:53:08.362 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:53:10.842 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-02 02:53:10.842 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428515,ok=428515,error=0, records=41
[WARN ] 2026-06-02 02:53:22.852 [365  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:53:23.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:53:25.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 02:53:25.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428516,ok=428516,error=0, records=41
[WARN ] 2026-06-02 02:53:37.857 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:53:38.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 02:53:38.363 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 02:53:40.852 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 02:53:40.852 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428517,ok=428517,error=0, records=41
[WARN ] 2026-06-02 02:53:52.863 [365  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:53:53.364 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:53:53.364 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 02:53:55.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-02 02:53:55.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428518,ok=428518,error=0, records=41
[WARN ] 2026-06-02 02:54:07.868 [32317] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:54:08.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:54:10.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 02:54:10.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428519,ok=428519,error=0, records=41
[WARN ] 2026-06-02 02:54:22.875 [355  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:54:23.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:54:25.928 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 02:54:25.928 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428520,ok=428520,error=0, records=41
[WARN ] 2026-06-02 02:54:37.881 [576  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:54:38.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:54:40.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 02:54:40.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428521,ok=428521,error=0, records=41
[WARN ] 2026-06-02 02:54:52.886 [576  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:54:53.367 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:54:55.939 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 02:54:55.939 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428522,ok=428522,error=0, records=41
[INFO ] 2026-06-02 02:55:01.659 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21441/300s
[WARN ] 2026-06-02 02:55:07.892 [597  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:55:08.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:55:10.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-02 02:55:10.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428523,ok=428523,error=0, records=41
[INFO ] 2026-06-02 02:55:21.896 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21432/300s
[WARN ] 2026-06-02 02:55:22.896 [620  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:55:23.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:55:25.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-02 02:55:25.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428524,ok=428524,error=0, records=41
[WARN ] 2026-06-02 02:55:37.901 [643  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:55:38.369 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:55:40.962 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-02 02:55:40.962 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428525,ok=428525,error=0, records=41
[INFO ] 2026-06-02 02:55:42.613 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21441/300s
[INFO ] 2026-06-02 02:55:51.351 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857324},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:55:51.538 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:55:51.538 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 02:55:51.538 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:55:51.538 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:55:51.538 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:55:51.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:55:51.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:55:51.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:55:51.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:55:51.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:55:51.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:55:52.905 [666  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:55:53.370 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:55:55.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-02 02:55:55.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428526,ok=428526,error=0, records=41
[INFO ] 2026-06-02 02:55:55.977 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21428/300s
[WARN ] 2026-06-02 02:56:07.911 [683  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:56:08.370 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:56:10.984 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 02:56:10.984 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428527,ok=428527,error=0, records=41
[WARN ] 2026-06-02 02:56:22.916 [677  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:56:23.371 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:56:25.990 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 02:56:25.990 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428528,ok=428528,error=0, records=41
[WARN ] 2026-06-02 02:56:37.922 [695  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:56:38.372 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:56:40.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 02:56:40.996 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428529,ok=428529,error=0, records=41
[INFO ] 2026-06-02 02:56:46.842 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21437/300s
[INFO ] 2026-06-02 02:56:47.086 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21428/300s
[WARN ] 2026-06-02 02:56:52.928 [728  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:56:53.372 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:56:56.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 02:56:56.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428530,ok=428530,error=0, records=41
[WARN ] 2026-06-02 02:57:07.933 [722  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:57:08.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:57:08.373 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21440/300s
[INFO ] 2026-06-02 02:57:11.007 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 02:57:11.007 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428531,ok=428531,error=0, records=41
[WARN ] 2026-06-02 02:57:22.938 [722  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:57:23.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:57:26.013 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 02:57:26.013 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428532,ok=428532,error=0, records=41
[WARN ] 2026-06-02 02:57:37.943 [761  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:57:38.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:57:41.018 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-02 02:57:41.018 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428533,ok=428533,error=0, records=41
[WARN ] 2026-06-02 02:57:52.948 [801  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:57:53.375 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:57:53.608 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21438/300s
[INFO ] 2026-06-02 02:57:55.523 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21438/300s
[INFO ] 2026-06-02 02:57:56.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 02:57:56.023 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428534,ok=428534,error=0, records=41
[INFO ] 2026-06-02 02:58:02.615 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21438/300s
[WARN ] 2026-06-02 02:58:07.953 [811  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:58:08.375 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:58:11.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 02:58:11.030 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428535,ok=428535,error=0, records=41
[WARN ] 2026-06-02 02:58:22.958 [795  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:58:23.376 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:58:26.041 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 02:58:26.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428536,ok=428536,error=0, records=41
[WARN ] 2026-06-02 02:58:37.963 [795  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:58:38.377 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:58:41.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 02:58:41.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428537,ok=428537,error=0, records=41
[INFO ] 2026-06-02 02:58:51.539 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17851/300s
[INFO ] 2026-06-02 02:58:51.540 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857260},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 02:58:51.702 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 02:58:51.702 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 02:58:51.702 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 02:58:51.702 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 02:58:51.702 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 02:58:51.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:58:51.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 02:58:51.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:58:51.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 02:58:51.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 02:58:51.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 02:58:52.968 [795  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:58:53.377 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:58:56.052 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-02 02:58:56.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428538,ok=428538,error=0, records=41
[WARN ] 2026-06-02 02:59:07.973 [811  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:59:08.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:59:11.058 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-02 02:59:11.058 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428539,ok=428539,error=0, records=41
[WARN ] 2026-06-02 02:59:22.977 [795  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:59:23.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:59:26.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-02 02:59:26.064 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428540,ok=428540,error=0, records=41
[WARN ] 2026-06-02 02:59:37.982 [884  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:59:38.379 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:59:41.069 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-02 02:59:41.069 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428541,ok=428541,error=0, records=41
[WARN ] 2026-06-02 02:59:52.985 [884  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 02:59:53.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 02:59:56.074 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-02 02:59:56.075 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428542,ok=428542,error=0, records=41
[INFO ] 2026-06-02 03:00:01.662 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21442/300s
[WARN ] 2026-06-02 03:00:07.991 [795  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:00:08.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:00:11.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 03:00:11.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428543,ok=428543,error=0, records=41
[INFO ] 2026-06-02 03:00:21.996 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21433/300s
[WARN ] 2026-06-02 03:00:22.996 [795  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:00:23.381 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:00:26.094 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 03:00:26.094 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428544,ok=428544,error=0, records=41
[WARN ] 2026-06-02 03:00:38.002 [921  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:00:38.381 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:00:41.098 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 03:00:41.098 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428545,ok=428545,error=0, records=41
[INFO ] 2026-06-02 03:00:42.619 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21442/300s
[WARN ] 2026-06-02 03:00:53.006 [971  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:00:53.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:00:56.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-02 03:00:56.104 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428546,ok=428546,error=0, records=41
[INFO ] 2026-06-02 03:00:56.104 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21429/300s
[WARN ] 2026-06-02 03:01:08.012 [971  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:01:08.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:01:11.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-02 03:01:11.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428547,ok=428547,error=0, records=41
[WARN ] 2026-06-02 03:01:23.017 [1042 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:01:23.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:01:26.149 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 03:01:26.149 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428548,ok=428548,error=0, records=41
[WARN ] 2026-06-02 03:01:38.022 [1070 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:01:38.384 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:01:41.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 03:01:41.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428549,ok=428549,error=0, records=41
[INFO ] 2026-06-02 03:01:46.896 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21438/300s
[INFO ] 2026-06-02 03:01:47.268 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21429/300s
[INFO ] 2026-06-02 03:01:51.703 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857192},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:01:51.856 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:01:51.856 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 03:01:51.856 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:01:51.856 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:01:51.856 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:01:51.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:01:51.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:01:51.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:01:51.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:01:51.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:01:51.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 03:01:53.027 [1085 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:01:53.384 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:01:56.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 03:01:56.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428550,ok=428550,error=0, records=41
[WARN ] 2026-06-02 03:02:08.032 [999  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:02:08.385 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:02:08.385 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21441/300s
[INFO ] 2026-06-02 03:02:11.166 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 03:02:11.166 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428551,ok=428551,error=0, records=41
[WARN ] 2026-06-02 03:02:23.037 [1115 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:02:23.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:02:26.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 03:02:26.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428552,ok=428552,error=0, records=41
[WARN ] 2026-06-02 03:02:38.041 [1125 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:02:38.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:02:41.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 03:02:41.175 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428553,ok=428553,error=0, records=41
[WARN ] 2026-06-02 03:02:53.046 [1149 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:02:53.387 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:02:53.653 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21439/300s
[INFO ] 2026-06-02 03:02:55.576 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21439/300s
[INFO ] 2026-06-02 03:02:56.183 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 03:02:56.183 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428554,ok=428554,error=0, records=41
[INFO ] 2026-06-02 03:03:02.661 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21439/300s
[WARN ] 2026-06-02 03:03:08.052 [1131 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:03:08.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:03:11.188 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-02 03:03:11.188 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428555,ok=428555,error=0, records=41
[WARN ] 2026-06-02 03:03:22.557 [1186 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:03:23.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:03:26.192 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-02 03:03:26.192 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428556,ok=428556,error=0, records=41
[WARN ] 2026-06-02 03:03:37.562 [1204 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:03:38.389 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 03:03:38.389 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 03:03:41.198 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-02 03:03:41.198 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428557,ok=428557,error=0, records=41
[WARN ] 2026-06-02 03:03:52.567 [1204 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:03:53.389 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:03:56.203 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-02 03:03:56.203 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428558,ok=428558,error=0, records=41
[WARN ] 2026-06-02 03:04:07.572 [1169 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:04:08.390 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:04:11.208 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 03:04:11.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428559,ok=428559,error=0, records=41
[WARN ] 2026-06-02 03:04:22.578 [1249 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:04:23.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:04:26.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 03:04:26.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428560,ok=428560,error=0, records=41
[WARN ] 2026-06-02 03:04:37.582 [1255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:04:38.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:04:41.220 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 03:04:41.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428561,ok=428561,error=0, records=41
[INFO ] 2026-06-02 03:04:51.856 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17852/300s
[INFO ] 2026-06-02 03:04:51.858 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857116},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:04:52.012 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:04:52.012 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 03:04:52.012 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:04:52.012 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:04:52.012 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:04:52.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:04:52.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:04:52.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:04:52.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:04:52.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:04:52.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 03:04:52.587 [1255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:04:53.392 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:04:56.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 03:04:56.226 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428562,ok=428562,error=0, records=41
[INFO ] 2026-06-02 03:05:01.666 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21443/300s
[WARN ] 2026-06-02 03:05:07.592 [1303 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:05:08.393 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:05:11.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-02 03:05:11.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428563,ok=428563,error=0, records=41
[INFO ] 2026-06-02 03:05:22.097 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21434/300s
[WARN ] 2026-06-02 03:05:22.598 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:05:23.393 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:05:26.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 03:05:26.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428564,ok=428564,error=0, records=41
[WARN ] 2026-06-02 03:05:37.604 [1303 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:05:38.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:05:41.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 03:05:41.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428565,ok=428565,error=0, records=41
[INFO ] 2026-06-02 03:05:42.625 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21443/300s
[WARN ] 2026-06-02 03:05:52.609 [1255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:05:53.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:05:56.255 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 03:05:56.255 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428566,ok=428566,error=0, records=41
[INFO ] 2026-06-02 03:05:56.256 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21430/300s
[WARN ] 2026-06-02 03:06:07.614 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:06:08.395 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:06:11.261 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 03:06:11.261 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428567,ok=428567,error=0, records=41
[WARN ] 2026-06-02 03:06:22.619 [1318 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:06:23.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:06:26.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 03:06:26.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428568,ok=428568,error=0, records=41
[WARN ] 2026-06-02 03:06:37.624 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:06:38.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:06:41.271 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 03:06:41.271 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428569,ok=428569,error=0, records=41
[INFO ] 2026-06-02 03:06:46.950 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21439/300s
[INFO ] 2026-06-02 03:06:47.450 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21430/300s
[WARN ] 2026-06-02 03:06:52.629 [1318 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:06:53.397 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:06:56.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 03:06:56.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428570,ok=428570,error=0, records=41
[WARN ] 2026-06-02 03:07:07.634 [1303 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:07:08.397 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:07:08.398 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21442/300s
[INFO ] 2026-06-02 03:07:11.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 03:07:11.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428571,ok=428571,error=0, records=41
[WARN ] 2026-06-02 03:07:22.639 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:07:23.398 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:07:26.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 03:07:26.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428572,ok=428572,error=0, records=41
[WARN ] 2026-06-02 03:07:37.643 [1255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:07:38.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:07:41.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 03:07:41.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428573,ok=428573,error=0, records=41
[INFO ] 2026-06-02 03:07:52.014 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20857052},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:07:52.170 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:07:52.170 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 03:07:52.170 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:07:52.170 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:07:52.170 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:07:52.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:07:52.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:07:52.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:07:52.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:07:52.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:07:52.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 03:07:52.648 [1289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:07:53.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:07:53.708 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21440/300s
[INFO ] 2026-06-02 03:07:55.644 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21440/300s
[INFO ] 2026-06-02 03:07:56.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 03:07:56.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428574,ok=428574,error=0, records=41
[INFO ] 2026-06-02 03:08:02.716 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21440/300s
[WARN ] 2026-06-02 03:08:07.652 [1255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:08:08.400 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:08:11.305 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 03:08:11.305 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428575,ok=428575,error=0, records=41
[WARN ] 2026-06-02 03:08:22.658 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:08:23.401 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:08:26.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 03:08:26.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428576,ok=428576,error=0, records=41
[WARN ] 2026-06-02 03:08:37.664 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:08:38.401 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:08:41.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 03:08:41.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428577,ok=428577,error=0, records=41
[WARN ] 2026-06-02 03:08:52.667 [1318 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:08:53.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:08:53.402 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 03:08:56.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 03:08:56.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428578,ok=428578,error=0, records=41
[WARN ] 2026-06-02 03:09:07.673 [1303 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:09:08.403 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:09:11.326 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 03:09:11.326 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428579,ok=428579,error=0, records=41
[WARN ] 2026-06-02 03:09:22.678 [1303 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:09:23.404 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:09:26.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 03:09:26.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428580,ok=428580,error=0, records=41
[WARN ] 2026-06-02 03:09:37.683 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:09:38.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:09:41.339 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 03:09:41.339 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428581,ok=428581,error=0, records=41
[WARN ] 2026-06-02 03:09:52.688 [1303 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:09:53.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:09:56.344 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 03:09:56.344 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428582,ok=428582,error=0, records=41
[INFO ] 2026-06-02 03:10:01.669 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21444/300s
[WARN ] 2026-06-02 03:10:07.693 [1255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:10:08.406 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:10:11.351 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 03:10:11.351 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428583,ok=428583,error=0, records=41
[INFO ] 2026-06-02 03:10:22.197 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21435/300s
[WARN ] 2026-06-02 03:10:22.698 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:10:23.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:10:26.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 03:10:26.357 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428584,ok=428584,error=0, records=41
[WARN ] 2026-06-02 03:10:37.704 [1289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:10:38.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:10:41.362 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 03:10:41.362 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428585,ok=428585,error=0, records=41
[INFO ] 2026-06-02 03:10:42.631 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21444/300s
[INFO ] 2026-06-02 03:10:52.170 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17853/300s
[INFO ] 2026-06-02 03:10:52.172 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856972},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:10:52.351 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:10:52.351 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 03:10:52.351 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:10:52.351 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:10:52.351 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:10:52.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:10:52.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:10:52.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:10:52.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:10:52.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:10:52.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 03:10:52.709 [1255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:10:53.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:10:56.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 03:10:56.368 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428586,ok=428586,error=0, records=41
[INFO ] 2026-06-02 03:10:56.368 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21431/300s
[WARN ] 2026-06-02 03:11:07.714 [1303 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:11:08.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:11:11.374 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-02 03:11:11.374 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428587,ok=428587,error=0, records=41
[WARN ] 2026-06-02 03:11:22.719 [1318 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:11:23.409 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:11:26.379 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 03:11:26.379 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428588,ok=428588,error=0, records=41
[WARN ] 2026-06-02 03:11:37.724 [1289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:11:38.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:11:41.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 03:11:41.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428589,ok=428589,error=0, records=41
[INFO ] 2026-06-02 03:11:47.004 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21440/300s
[INFO ] 2026-06-02 03:11:47.631 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21431/300s
[WARN ] 2026-06-02 03:11:52.730 [1289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:11:53.411 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:11:56.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 03:11:56.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428590,ok=428590,error=0, records=41
[WARN ] 2026-06-02 03:12:07.735 [1255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:12:08.411 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:12:08.411 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21443/300s
[INFO ] 2026-06-02 03:12:11.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-02 03:12:11.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428591,ok=428591,error=0, records=41
[WARN ] 2026-06-02 03:12:22.740 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:12:23.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:12:26.404 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 03:12:26.404 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428592,ok=428592,error=0, records=41
[WARN ] 2026-06-02 03:12:37.746 [1318 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:12:38.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:12:41.410 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 03:12:41.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428593,ok=428593,error=0, records=41
[WARN ] 2026-06-02 03:12:52.751 [1255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:12:53.413 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:12:53.767 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21441/300s
[INFO ] 2026-06-02 03:12:55.716 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21441/300s
[INFO ] 2026-06-02 03:12:56.416 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 03:12:56.416 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428594,ok=428594,error=0, records=41
[INFO ] 2026-06-02 03:13:02.773 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21441/300s
[WARN ] 2026-06-02 03:13:07.757 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:13:08.414 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:13:11.422 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-02 03:13:11.422 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428595,ok=428595,error=0, records=41
[WARN ] 2026-06-02 03:13:22.762 [1303 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:13:23.414 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:13:26.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 03:13:26.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428596,ok=428596,error=0, records=41
[WARN ] 2026-06-02 03:13:37.767 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:13:38.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 03:13:38.415 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 03:13:41.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 03:13:41.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428597,ok=428597,error=0, records=41
[INFO ] 2026-06-02 03:13:52.353 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856904},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:13:52.531 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:13:52.531 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 03:13:52.531 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:13:52.531 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:13:52.531 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:13:52.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:13:52.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:13:52.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:13:52.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:13:52.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:13:52.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 03:13:52.772 [1289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:13:53.416 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:13:56.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 03:13:56.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428598,ok=428598,error=0, records=41
[WARN ] 2026-06-02 03:14:07.777 [1255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:14:08.416 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:14:11.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 03:14:11.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428599,ok=428599,error=0, records=41
[WARN ] 2026-06-02 03:14:22.781 [1289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:14:23.417 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:14:26.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 03:14:26.449 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428600,ok=428600,error=0, records=41
[WARN ] 2026-06-02 03:14:37.787 [1289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:14:38.418 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:14:41.454 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 03:14:41.454 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428601,ok=428601,error=0, records=41
[WARN ] 2026-06-02 03:14:52.793 [1289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:14:53.418 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:14:56.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 03:14:56.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428602,ok=428602,error=0, records=41
[INFO ] 2026-06-02 03:15:01.672 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21445/300s
[WARN ] 2026-06-02 03:15:07.798 [1271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:15:08.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:15:11.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 03:15:11.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428603,ok=428603,error=0, records=41
[INFO ] 2026-06-02 03:15:22.302 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21436/300s
[WARN ] 2026-06-02 03:15:22.802 [1318 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:15:23.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:15:26.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 03:15:26.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428604,ok=428604,error=0, records=41
[WARN ] 2026-06-02 03:15:37.807 [1869 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:15:38.420 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:15:41.478 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 03:15:41.478 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428605,ok=428605,error=0, records=41
[INFO ] 2026-06-02 03:15:42.638 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21445/300s
[WARN ] 2026-06-02 03:15:52.811 [1874 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:15:53.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:15:56.483 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 03:15:56.483 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428606,ok=428606,error=0, records=41
[INFO ] 2026-06-02 03:15:56.483 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21432/300s
[WARN ] 2026-06-02 03:16:07.816 [1874 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:16:08.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:16:11.528 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 03:16:11.528 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428607,ok=428607,error=0, records=41
[WARN ] 2026-06-02 03:16:22.822 [1303 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:16:23.422 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:16:26.534 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 03:16:26.534 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428608,ok=428608,error=0, records=41
[WARN ] 2026-06-02 03:16:37.827 [1303 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:16:38.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:16:41.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 03:16:41.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428609,ok=428609,error=0, records=41
[INFO ] 2026-06-02 03:16:47.059 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21441/300s
[INFO ] 2026-06-02 03:16:47.813 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21432/300s
[INFO ] 2026-06-02 03:16:52.531 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17854/300s
[INFO ] 2026-06-02 03:16:52.533 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856828},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:16:52.707 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:16:52.707 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 03:16:52.707 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:16:52.707 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:16:52.707 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:16:52.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:16:52.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:16:52.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:16:52.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:16:52.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:16:52.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 03:16:52.831 [1303 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:16:53.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:16:56.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 03:16:56.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428610,ok=428610,error=0, records=41
[WARN ] 2026-06-02 03:17:07.838 [1918 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:17:08.424 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:17:08.424 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21444/300s
[INFO ] 2026-06-02 03:17:11.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 03:17:11.552 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428611,ok=428611,error=0, records=41
[WARN ] 2026-06-02 03:17:22.842 [1971 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:17:23.424 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:17:26.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 03:17:26.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428612,ok=428612,error=0, records=41
[WARN ] 2026-06-02 03:17:37.848 [1947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:17:38.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:17:41.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 03:17:41.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428613,ok=428613,error=0, records=41
[WARN ] 2026-06-02 03:17:52.853 [1947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:17:53.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:17:53.815 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21442/300s
[INFO ] 2026-06-02 03:17:55.788 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21442/300s
[INFO ] 2026-06-02 03:17:56.571 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 03:17:56.571 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428614,ok=428614,error=0, records=41
[INFO ] 2026-06-02 03:18:02.822 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21442/300s
[WARN ] 2026-06-02 03:18:07.859 [1874 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:18:08.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:18:11.576 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 03:18:11.576 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428615,ok=428615,error=0, records=41
[WARN ] 2026-06-02 03:18:22.863 [1874 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:18:23.427 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:18:26.582 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-02 03:18:26.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428616,ok=428616,error=0, records=41
[WARN ] 2026-06-02 03:18:37.868 [1999 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:18:38.427 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:18:41.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 03:18:41.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428617,ok=428617,error=0, records=41
[WARN ] 2026-06-02 03:18:52.874 [1874 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:18:53.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:18:56.648 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 03:18:56.648 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428618,ok=428618,error=0, records=41
[WARN ] 2026-06-02 03:19:07.880 [1874 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:19:08.429 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:19:11.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 03:19:11.654 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428619,ok=428619,error=0, records=41
[WARN ] 2026-06-02 03:19:22.886 [2096 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:19:23.429 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:19:26.659 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-02 03:19:26.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428620,ok=428620,error=0, records=41
[WARN ] 2026-06-02 03:19:37.892 [2112 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:19:38.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:19:41.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 03:19:41.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428621,ok=428621,error=0, records=41
[INFO ] 2026-06-02 03:19:52.709 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856752},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:19:52.869 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:19:52.870 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 03:19:52.870 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:19:52.870 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:19:52.870 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[WARN ] 2026-06-02 03:19:52.898 [2102 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:19:52.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:19:52.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:19:52.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:19:52.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:19:52.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:19:52.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:19:53.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:19:56.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-02 03:19:56.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428622,ok=428622,error=0, records=41
[INFO ] 2026-06-02 03:20:01.676 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21446/300s
[WARN ] 2026-06-02 03:20:07.904 [1874 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:20:08.431 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:20:11.676 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-02 03:20:11.676 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428623,ok=428623,error=0, records=41
[INFO ] 2026-06-02 03:20:22.408 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21437/300s
[WARN ] 2026-06-02 03:20:22.908 [2156 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:20:23.432 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:20:26.682 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-02 03:20:26.682 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428624,ok=428624,error=0, records=41
[WARN ] 2026-06-02 03:20:37.914 [2188 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:20:38.432 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:20:41.688 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10380, records=41
[INFO ] 2026-06-02 03:20:41.688 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428625,ok=428625,error=0, records=41
[INFO ] 2026-06-02 03:20:42.644 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21446/300s
[WARN ] 2026-06-02 03:20:52.920 [2198 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:20:53.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:20:56.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 03:20:56.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428626,ok=428626,error=0, records=41
[INFO ] 2026-06-02 03:20:56.695 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21433/300s
[WARN ] 2026-06-02 03:21:07.925 [2180 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:21:08.434 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:21:11.701 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 03:21:11.701 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428627,ok=428627,error=0, records=41
[WARN ] 2026-06-02 03:21:22.930 [2215 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:21:23.434 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:21:26.708 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-02 03:21:26.708 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428628,ok=428628,error=0, records=41
[WARN ] 2026-06-02 03:21:37.936 [2232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:21:38.435 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:21:41.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 03:21:41.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428629,ok=428629,error=0, records=41
[INFO ] 2026-06-02 03:21:47.113 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21442/300s
[INFO ] 2026-06-02 03:21:47.996 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21433/300s
[WARN ] 2026-06-02 03:21:52.941 [2271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:21:53.435 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:21:56.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 03:21:56.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428630,ok=428630,error=0, records=41
[WARN ] 2026-06-02 03:22:07.946 [2232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:22:08.436 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:22:08.436 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21445/300s
[INFO ] 2026-06-02 03:22:11.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 03:22:11.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428631,ok=428631,error=0, records=41
[WARN ] 2026-06-02 03:22:22.950 [2232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:22:23.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:22:26.746 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 03:22:26.746 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428632,ok=428632,error=0, records=41
[WARN ] 2026-06-02 03:22:37.956 [2311 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:22:38.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:22:41.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 03:22:41.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428633,ok=428633,error=0, records=41
[INFO ] 2026-06-02 03:22:52.870 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17855/300s
[INFO ] 2026-06-02 03:22:52.872 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856676},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-02 03:22:52.961 [2286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:22:53.025 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:22:53.025 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 03:22:53.026 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:22:53.026 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:22:53.026 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:22:53.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:22:53.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:22:53.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:22:53.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:22:53.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:22:53.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:22:53.438 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:22:53.901 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21443/300s
[INFO ] 2026-06-02 03:22:55.816 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21443/300s
[INFO ] 2026-06-02 03:22:56.768 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 03:22:56.768 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428634,ok=428634,error=0, records=41
[INFO ] 2026-06-02 03:23:02.822 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21443/300s
[WARN ] 2026-06-02 03:23:07.965 [2311 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:23:08.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:23:11.779 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 03:23:11.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428635,ok=428635,error=0, records=41
[WARN ] 2026-06-02 03:23:22.969 [2286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:23:23.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:23:26.787 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 03:23:26.787 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428636,ok=428636,error=0, records=41
[WARN ] 2026-06-02 03:23:37.975 [2232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:23:38.440 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 03:23:38.440 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 03:23:41.884 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 03:23:41.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428637,ok=428637,error=0, records=41
[WARN ] 2026-06-02 03:23:52.981 [2382 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:23:53.441 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:23:53.441 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 03:23:56.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 03:23:56.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428638,ok=428638,error=0, records=41
[WARN ] 2026-06-02 03:24:07.985 [2382 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:24:08.442 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:24:11.894 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 03:24:11.894 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428639,ok=428639,error=0, records=41
[WARN ] 2026-06-02 03:24:22.990 [2354 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:24:23.443 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:24:26.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 03:24:26.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428640,ok=428640,error=0, records=41
[WARN ] 2026-06-02 03:24:37.996 [2325 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:24:38.444 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:24:41.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 03:24:41.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428641,ok=428641,error=0, records=41
[WARN ] 2026-06-02 03:24:53.001 [2325 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:24:53.445 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:24:56.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 03:24:56.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428642,ok=428642,error=0, records=41
[INFO ] 2026-06-02 03:25:01.679 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21447/300s
[WARN ] 2026-06-02 03:25:08.006 [2354 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:25:08.445 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:25:11.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 03:25:11.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428643,ok=428643,error=0, records=41
[INFO ] 2026-06-02 03:25:22.510 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21438/300s
[WARN ] 2026-06-02 03:25:23.010 [2325 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:25:23.446 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:25:26.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 03:25:26.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428644,ok=428644,error=0, records=41
[WARN ] 2026-06-02 03:25:38.016 [2452 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:25:38.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:25:41.935 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 03:25:41.935 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428645,ok=428645,error=0, records=41
[INFO ] 2026-06-02 03:25:42.650 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21447/300s
[WARN ] 2026-06-02 03:25:53.021 [2494 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:25:53.027 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856600},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:25:53.210 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:25:53.210 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 03:25:53.210 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:25:53.210 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:25:53.210 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:25:53.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:25:53.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:25:53.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:25:53.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:25:53.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:25:53.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:25:53.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:25:56.941 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 03:25:56.942 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428646,ok=428646,error=0, records=41
[INFO ] 2026-06-02 03:25:56.942 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21434/300s
[WARN ] 2026-06-02 03:26:08.026 [2325 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:26:08.448 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:26:11.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 03:26:11.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428647,ok=428647,error=0, records=41
[WARN ] 2026-06-02 03:26:23.031 [2325 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:26:23.449 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:26:26.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 03:26:26.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428648,ok=428648,error=0, records=41
[WARN ] 2026-06-02 03:26:38.036 [2537 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:26:38.449 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:26:41.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-02 03:26:41.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428649,ok=428649,error=0, records=41
[INFO ] 2026-06-02 03:26:47.171 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21443/300s
[INFO ] 2026-06-02 03:26:48.182 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21434/300s
[WARN ] 2026-06-02 03:26:53.042 [2537 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:26:53.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:26:57.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 03:26:57.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428650,ok=428650,error=0, records=41
[WARN ] 2026-06-02 03:27:08.048 [2556 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:27:08.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:27:08.451 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21446/300s
[INFO ] 2026-06-02 03:27:12.040 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 03:27:12.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428651,ok=428651,error=0, records=41
[WARN ] 2026-06-02 03:27:23.053 [2584 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:27:23.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:27:27.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 03:27:27.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428652,ok=428652,error=0, records=41
[WARN ] 2026-06-02 03:27:37.557 [2564 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:27:38.452 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:27:42.054 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 03:27:42.054 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428653,ok=428653,error=0, records=41
[WARN ] 2026-06-02 03:27:52.562 [2584 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:27:53.452 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:27:53.971 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21444/300s
[INFO ] 2026-06-02 03:27:55.873 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21444/300s
[INFO ] 2026-06-02 03:27:57.059 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 03:27:57.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428654,ok=428654,error=0, records=41
[INFO ] 2026-06-02 03:28:02.879 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21444/300s
[WARN ] 2026-06-02 03:28:07.566 [2640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:28:08.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:28:12.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 03:28:12.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428655,ok=428655,error=0, records=41
[WARN ] 2026-06-02 03:28:22.571 [2640 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:28:23.454 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:28:27.071 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-02 03:28:27.071 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428656,ok=428656,error=0, records=41
[WARN ] 2026-06-02 03:28:37.576 [2657 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:28:38.454 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:28:42.076 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-02 03:28:42.076 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428657,ok=428657,error=0, records=41
[WARN ] 2026-06-02 03:28:52.581 [2701 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:28:53.210 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17856/300s
[INFO ] 2026-06-02 03:28:53.212 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856516},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:28:53.368 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:28:53.368 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 03:28:53.368 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:28:53.368 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:28:53.368 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:28:53.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:28:53.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:28:53.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:28:53.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:28:53.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:28:53.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:28:53.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:28:57.082 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 03:28:57.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428658,ok=428658,error=0, records=41
[WARN ] 2026-06-02 03:29:07.586 [2709 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:29:08.456 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:29:12.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 03:29:12.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428659,ok=428659,error=0, records=41
[WARN ] 2026-06-02 03:29:22.591 [2733 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:29:23.456 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:29:27.095 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-02 03:29:27.095 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428660,ok=428660,error=0, records=41
[WARN ] 2026-06-02 03:29:37.597 [2720 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:29:38.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:29:42.100 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10144, records=41
[INFO ] 2026-06-02 03:29:42.100 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428661,ok=428661,error=0, records=41
[WARN ] 2026-06-02 03:29:52.603 [2695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:29:53.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:29:57.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 03:29:57.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428662,ok=428662,error=0, records=41
[INFO ] 2026-06-02 03:30:01.682 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21448/300s
[WARN ] 2026-06-02 03:30:07.609 [2739 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:30:08.458 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:30:12.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 03:30:12.114 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428663,ok=428663,error=0, records=41
[INFO ] 2026-06-02 03:30:22.613 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21439/300s
[WARN ] 2026-06-02 03:30:22.614 [2739 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:30:23.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:30:27.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 03:30:27.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428664,ok=428664,error=0, records=41
[WARN ] 2026-06-02 03:30:37.620 [2736 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:30:38.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:30:42.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 03:30:42.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428665,ok=428665,error=0, records=41
[INFO ] 2026-06-02 03:30:42.657 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21448/300s
[WARN ] 2026-06-02 03:30:52.627 [2736 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:30:53.460 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:30:57.131 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 03:30:57.131 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428666,ok=428666,error=0, records=41
[INFO ] 2026-06-02 03:30:57.131 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21435/300s
[WARN ] 2026-06-02 03:31:07.633 [2695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:31:08.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:31:12.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 03:31:12.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428667,ok=428667,error=0, records=41
[WARN ] 2026-06-02 03:31:22.638 [2749 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:31:23.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:31:27.142 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 03:31:27.142 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428668,ok=428668,error=0, records=41
[WARN ] 2026-06-02 03:31:37.643 [2749 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:31:38.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:31:42.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 03:31:42.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428669,ok=428669,error=0, records=41
[INFO ] 2026-06-02 03:31:47.228 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21444/300s
[INFO ] 2026-06-02 03:31:48.366 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21435/300s
[WARN ] 2026-06-02 03:31:52.649 [2695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:31:53.370 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856440},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:31:53.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.35MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-02 03:31:53.529 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:31:53.529 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 03:31:57.152 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 03:31:57.152 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428670,ok=428670,error=0, records=41
[WARN ] 2026-06-02 03:32:07.654 [2736 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:32:08.463 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:32:08.463 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21447/300s
[INFO ] 2026-06-02 03:32:12.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-02 03:32:12.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428671,ok=428671,error=0, records=41
[WARN ] 2026-06-02 03:32:22.660 [2720 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:32:23.464 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:32:27.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-02 03:32:27.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428672,ok=428672,error=0, records=41
[WARN ] 2026-06-02 03:32:37.664 [2695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:32:38.464 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:32:42.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-02 03:32:42.172 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428673,ok=428673,error=0, records=41
[WARN ] 2026-06-02 03:32:52.668 [2720 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:32:53.465 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:32:54.052 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21445/300s
[INFO ] 2026-06-02 03:32:55.954 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21445/300s
[INFO ] 2026-06-02 03:32:57.177 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-02 03:32:57.177 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428674,ok=428674,error=0, records=41
[INFO ] 2026-06-02 03:33:02.961 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21445/300s
[WARN ] 2026-06-02 03:33:07.673 [2736 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:33:08.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:33:12.182 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 03:33:12.182 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428675,ok=428675,error=0, records=41
[WARN ] 2026-06-02 03:33:22.677 [2695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:33:23.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:33:27.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 03:33:27.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428676,ok=428676,error=0, records=41
[WARN ] 2026-06-02 03:33:37.684 [2695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:33:38.467 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 03:33:38.467 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 03:33:42.192 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 03:33:42.192 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428677,ok=428677,error=0, records=41
[WARN ] 2026-06-02 03:33:52.690 [2736 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:33:53.468 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:33:57.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 03:33:57.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428678,ok=428678,error=0, records=41
[WARN ] 2026-06-02 03:34:07.695 [2739 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:34:08.468 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:34:12.202 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 03:34:12.202 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428679,ok=428679,error=0, records=41
[WARN ] 2026-06-02 03:34:22.699 [2739 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:34:23.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:34:27.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 03:34:27.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428680,ok=428680,error=0, records=41
[WARN ] 2026-06-02 03:34:37.705 [2720 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:34:38.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:34:42.255 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-06-02 03:34:42.255 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428681,ok=428681,error=0, records=41
[WARN ] 2026-06-02 03:34:52.711 [2736 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:34:53.470 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:34:53.530 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17857/300s
[INFO ] 2026-06-02 03:34:53.531 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856352},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:34:53.921 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:34:53.921 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 03:34:53.921 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:34:53.921 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:34:53.921 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:34:53.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:34:53.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:34:53.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:34:53.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:34:53.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:34:53.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:34:57.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-02 03:34:57.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428682,ok=428682,error=0, records=41
[INFO ] 2026-06-02 03:35:01.686 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21449/300s
[WARN ] 2026-06-02 03:35:07.716 [2749 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:35:08.470 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:35:12.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-02 03:35:12.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428683,ok=428683,error=0, records=41
[INFO ] 2026-06-02 03:35:22.722 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21440/300s
[WARN ] 2026-06-02 03:35:22.722 [2720 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:35:23.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:35:27.271 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-02 03:35:27.271 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428684,ok=428684,error=0, records=41
[WARN ] 2026-06-02 03:35:37.727 [2695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:35:38.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:35:42.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-02 03:35:42.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428685,ok=428685,error=0, records=41
[INFO ] 2026-06-02 03:35:42.663 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21449/300s
[WARN ] 2026-06-02 03:35:52.732 [2720 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:35:53.472 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:35:57.282 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-06-02 03:35:57.282 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428686,ok=428686,error=0, records=41
[INFO ] 2026-06-02 03:35:57.282 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21436/300s
[WARN ] 2026-06-02 03:36:07.737 [2736 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:36:08.473 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:36:12.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-02 03:36:12.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428687,ok=428687,error=0, records=41
[WARN ] 2026-06-02 03:36:22.743 [2739 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:36:23.473 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:36:27.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 03:36:27.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428688,ok=428688,error=0, records=41
[WARN ] 2026-06-02 03:36:37.749 [2695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:36:38.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:36:42.301 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 03:36:42.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428689,ok=428689,error=0, records=41
[INFO ] 2026-06-02 03:36:47.284 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21445/300s
[INFO ] 2026-06-02 03:36:48.547 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21436/300s
[WARN ] 2026-06-02 03:36:52.753 [2749 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:36:53.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:36:57.308 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 03:36:57.308 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428690,ok=428690,error=0, records=41
[WARN ] 2026-06-02 03:37:07.757 [2749 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:37:08.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:37:08.475 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21448/300s
[INFO ] 2026-06-02 03:37:12.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 03:37:12.314 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428691,ok=428691,error=0, records=41
[WARN ] 2026-06-02 03:37:22.762 [2736 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:37:23.476 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:37:27.319 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 03:37:27.319 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428692,ok=428692,error=0, records=41
[WARN ] 2026-06-02 03:37:37.766 [2720 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:37:38.476 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:37:42.324 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 03:37:42.324 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428693,ok=428693,error=0, records=41
[WARN ] 2026-06-02 03:37:52.770 [2695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:37:53.477 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:37:53.922 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856276},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:37:54.085 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:37:54.085 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 03:37:54.085 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:37:54.086 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:37:54.086 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:37:54.102 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21446/300s
[INFO ] 2026-06-02 03:37:54.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:37:54.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:37:54.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:37:54.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:37:54.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:37:54.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:37:56.003 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21446/300s
[INFO ] 2026-06-02 03:37:57.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 03:37:57.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428694,ok=428694,error=0, records=41
[INFO ] 2026-06-02 03:38:03.009 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21446/300s
[WARN ] 2026-06-02 03:38:07.775 [2739 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:38:08.478 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:38:12.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 03:38:12.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428695,ok=428695,error=0, records=41
[WARN ] 2026-06-02 03:38:22.780 [2695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:38:23.478 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:38:27.341 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 03:38:27.341 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428696,ok=428696,error=0, records=41
[WARN ] 2026-06-02 03:38:37.785 [2720 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:38:38.479 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:38:42.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 03:38:42.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428697,ok=428697,error=0, records=41
[WARN ] 2026-06-02 03:38:52.790 [2695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:38:53.479 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:38:53.479 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 03:38:57.351 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 03:38:57.351 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428698,ok=428698,error=0, records=41
[WARN ] 2026-06-02 03:39:07.795 [2720 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:39:08.481 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:39:12.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 03:39:12.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428699,ok=428699,error=0, records=41
[WARN ] 2026-06-02 03:39:22.798 [2739 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:39:23.482 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:39:27.445 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-02 03:39:27.445 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428700,ok=428700,error=0, records=41
[WARN ] 2026-06-02 03:39:37.805 [2739 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:39:38.482 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:39:42.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 03:39:42.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428701,ok=428701,error=0, records=41
[WARN ] 2026-06-02 03:39:52.809 [2736 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:39:53.483 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:39:57.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10150, records=41
[INFO ] 2026-06-02 03:39:57.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428702,ok=428702,error=0, records=41
[INFO ] 2026-06-02 03:40:01.689 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21450/300s
[WARN ] 2026-06-02 03:40:07.815 [3324 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:40:08.483 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:40:12.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-02 03:40:12.465 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428703,ok=428703,error=0, records=41
[INFO ] 2026-06-02 03:40:22.820 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21441/300s
[WARN ] 2026-06-02 03:40:22.820 [3364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:40:23.484 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:40:27.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-02 03:40:27.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428704,ok=428704,error=0, records=41
[WARN ] 2026-06-02 03:40:37.826 [3364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:40:38.485 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:40:42.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-02 03:40:42.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428705,ok=428705,error=0, records=41
[INFO ] 2026-06-02 03:40:42.669 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21450/300s
[WARN ] 2026-06-02 03:40:52.830 [3406 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:40:53.485 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:40:54.086 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17858/300s
[INFO ] 2026-06-02 03:40:54.087 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856192},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:40:54.262 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:40:54.262 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 03:40:54.262 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:40:54.262 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:40:54.262 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:40:54.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:40:54.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:40:54.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:40:54.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:40:54.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:40:54.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:40:57.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-02 03:40:57.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428706,ok=428706,error=0, records=41
[INFO ] 2026-06-02 03:40:57.481 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21437/300s
[WARN ] 2026-06-02 03:41:07.835 [3364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:41:08.486 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=29.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:41:12.487 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 03:41:12.487 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428707,ok=428707,error=0, records=41
[WARN ] 2026-06-02 03:41:22.840 [3406 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:41:23.486 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:41:27.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-02 03:41:27.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428708,ok=428708,error=0, records=41
[WARN ] 2026-06-02 03:41:37.844 [3364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:41:38.487 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:41:42.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 03:41:42.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428709,ok=428709,error=0, records=41
[INFO ] 2026-06-02 03:41:47.339 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21446/300s
[INFO ] 2026-06-02 03:41:48.730 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21437/300s
[WARN ] 2026-06-02 03:41:52.849 [3324 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:41:53.488 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:41:57.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 03:41:57.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428710,ok=428710,error=0, records=41
[WARN ] 2026-06-02 03:42:07.854 [3469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:42:08.488 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:42:08.488 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21449/300s
[INFO ] 2026-06-02 03:42:12.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 03:42:12.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428711,ok=428711,error=0, records=41
[WARN ] 2026-06-02 03:42:22.861 [3324 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:42:23.489 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:42:27.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 03:42:27.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428712,ok=428712,error=0, records=41
[WARN ] 2026-06-02 03:42:37.866 [3406 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:42:38.490 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:42:42.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 03:42:42.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428713,ok=428713,error=0, records=41
[WARN ] 2026-06-02 03:42:52.871 [3406 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:42:53.490 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:42:54.165 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21447/300s
[INFO ] 2026-06-02 03:42:56.066 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21447/300s
[INFO ] 2026-06-02 03:42:57.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 03:42:57.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428714,ok=428714,error=0, records=41
[INFO ] 2026-06-02 03:43:03.072 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21447/300s
[WARN ] 2026-06-02 03:43:07.875 [3483 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:43:08.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:43:12.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 03:43:12.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428715,ok=428715,error=0, records=41
[WARN ] 2026-06-02 03:43:22.880 [3533 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:43:23.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:43:27.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 03:43:27.552 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428716,ok=428716,error=0, records=41
[WARN ] 2026-06-02 03:43:37.885 [3556 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:43:38.492 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 03:43:38.492 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 03:43:42.558 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 03:43:42.558 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428717,ok=428717,error=0, records=41
[WARN ] 2026-06-02 03:43:52.890 [3483 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:43:53.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:43:54.264 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856116},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:43:54.419 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:43:54.419 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 03:43:54.419 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:43:54.419 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:43:54.419 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:43:54.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:43:54.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:43:54.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:43:54.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:43:54.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:43:54.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:43:57.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 03:43:57.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428718,ok=428718,error=0, records=41
[WARN ] 2026-06-02 03:44:07.896 [3556 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:44:08.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:44:12.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-02 03:44:12.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428719,ok=428719,error=0, records=41
[WARN ] 2026-06-02 03:44:22.901 [3627 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:44:23.494 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:44:27.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 03:44:27.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428720,ok=428720,error=0, records=41
[WARN ] 2026-06-02 03:44:37.906 [3644 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:44:38.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:44:42.580 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 03:44:42.580 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428721,ok=428721,error=0, records=41
[WARN ] 2026-06-02 03:44:52.911 [3656 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:44:53.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:44:57.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 03:44:57.586 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428722,ok=428722,error=0, records=41
[INFO ] 2026-06-02 03:45:01.692 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21451/300s
[WARN ] 2026-06-02 03:45:07.916 [3666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:45:08.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:45:12.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-02 03:45:12.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428723,ok=428723,error=0, records=41
[INFO ] 2026-06-02 03:45:22.920 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21442/300s
[WARN ] 2026-06-02 03:45:22.921 [3666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:45:23.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:45:27.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-02 03:45:27.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428724,ok=428724,error=0, records=41
[WARN ] 2026-06-02 03:45:37.926 [3711 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:45:38.497 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:45:42.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-02 03:45:42.601 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428725,ok=428725,error=0, records=41
[INFO ] 2026-06-02 03:45:42.676 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21451/300s
[WARN ] 2026-06-02 03:45:52.932 [3728 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:45:53.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:45:57.606 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-02 03:45:57.606 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428726,ok=428726,error=0, records=41
[INFO ] 2026-06-02 03:45:57.606 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21438/300s
[WARN ] 2026-06-02 03:46:07.937 [3644 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:46:08.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:46:12.612 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 03:46:12.612 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428727,ok=428727,error=0, records=41
[WARN ] 2026-06-02 03:46:22.942 [3762 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:46:23.499 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:46:27.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 03:46:27.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428728,ok=428728,error=0, records=41
[WARN ] 2026-06-02 03:46:37.947 [3779 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:46:38.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:46:42.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 03:46:42.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428729,ok=428729,error=0, records=41
[INFO ] 2026-06-02 03:46:47.393 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21447/300s
[INFO ] 2026-06-02 03:46:48.913 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21438/300s
[WARN ] 2026-06-02 03:46:52.954 [3773 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:46:53.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:46:54.419 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17859/300s
[INFO ] 2026-06-02 03:46:54.421 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20856036},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:46:54.583 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:46:54.583 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 03:46:54.584 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:46:54.584 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:46:54.584 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:46:54.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:46:54.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:46:54.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:46:54.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:46:54.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:46:54.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:46:57.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 03:46:57.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428730,ok=428730,error=0, records=41
[WARN ] 2026-06-02 03:47:07.958 [3804 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:47:08.501 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:47:08.501 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21450/300s
[INFO ] 2026-06-02 03:47:12.640 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 03:47:12.640 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428731,ok=428731,error=0, records=41
[WARN ] 2026-06-02 03:47:22.964 [3773 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:47:23.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:47:27.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 03:47:27.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428732,ok=428732,error=0, records=41
[WARN ] 2026-06-02 03:47:37.968 [3762 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:47:38.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:47:42.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 03:47:42.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428733,ok=428733,error=0, records=41
[WARN ] 2026-06-02 03:47:52.973 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:47:53.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:47:54.218 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21448/300s
[INFO ] 2026-06-02 03:47:56.120 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21448/300s
[INFO ] 2026-06-02 03:47:57.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 03:47:57.737 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428734,ok=428734,error=0, records=41
[INFO ] 2026-06-02 03:48:03.127 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21448/300s
[WARN ] 2026-06-02 03:48:07.978 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:48:08.504 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:48:12.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 03:48:12.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428735,ok=428735,error=0, records=41
[WARN ] 2026-06-02 03:48:22.983 [3845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:48:23.504 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:48:27.749 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 03:48:27.749 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428736,ok=428736,error=0, records=41
[WARN ] 2026-06-02 03:48:37.987 [3873 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:48:38.505 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:48:42.755 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 03:48:42.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428737,ok=428737,error=0, records=41
[WARN ] 2026-06-02 03:48:52.992 [3818 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:48:53.506 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:48:57.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 03:48:57.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428738,ok=428738,error=0, records=41
[WARN ] 2026-06-02 03:49:07.997 [3901 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:49:08.506 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:49:12.768 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-02 03:49:12.768 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428739,ok=428739,error=0, records=41
[WARN ] 2026-06-02 03:49:23.001 [3915 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:49:23.507 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:49:27.773 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-02 03:49:27.773 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428740,ok=428740,error=0, records=41
[WARN ] 2026-06-02 03:49:38.007 [3901 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:49:38.507 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:49:42.779 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-02 03:49:42.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428741,ok=428741,error=0, records=41
[WARN ] 2026-06-02 03:49:53.012 [3957 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:49:53.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:49:54.585 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20855960},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:49:54.758 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:49:54.758 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 03:49:54.758 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:49:54.758 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:49:54.758 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:49:54.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:49:54.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:49:54.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:49:54.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:49:54.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:49:54.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:49:57.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10372, records=41
[INFO ] 2026-06-02 03:49:57.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428742,ok=428742,error=0, records=41
[INFO ] 2026-06-02 03:50:01.696 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21452/300s
[WARN ] 2026-06-02 03:50:08.018 [3929 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:50:08.509 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:50:12.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10388, records=41
[INFO ] 2026-06-02 03:50:12.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428743,ok=428743,error=0, records=41
[INFO ] 2026-06-02 03:50:23.021 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21443/300s
[WARN ] 2026-06-02 03:50:23.022 [3929 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:50:23.509 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:50:27.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10432, records=41
[INFO ] 2026-06-02 03:50:27.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428744,ok=428744,error=0, records=41
[WARN ] 2026-06-02 03:50:38.028 [3929 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:50:38.510 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:50:42.682 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21452/300s
[INFO ] 2026-06-02 03:50:42.820 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10453, records=41
[INFO ] 2026-06-02 03:50:42.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428745,ok=428745,error=0, records=41
[WARN ] 2026-06-02 03:50:53.034 [4013 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:50:53.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:50:57.826 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10433, records=41
[INFO ] 2026-06-02 03:50:57.826 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428746,ok=428746,error=0, records=41
[INFO ] 2026-06-02 03:50:57.826 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21439/300s
[WARN ] 2026-06-02 03:51:08.039 [4036 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:51:08.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:51:12.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10471, records=41
[INFO ] 2026-06-02 03:51:12.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428747,ok=428747,error=0, records=41
[WARN ] 2026-06-02 03:51:23.043 [4048 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:51:23.512 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:51:27.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10461, records=41
[INFO ] 2026-06-02 03:51:27.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428748,ok=428748,error=0, records=41
[WARN ] 2026-06-02 03:51:38.047 [4077 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:51:38.512 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:51:42.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10446, records=41
[INFO ] 2026-06-02 03:51:42.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428749,ok=428749,error=0, records=41
[INFO ] 2026-06-02 03:51:47.452 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21448/300s
[INFO ] 2026-06-02 03:51:49.094 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21439/300s
[WARN ] 2026-06-02 03:51:53.053 [4089 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:51:53.513 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:51:57.851 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10494, records=41
[INFO ] 2026-06-02 03:51:57.851 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428750,ok=428750,error=0, records=41
[WARN ] 2026-06-02 03:52:07.556 [4101 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:52:08.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:52:08.514 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21451/300s
[INFO ] 2026-06-02 03:52:12.857 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10399, records=41
[INFO ] 2026-06-02 03:52:12.857 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428751,ok=428751,error=0, records=41
[WARN ] 2026-06-02 03:52:22.564 [4132 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:52:23.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:52:27.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10412, records=41
[INFO ] 2026-06-02 03:52:27.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428752,ok=428752,error=0, records=41
[WARN ] 2026-06-02 03:52:37.568 [4155 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:52:38.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:52:42.872 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10404, records=41
[INFO ] 2026-06-02 03:52:42.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428753,ok=428753,error=0, records=41
[WARN ] 2026-06-02 03:52:52.573 [4174 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:52:53.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:52:54.306 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21449/300s
[INFO ] 2026-06-02 03:52:54.758 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17860/300s
[INFO ] 2026-06-02 03:52:54.759 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":17534880},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:52:54.943 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:52:54.943 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 03:52:54.943 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:52:54.943 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:52:54.943 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:52:54.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:52:54.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:52:54.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:52:54.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:52:54.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:52:54.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:52:56.194 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21449/300s
[INFO ] 2026-06-02 03:52:57.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10365, records=41
[INFO ] 2026-06-02 03:52:57.877 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428754,ok=428754,error=0, records=41
[INFO ] 2026-06-02 03:53:03.187 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21449/300s
[WARN ] 2026-06-02 03:53:07.577 [4187 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:53:08.516 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:53:12.886 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10406, records=41
[INFO ] 2026-06-02 03:53:12.886 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428755,ok=428755,error=0, records=41
[WARN ] 2026-06-02 03:53:22.582 [4208 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:53:23.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:53:27.891 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-02 03:53:27.891 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428756,ok=428756,error=0, records=41
[WARN ] 2026-06-02 03:53:37.588 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:53:38.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 03:53:38.517 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 03:53:42.896 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10382, records=41
[INFO ] 2026-06-02 03:53:42.896 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428757,ok=428757,error=0, records=41
[WARN ] 2026-06-02 03:53:52.593 [4235 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:53:53.518 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:53:53.518 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 03:53:57.901 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-02 03:53:57.901 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428758,ok=428758,error=0, records=41
[WARN ] 2026-06-02 03:54:07.602 [4208 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:54:08.520 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:54:12.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10435, records=41
[INFO ] 2026-06-02 03:54:12.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428759,ok=428759,error=0, records=41
[WARN ] 2026-06-02 03:54:22.608 [4245 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:54:23.520 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:54:27.912 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10394, records=41
[INFO ] 2026-06-02 03:54:27.912 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428760,ok=428760,error=0, records=41
[WARN ] 2026-06-02 03:54:37.616 [4206 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:54:38.521 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:54:42.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10378, records=41
[INFO ] 2026-06-02 03:54:42.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428761,ok=428761,error=0, records=41
[WARN ] 2026-06-02 03:54:52.624 [4235 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:54:53.521 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:54:57.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10386, records=41
[INFO ] 2026-06-02 03:54:57.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428762,ok=428762,error=0, records=41
[INFO ] 2026-06-02 03:55:01.699 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21453/300s
[WARN ] 2026-06-02 03:55:07.634 [4230 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:55:08.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:55:12.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10408, records=41
[INFO ] 2026-06-02 03:55:12.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428763,ok=428763,error=0, records=41
[WARN ] 2026-06-02 03:55:22.639 [4230 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:55:23.139 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21444/300s
[INFO ] 2026-06-02 03:55:23.523 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:55:27.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10409, records=41
[INFO ] 2026-06-02 03:55:27.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428764,ok=428764,error=0, records=41
[WARN ] 2026-06-02 03:55:37.647 [4206 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:55:38.523 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:55:42.688 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21453/300s
[INFO ] 2026-06-02 03:55:42.945 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10404, records=41
[INFO ] 2026-06-02 03:55:42.945 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428765,ok=428765,error=0, records=41
[WARN ] 2026-06-02 03:55:52.653 [4208 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:55:53.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:55:54.945 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":14128052},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:55:55.179 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:55:55.179 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 03:55:55.179 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:55:55.179 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:55:55.179 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:55:55.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:55:55.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:55:55.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:55:55.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:55:55.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:55:55.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:55:57.959 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-02 03:55:57.959 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428766,ok=428766,error=0, records=41
[INFO ] 2026-06-02 03:55:57.959 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21440/300s
[WARN ] 2026-06-02 03:56:07.660 [4235 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:56:08.525 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:56:12.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10432, records=41
[INFO ] 2026-06-02 03:56:12.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428767,ok=428767,error=0, records=41
[WARN ] 2026-06-02 03:56:22.668 [4208 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:56:23.525 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:56:27.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10402, records=41
[INFO ] 2026-06-02 03:56:27.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428768,ok=428768,error=0, records=41
[WARN ] 2026-06-02 03:56:37.674 [4208 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:56:38.526 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:56:42.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10406, records=41
[INFO ] 2026-06-02 03:56:42.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428769,ok=428769,error=0, records=41
[INFO ] 2026-06-02 03:56:47.519 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21449/300s
[INFO ] 2026-06-02 03:56:49.261 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21440/300s
[WARN ] 2026-06-02 03:56:52.680 [4245 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:56:53.527 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:56:57.978 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10396, records=41
[INFO ] 2026-06-02 03:56:57.978 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428770,ok=428770,error=0, records=41
[WARN ] 2026-06-02 03:57:07.686 [4208 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:57:08.527 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:57:08.527 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21452/300s
[INFO ] 2026-06-02 03:57:12.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10426, records=41
[INFO ] 2026-06-02 03:57:12.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428771,ok=428771,error=0, records=41
[WARN ] 2026-06-02 03:57:22.693 [4206 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:57:23.528 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:57:28.043 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10401, records=41
[INFO ] 2026-06-02 03:57:28.043 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428772,ok=428772,error=0, records=41
[WARN ] 2026-06-02 03:57:37.701 [4206 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:57:38.528 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:57:43.048 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10393, records=41
[INFO ] 2026-06-02 03:57:43.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428773,ok=428773,error=0, records=41
[WARN ] 2026-06-02 03:57:52.707 [4230 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:57:53.529 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:57:54.392 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21450/300s
[INFO ] 2026-06-02 03:57:56.194 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21450/300s
[INFO ] 2026-06-02 03:57:58.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10399, records=41
[INFO ] 2026-06-02 03:57:58.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428774,ok=428774,error=0, records=41
[INFO ] 2026-06-02 03:58:03.247 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21450/300s
[WARN ] 2026-06-02 03:58:07.715 [4245 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:58:08.530 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:58:13.061 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10432, records=41
[INFO ] 2026-06-02 03:58:13.061 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428775,ok=428775,error=0, records=41
[WARN ] 2026-06-02 03:58:22.728 [4206 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:58:23.530 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:58:28.066 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10386, records=41
[INFO ] 2026-06-02 03:58:28.066 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428776,ok=428776,error=0, records=41
[WARN ] 2026-06-02 03:58:37.734 [4245 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:58:38.531 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:58:43.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10404, records=41
[INFO ] 2026-06-02 03:58:43.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428777,ok=428777,error=0, records=41
[WARN ] 2026-06-02 03:58:52.741 [4235 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:58:53.532 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:58:55.179 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17861/300s
[INFO ] 2026-06-02 03:58:55.181 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":10893516},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 03:58:55.356 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 03:58:55.356 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 03:58:55.356 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 03:58:55.356 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 03:58:55.356 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 03:58:55.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:58:55.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 03:58:55.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:58:55.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 03:58:55.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:58:55.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 03:58:58.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-02 03:58:58.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428778,ok=428778,error=0, records=41
[WARN ] 2026-06-02 03:59:07.749 [4245 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:59:08.532 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:59:13.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10416, records=41
[INFO ] 2026-06-02 03:59:13.105 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428779,ok=428779,error=0, records=41
[WARN ] 2026-06-02 03:59:22.756 [4235 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:59:23.533 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:59:28.113 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-02 03:59:28.113 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428780,ok=428780,error=0, records=41
[WARN ] 2026-06-02 03:59:37.766 [4206 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:59:38.533 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:59:43.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10415, records=41
[INFO ] 2026-06-02 03:59:43.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428781,ok=428781,error=0, records=41
[WARN ] 2026-06-02 03:59:52.771 [4206 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 03:59:53.535 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 03:59:58.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10396, records=41
[INFO ] 2026-06-02 03:59:58.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428782,ok=428782,error=0, records=41
[INFO ] 2026-06-02 04:00:01.702 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21454/300s
[WARN ] 2026-06-02 04:00:07.777 [4230 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:00:08.535 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:00:13.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10417, records=41
[INFO ] 2026-06-02 04:00:13.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428783,ok=428783,error=0, records=41
[WARN ] 2026-06-02 04:00:22.783 [4230 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:00:23.283 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21445/300s
[INFO ] 2026-06-02 04:00:23.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:00:28.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10405, records=41
[INFO ] 2026-06-02 04:00:28.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428784,ok=428784,error=0, records=41
[WARN ] 2026-06-02 04:00:37.796 [4235 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:00:38.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:00:42.694 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21454/300s
[INFO ] 2026-06-02 04:00:43.140 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10394, records=41
[INFO ] 2026-06-02 04:00:43.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428785,ok=428785,error=0, records=41
[WARN ] 2026-06-02 04:00:52.801 [4235 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:00:53.537 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:00:58.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10404, records=41
[INFO ] 2026-06-02 04:00:58.145 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428786,ok=428786,error=0, records=41
[INFO ] 2026-06-02 04:00:58.145 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21441/300s
[WARN ] 2026-06-02 04:01:07.806 [4651 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:01:08.538 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:01:13.150 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 04:01:13.150 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428787,ok=428787,error=0, records=41
[WARN ] 2026-06-02 04:01:22.811 [4666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:01:23.539 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:01:28.156 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 04:01:28.156 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428788,ok=428788,error=0, records=41
[WARN ] 2026-06-02 04:01:37.818 [4681 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:01:38.539 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:01:43.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-02 04:01:43.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428789,ok=428789,error=0, records=41
[INFO ] 2026-06-02 04:01:47.588 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21450/300s
[INFO ] 2026-06-02 04:01:49.442 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21441/300s
[WARN ] 2026-06-02 04:01:52.823 [4230 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:01:53.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:01:55.358 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":7438080},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:01:55.508 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=252
[INFO ] 2026-06-02 04:01:55.508 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 04:01:55.508 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:01:55.508 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:01:55.508 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:01:55.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:01:55.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:01:55.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:01:55.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:01:55.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:01:55.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:01:58.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-02 04:01:58.170 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428790,ok=428790,error=0, records=41
[WARN ] 2026-06-02 04:02:07.831 [4709 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:02:08.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:02:08.540 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21453/300s
[INFO ] 2026-06-02 04:02:13.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-02 04:02:13.175 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428791,ok=428791,error=0, records=41
[WARN ] 2026-06-02 04:02:22.836 [4723 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:02:23.541 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:02:28.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-02 04:02:28.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428792,ok=428792,error=0, records=41
[WARN ] 2026-06-02 04:02:37.842 [4709 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:02:38.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:02:43.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-02 04:02:43.186 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428793,ok=428793,error=0, records=41
[WARN ] 2026-06-02 04:02:52.849 [4709 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:02:53.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:02:54.469 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21451/300s
[INFO ] 2026-06-02 04:02:56.243 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21451/300s
[INFO ] 2026-06-02 04:02:58.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-02 04:02:58.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428794,ok=428794,error=0, records=41
[INFO ] 2026-06-02 04:03:03.285 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21451/300s
[WARN ] 2026-06-02 04:03:07.857 [4733 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:03:08.543 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:03:13.273 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10446, records=41
[INFO ] 2026-06-02 04:03:13.273 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428795,ok=428795,error=0, records=41
[WARN ] 2026-06-02 04:03:22.864 [4723 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:03:23.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:03:28.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10407, records=41
[INFO ] 2026-06-02 04:03:28.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428796,ok=428796,error=0, records=41
[WARN ] 2026-06-02 04:03:37.882 [4651 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:03:38.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 04:03:38.544 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 04:03:43.289 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10414, records=41
[INFO ] 2026-06-02 04:03:43.289 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428797,ok=428797,error=0, records=41
[WARN ] 2026-06-02 04:03:52.887 [4813 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:03:53.545 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:03:58.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10449, records=41
[INFO ] 2026-06-02 04:03:58.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428798,ok=428798,error=0, records=41
[WARN ] 2026-06-02 04:04:07.894 [4824 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:04:08.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:04:13.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10379, records=41
[INFO ] 2026-06-02 04:04:13.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428799,ok=428799,error=0, records=41
[WARN ] 2026-06-02 04:04:22.902 [4808 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:04:23.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:04:28.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10403, records=41
[INFO ] 2026-06-02 04:04:28.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428800,ok=428800,error=0, records=41
[WARN ] 2026-06-02 04:04:37.906 [4858 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:04:38.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:04:43.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10382, records=41
[INFO ] 2026-06-02 04:04:43.314 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428801,ok=428801,error=0, records=41
[WARN ] 2026-06-02 04:04:52.915 [4881 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:04:53.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:04:55.509 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17862/300s
[INFO ] 2026-06-02 04:04:55.510 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":4471152},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:04:55.661 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=252
[INFO ] 2026-06-02 04:04:55.661 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 04:04:55.661 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:04:55.661 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:04:55.661 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:04:55.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:04:55.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:04:55.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:04:55.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:04:55.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:04:55.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:04:58.320 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10420, records=41
[INFO ] 2026-06-02 04:04:58.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428802,ok=428802,error=0, records=41
[INFO ] 2026-06-02 04:05:01.707 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21455/300s
[WARN ] 2026-06-02 04:05:07.920 [4808 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:05:08.548 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:05:13.326 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 04:05:13.326 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428803,ok=428803,error=0, records=41
[WARN ] 2026-06-02 04:05:22.926 [4825 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:05:23.427 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21446/300s
[INFO ] 2026-06-02 04:05:23.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:05:28.332 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-02 04:05:28.332 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428804,ok=428804,error=0, records=41
[WARN ] 2026-06-02 04:05:37.933 [4926 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:05:38.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:05:42.701 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21455/300s
[INFO ] 2026-06-02 04:05:43.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 04:05:43.337 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428805,ok=428805,error=0, records=41
[WARN ] 2026-06-02 04:05:52.939 [4881 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:05:53.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:05:58.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 04:05:58.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428806,ok=428806,error=0, records=41
[INFO ] 2026-06-02 04:05:58.348 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21442/300s
[WARN ] 2026-06-02 04:06:07.947 [4881 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:06:08.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:06:13.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10440, records=41
[INFO ] 2026-06-02 04:06:13.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428807,ok=428807,error=0, records=41
[WARN ] 2026-06-02 04:06:22.953 [4965 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:06:23.551 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:06:28.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10429, records=41
[INFO ] 2026-06-02 04:06:28.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428808,ok=428808,error=0, records=41
[WARN ] 2026-06-02 04:06:37.959 [4960 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:06:38.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:06:43.366 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10424, records=41
[INFO ] 2026-06-02 04:06:43.366 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428809,ok=428809,error=0, records=41
[INFO ] 2026-06-02 04:06:47.658 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21451/300s
[INFO ] 2026-06-02 04:06:49.633 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21442/300s
[WARN ] 2026-06-02 04:06:52.968 [4989 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:06:53.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:06:58.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10435, records=41
[INFO ] 2026-06-02 04:06:58.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428810,ok=428810,error=0, records=41
[WARN ] 2026-06-02 04:07:07.974 [4989 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:07:08.553 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:07:08.553 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21454/300s
[INFO ] 2026-06-02 04:07:13.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10452, records=41
[INFO ] 2026-06-02 04:07:13.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428811,ok=428811,error=0, records=41
[WARN ] 2026-06-02 04:07:22.986 [4965 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:07:23.554 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:07:28.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10435, records=41
[INFO ] 2026-06-02 04:07:28.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428812,ok=428812,error=0, records=41
[WARN ] 2026-06-02 04:07:37.996 [4959 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:07:38.554 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:07:43.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10424, records=41
[INFO ] 2026-06-02 04:07:43.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428813,ok=428813,error=0, records=41
[WARN ] 2026-06-02 04:07:53.002 [4959 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:07:53.555 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:07:54.492 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21452/300s
[INFO ] 2026-06-02 04:07:55.689 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":4471076},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:07:55.857 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=252
[INFO ] 2026-06-02 04:07:55.857 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 04:07:55.869 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:07:55.869 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:07:55.869 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:07:55.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:07:55.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:07:55.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:07:55.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:07:55.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:07:55.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:07:56.269 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21452/300s
[INFO ] 2026-06-02 04:07:58.391 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10449, records=41
[INFO ] 2026-06-02 04:07:58.391 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428814,ok=428814,error=0, records=41
[INFO ] 2026-06-02 04:08:03.380 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21452/300s
[WARN ] 2026-06-02 04:08:08.006 [5043 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:08:08.555 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:08:13.396 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10444, records=41
[INFO ] 2026-06-02 04:08:13.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428815,ok=428815,error=0, records=41
[WARN ] 2026-06-02 04:08:23.011 [5071 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:08:23.556 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:08:28.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10433, records=41
[INFO ] 2026-06-02 04:08:28.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428816,ok=428816,error=0, records=41
[WARN ] 2026-06-02 04:08:38.018 [4960 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:08:38.557 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:08:43.410 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10449, records=41
[INFO ] 2026-06-02 04:08:43.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428817,ok=428817,error=0, records=41
[WARN ] 2026-06-02 04:08:53.026 [5100 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:08:53.557 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:08:53.557 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 04:08:58.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10448, records=41
[INFO ] 2026-06-02 04:08:58.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428818,ok=428818,error=0, records=41
[WARN ] 2026-06-02 04:09:08.031 [4959 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:09:08.559 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:09:13.421 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-02 04:09:13.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428819,ok=428819,error=0, records=41
[WARN ] 2026-06-02 04:09:23.037 [5128 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:09:23.559 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:09:28.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-02 04:09:28.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428820,ok=428820,error=0, records=41
[WARN ] 2026-06-02 04:09:38.043 [5166 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:09:38.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:09:43.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-02 04:09:43.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428821,ok=428821,error=0, records=41
[WARN ] 2026-06-02 04:09:53.048 [5181 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:09:53.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:09:58.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-02 04:09:58.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428822,ok=428822,error=0, records=41
[INFO ] 2026-06-02 04:10:01.711 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21456/300s
[WARN ] 2026-06-02 04:10:07.554 [5206 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:10:08.561 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:10:13.445 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10481, records=41
[INFO ] 2026-06-02 04:10:13.445 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428823,ok=428823,error=0, records=41
[WARN ] 2026-06-02 04:10:22.561 [5219 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:10:23.561 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21447/300s
[INFO ] 2026-06-02 04:10:23.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:10:28.454 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10472, records=41
[INFO ] 2026-06-02 04:10:28.454 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428824,ok=428824,error=0, records=41
[WARN ] 2026-06-02 04:10:37.566 [5218 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:10:38.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:10:42.707 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21456/300s
[INFO ] 2026-06-02 04:10:43.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10456, records=41
[INFO ] 2026-06-02 04:10:43.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428825,ok=428825,error=0, records=41
[WARN ] 2026-06-02 04:10:52.572 [5259 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:10:53.563 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:10:55.869 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17863/300s
[INFO ] 2026-06-02 04:10:55.873 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854444},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:10:56.031 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:10:56.031 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 04:10:56.031 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:10:56.031 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:10:56.031 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:10:56.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:10:56.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:10:56.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:10:56.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:10:56.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:10:56.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:10:58.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10445, records=41
[INFO ] 2026-06-02 04:10:58.465 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428826,ok=428826,error=0, records=41
[INFO ] 2026-06-02 04:10:58.465 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21443/300s
[WARN ] 2026-06-02 04:11:07.576 [5276 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:11:08.564 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:11:13.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10457, records=41
[INFO ] 2026-06-02 04:11:13.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428827,ok=428827,error=0, records=41
[WARN ] 2026-06-02 04:11:22.590 [5297 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:11:23.564 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:11:28.477 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10460, records=41
[INFO ] 2026-06-02 04:11:28.477 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428828,ok=428828,error=0, records=41
[WARN ] 2026-06-02 04:11:37.594 [5236 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:11:38.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:11:43.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10478, records=41
[INFO ] 2026-06-02 04:11:43.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428829,ok=428829,error=0, records=41
[WARN ] 2026-06-02 04:11:47.597 [5297 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/3980/stat), No such file or directory
[WARN ] 2026-06-02 04:11:47.598 [5297 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/3977/stat), No such file or directory
[INFO ] 2026-06-02 04:11:47.717 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21452/300s
[INFO ] 2026-06-02 04:11:49.804 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21443/300s
[WARN ] 2026-06-02 04:11:52.598 [5315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:11:53.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:11:58.489 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10562, records=41
[INFO ] 2026-06-02 04:11:58.489 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428830,ok=428830,error=0, records=41
[WARN ] 2026-06-02 04:12:07.607 [5315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:12:08.566 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:12:08.566 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21455/300s
[INFO ] 2026-06-02 04:12:13.494 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 04:12:13.494 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428831,ok=428831,error=0, records=41
[WARN ] 2026-06-02 04:12:22.621 [5285 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:12:23.567 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:12:28.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-02 04:12:28.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428832,ok=428832,error=0, records=41
[WARN ] 2026-06-02 04:12:37.626 [5315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:12:38.567 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:12:43.505 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-02 04:12:44.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428833,ok=428833,error=0, records=41
[WARN ] 2026-06-02 04:12:52.631 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:12:53.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:12:54.576 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21453/300s
[INFO ] 2026-06-02 04:12:56.366 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21453/300s
[INFO ] 2026-06-02 04:12:59.369 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-02 04:12:59.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428834,ok=428834,error=0, records=41
[INFO ] 2026-06-02 04:13:03.455 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21453/300s
[WARN ] 2026-06-02 04:13:07.638 [5297 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:13:08.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:13:14.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10420, records=41
[INFO ] 2026-06-02 04:13:14.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428835,ok=428835,error=0, records=41
[WARN ] 2026-06-02 04:13:22.646 [5297 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:13:23.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:13:29.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10419, records=41
[INFO ] 2026-06-02 04:13:29.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428836,ok=428836,error=0, records=41
[WARN ] 2026-06-02 04:13:32.652 [5298 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5310/stat), No such file or directory
[WARN ] 2026-06-02 04:13:32.653 [5298 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5309/stat), No such file or directory
[WARN ] 2026-06-02 04:13:37.655 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:13:38.570 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 04:13:38.573 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 04:13:44.384 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-02 04:13:44.384 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428837,ok=428837,error=0, records=41
[WARN ] 2026-06-02 04:13:47.660 [5297 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5310/stat), No such file or directory
[WARN ] 2026-06-02 04:13:47.660 [5297 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5309/stat), No such file or directory
[WARN ] 2026-06-02 04:13:52.660 [5297 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:13:53.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:13:56.035 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20426408},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:13:56.202 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:13:56.203 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 04:13:56.203 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:13:56.203 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:13:56.203 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:13:56.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:13:56.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:13:56.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:13:56.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:13:56.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:13:56.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:13:59.389 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-02 04:13:59.389 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428838,ok=428838,error=0, records=41
[WARN ] 2026-06-02 04:14:07.668 [5236 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:14:08.575 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:14:14.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 04:14:14.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428839,ok=428839,error=0, records=41
[WARN ] 2026-06-02 04:14:22.673 [5285 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:14:23.575 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:14:29.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10100, records=41
[INFO ] 2026-06-02 04:14:29.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428840,ok=428840,error=0, records=41
[WARN ] 2026-06-02 04:14:37.679 [5297 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:14:38.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:14:44.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10108, records=41
[INFO ] 2026-06-02 04:14:44.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428841,ok=428841,error=0, records=41
[WARN ] 2026-06-02 04:14:52.684 [5297 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:14:53.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:14:59.411 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10074, records=41
[INFO ] 2026-06-02 04:14:59.411 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428842,ok=428842,error=0, records=41
[INFO ] 2026-06-02 04:15:01.715 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21457/300s
[WARN ] 2026-06-02 04:15:07.691 [5315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:15:08.577 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:15:14.418 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-02 04:15:14.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428843,ok=428843,error=0, records=41
[WARN ] 2026-06-02 04:15:22.696 [5315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:15:23.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:15:23.696 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21448/300s
[INFO ] 2026-06-02 04:15:29.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 04:15:29.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428844,ok=428844,error=0, records=41
[WARN ] 2026-06-02 04:15:37.700 [5297 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:15:38.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:15:42.713 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21457/300s
[INFO ] 2026-06-02 04:15:44.436 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 04:15:44.436 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428845,ok=428845,error=0, records=41
[WARN ] 2026-06-02 04:15:52.705 [5236 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:15:53.579 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:15:59.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 04:15:59.441 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428846,ok=428846,error=0, records=41
[INFO ] 2026-06-02 04:15:59.441 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21444/300s
[WARN ] 2026-06-02 04:16:07.709 [5236 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:16:08.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:16:14.445 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 04:16:14.445 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428847,ok=428847,error=0, records=41
[WARN ] 2026-06-02 04:16:22.716 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:16:23.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:16:29.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 04:16:29.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428848,ok=428848,error=0, records=41
[WARN ] 2026-06-02 04:16:37.721 [5236 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:16:38.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:16:44.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 04:16:44.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428849,ok=428849,error=0, records=41
[INFO ] 2026-06-02 04:16:47.769 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21453/300s
[INFO ] 2026-06-02 04:16:49.987 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21444/300s
[WARN ] 2026-06-02 04:16:52.726 [5315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:16:53.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:16:56.203 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17864/300s
[INFO ] 2026-06-02 04:16:56.204 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20855244},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:16:56.374 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:16:56.374 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 04:16:56.374 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:16:56.374 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:16:56.374 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:16:56.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:16:56.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:16:56.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:16:56.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:16:56.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:16:56.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:16:59.461 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 04:16:59.461 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428850,ok=428850,error=0, records=41
[WARN ] 2026-06-02 04:17:07.731 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:17:08.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:17:08.582 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21456/300s
[INFO ] 2026-06-02 04:17:14.466 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 04:17:14.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428851,ok=428851,error=0, records=41
[WARN ] 2026-06-02 04:17:22.736 [5236 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:17:23.583 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:17:29.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 04:17:29.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428852,ok=428852,error=0, records=41
[WARN ] 2026-06-02 04:17:37.742 [5285 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:17:38.583 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:17:44.478 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 04:17:44.478 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428853,ok=428853,error=0, records=41
[WARN ] 2026-06-02 04:17:52.747 [5297 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:17:53.584 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:17:54.639 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21454/300s
[INFO ] 2026-06-02 04:17:56.441 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21454/300s
[INFO ] 2026-06-02 04:17:59.483 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 04:17:59.483 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428854,ok=428854,error=0, records=41
[INFO ] 2026-06-02 04:18:03.510 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21454/300s
[WARN ] 2026-06-02 04:18:07.751 [5285 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:18:08.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:18:14.489 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 04:18:14.489 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428855,ok=428855,error=0, records=41
[WARN ] 2026-06-02 04:18:22.757 [5285 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:18:23.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:18:29.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 04:18:29.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428856,ok=428856,error=0, records=41
[WARN ] 2026-06-02 04:18:37.762 [5297 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:18:38.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:18:44.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 04:18:44.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428857,ok=428857,error=0, records=41
[WARN ] 2026-06-02 04:18:52.768 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:18:53.587 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:18:59.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 04:18:59.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428858,ok=428858,error=0, records=41
[WARN ] 2026-06-02 04:19:07.772 [5315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:19:08.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:19:14.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 04:19:14.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428859,ok=428859,error=0, records=41
[WARN ] 2026-06-02 04:19:22.778 [5298 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:19:23.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:19:29.517 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 04:19:29.517 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428860,ok=428860,error=0, records=41
[WARN ] 2026-06-02 04:19:37.783 [5315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:19:38.589 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:19:44.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 04:19:44.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428861,ok=428861,error=0, records=41
[WARN ] 2026-06-02 04:19:52.789 [5236 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:19:53.590 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:19:56.375 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20855172},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:19:56.540 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:19:56.540 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 04:19:56.540 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:19:56.540 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:19:56.540 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:19:56.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:19:56.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:19:56.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:19:56.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:19:56.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:19:56.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:19:59.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 04:19:59.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428862,ok=428862,error=0, records=41
[INFO ] 2026-06-02 04:20:01.718 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21458/300s
[WARN ] 2026-06-02 04:20:07.794 [5236 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:20:08.590 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:20:14.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-02 04:20:14.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428863,ok=428863,error=0, records=41
[WARN ] 2026-06-02 04:20:22.798 [5297 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:20:23.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:20:23.798 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21449/300s
[INFO ] 2026-06-02 04:20:29.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 04:20:29.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428864,ok=428864,error=0, records=41
[WARN ] 2026-06-02 04:20:37.803 [5803 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:20:38.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:20:42.720 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21458/300s
[INFO ] 2026-06-02 04:20:44.546 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 04:20:44.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428865,ok=428865,error=0, records=41
[WARN ] 2026-06-02 04:20:52.809 [5817 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:20:53.592 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:20:59.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-02 04:20:59.585 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428866,ok=428866,error=0, records=41
[INFO ] 2026-06-02 04:20:59.585 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21445/300s
[WARN ] 2026-06-02 04:21:07.814 [5817 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:21:08.592 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:21:14.591 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 04:21:14.591 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428867,ok=428867,error=0, records=41
[WARN ] 2026-06-02 04:21:22.820 [5297 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:21:23.593 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:21:29.598 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 04:21:29.598 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428868,ok=428868,error=0, records=41
[WARN ] 2026-06-02 04:21:37.825 [5845 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:21:38.594 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:21:44.610 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-02 04:21:44.610 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428869,ok=428869,error=0, records=41
[INFO ] 2026-06-02 04:21:47.822 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21454/300s
[INFO ] 2026-06-02 04:21:50.168 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21445/300s
[WARN ] 2026-06-02 04:21:52.831 [5236 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:21:53.594 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:21:59.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-02 04:21:59.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428870,ok=428870,error=0, records=41
[WARN ] 2026-06-02 04:22:07.837 [5922 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:22:08.595 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:22:08.595 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21457/300s
[INFO ] 2026-06-02 04:22:14.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-02 04:22:14.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428871,ok=428871,error=0, records=41
[WARN ] 2026-06-02 04:22:22.842 [5827 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:22:23.595 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:22:29.626 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 04:22:29.626 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428872,ok=428872,error=0, records=41
[WARN ] 2026-06-02 04:22:37.848 [5959 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:22:38.596 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:22:44.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 04:22:44.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428873,ok=428873,error=0, records=41
[WARN ] 2026-06-02 04:22:52.853 [5945 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:22:53.596 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:22:54.690 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21455/300s
[INFO ] 2026-06-02 04:22:56.492 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21455/300s
[INFO ] 2026-06-02 04:22:56.541 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17865/300s
[INFO ] 2026-06-02 04:22:56.542 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20855072},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:22:56.711 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:22:56.711 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 04:22:56.711 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:22:56.711 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:22:56.711 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:22:56.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:22:56.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:22:56.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:22:56.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:22:56.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:22:56.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:22:59.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 04:22:59.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428874,ok=428874,error=0, records=41
[INFO ] 2026-06-02 04:23:03.555 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21455/300s
[WARN ] 2026-06-02 04:23:07.859 [5827 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:23:08.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:23:14.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 04:23:14.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428875,ok=428875,error=0, records=41
[WARN ] 2026-06-02 04:23:22.864 [5945 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:23:23.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:23:29.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 04:23:29.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428876,ok=428876,error=0, records=41
[WARN ] 2026-06-02 04:23:37.868 [5959 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:23:38.598 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 04:23:38.598 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 04:23:44.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 04:23:44.654 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428877,ok=428877,error=0, records=41
[WARN ] 2026-06-02 04:23:52.874 [5945 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:23:53.599 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:23:53.599 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 04:23:59.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 04:23:59.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428878,ok=428878,error=0, records=41
[WARN ] 2026-06-02 04:24:07.880 [5945 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:24:08.600 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:24:14.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 04:24:14.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428879,ok=428879,error=0, records=41
[WARN ] 2026-06-02 04:24:22.886 [5959 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:24:23.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:24:29.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 04:24:29.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428880,ok=428880,error=0, records=41
[WARN ] 2026-06-02 04:24:37.891 [6083 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:24:38.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:24:44.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 04:24:44.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428881,ok=428881,error=0, records=41
[WARN ] 2026-06-02 04:24:52.895 [6078 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:24:53.602 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:24:59.682 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 04:24:59.682 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428882,ok=428882,error=0, records=41
[INFO ] 2026-06-02 04:25:01.723 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21459/300s
[WARN ] 2026-06-02 04:25:07.901 [6105 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:25:08.603 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:25:14.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 04:25:14.691 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428883,ok=428883,error=0, records=41
[WARN ] 2026-06-02 04:25:22.906 [6111 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:25:23.603 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:25:23.906 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21450/300s
[INFO ] 2026-06-02 04:25:29.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 04:25:29.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428884,ok=428884,error=0, records=41
[WARN ] 2026-06-02 04:25:37.912 [6149 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:25:38.604 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:25:42.726 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21459/300s
[INFO ] 2026-06-02 04:25:44.701 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 04:25:44.701 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428885,ok=428885,error=0, records=41
[WARN ] 2026-06-02 04:25:52.917 [6166 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:25:53.605 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:25:56.713 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854996},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:25:56.876 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:25:56.876 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 04:25:56.876 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:25:56.876 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:25:56.876 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:25:56.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:25:56.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:25:56.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:25:56.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:25:56.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:25:56.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:25:59.707 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 04:25:59.707 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428886,ok=428886,error=0, records=41
[INFO ] 2026-06-02 04:25:59.707 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21446/300s
[WARN ] 2026-06-02 04:26:07.921 [6178 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:26:08.605 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:26:14.712 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 04:26:14.712 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428887,ok=428887,error=0, records=41
[WARN ] 2026-06-02 04:26:22.926 [6149 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:26:23.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:26:29.724 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 04:26:29.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428888,ok=428888,error=0, records=41
[WARN ] 2026-06-02 04:26:37.931 [6188 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:26:38.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:26:44.731 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 04:26:44.731 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428889,ok=428889,error=0, records=41
[INFO ] 2026-06-02 04:26:47.873 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21455/300s
[INFO ] 2026-06-02 04:26:50.347 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21446/300s
[WARN ] 2026-06-02 04:26:52.935 [6231 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:26:53.607 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:26:59.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 04:26:59.740 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428890,ok=428890,error=0, records=41
[WARN ] 2026-06-02 04:27:07.941 [6242 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:27:08.608 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:27:08.608 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21458/300s
[INFO ] 2026-06-02 04:27:14.755 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 04:27:14.755 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428891,ok=428891,error=0, records=41
[WARN ] 2026-06-02 04:27:22.946 [6264 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:27:23.608 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:27:29.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 04:27:29.761 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428892,ok=428892,error=0, records=41
[WARN ] 2026-06-02 04:27:37.951 [6247 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:27:38.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:27:44.766 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 04:27:44.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428893,ok=428893,error=0, records=41
[WARN ] 2026-06-02 04:27:52.956 [6289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:27:53.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:27:54.762 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21456/300s
[INFO ] 2026-06-02 04:27:56.563 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21456/300s
[INFO ] 2026-06-02 04:27:59.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 04:27:59.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428894,ok=428894,error=0, records=41
[INFO ] 2026-06-02 04:28:03.604 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21456/300s
[WARN ] 2026-06-02 04:28:07.961 [6247 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:28:08.610 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:28:14.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 04:28:14.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428895,ok=428895,error=0, records=41
[WARN ] 2026-06-02 04:28:22.965 [6352 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:28:23.611 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:28:29.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 04:28:29.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428896,ok=428896,error=0, records=41
[WARN ] 2026-06-02 04:28:32.469 [6352 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5873/stat), No such file or directory
[WARN ] 2026-06-02 04:28:32.470 [6352 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5893/stat), No such file or directory
[WARN ] 2026-06-02 04:28:37.971 [6289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:28:38.611 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:28:44.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-02 04:28:44.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428897,ok=428897,error=0, records=41
[WARN ] 2026-06-02 04:28:47.474 [6247 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5873/stat), No such file or directory
[WARN ] 2026-06-02 04:28:47.475 [6247 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5893/stat), No such file or directory
[WARN ] 2026-06-02 04:28:52.977 [6305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:28:53.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:28:56.877 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17866/300s
[INFO ] 2026-06-02 04:28:56.878 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854888},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:28:57.040 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:28:57.040 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 04:28:57.040 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:28:57.040 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:28:57.040 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:28:57.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:28:57.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:28:57.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:28:57.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:28:57.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:28:57.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:28:59.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-02 04:28:59.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428898,ok=428898,error=0, records=41
[WARN ] 2026-06-02 04:29:07.982 [6352 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:29:08.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:29:14.800 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 04:29:14.801 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428899,ok=428899,error=0, records=41
[WARN ] 2026-06-02 04:29:17.486 [6352 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5907/stat), No such file or directory
[WARN ] 2026-06-02 04:29:22.987 [6433 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:29:23.613 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:29:29.885 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 04:29:29.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428900,ok=428900,error=0, records=41
[WARN ] 2026-06-02 04:29:32.491 [6305 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5907/stat), No such file or directory
[WARN ] 2026-06-02 04:29:37.993 [6305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:29:38.614 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:29:44.893 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 04:29:44.893 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428901,ok=428901,error=0, records=41
[WARN ] 2026-06-02 04:29:47.496 [6419 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/5907/stat), No such file or directory
[WARN ] 2026-06-02 04:29:52.997 [6305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:29:53.614 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:29:59.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 04:29:59.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428902,ok=428902,error=0, records=41
[INFO ] 2026-06-02 04:30:01.727 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21460/300s
[WARN ] 2026-06-02 04:30:08.001 [6275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:30:08.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:30:14.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-02 04:30:14.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428903,ok=428903,error=0, records=41
[WARN ] 2026-06-02 04:30:23.005 [6493 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:30:23.616 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:30:24.005 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21451/300s
[INFO ] 2026-06-02 04:30:29.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 04:30:29.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428904,ok=428904,error=0, records=41
[WARN ] 2026-06-02 04:30:38.010 [6479 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:30:38.616 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:30:42.732 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21460/300s
[INFO ] 2026-06-02 04:30:44.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-02 04:30:44.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428905,ok=428905,error=0, records=41
[WARN ] 2026-06-02 04:30:47.514 [6493 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6392/stat), No such file or directory
[WARN ] 2026-06-02 04:30:47.514 [6493 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6339/stat), No such file or directory
[WARN ] 2026-06-02 04:30:53.015 [6275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:30:53.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:30:59.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-02 04:30:59.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428906,ok=428906,error=0, records=41
[INFO ] 2026-06-02 04:30:59.920 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21447/300s
[WARN ] 2026-06-02 04:31:08.020 [6419 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:31:08.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:31:14.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 04:31:14.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428907,ok=428907,error=0, records=41
[WARN ] 2026-06-02 04:31:23.025 [6419 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:31:23.618 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:31:29.933 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 04:31:29.933 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428908,ok=428908,error=0, records=41
[WARN ] 2026-06-02 04:31:38.030 [6419 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:31:38.618 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:31:44.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-02 04:31:44.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428909,ok=428909,error=0, records=41
[INFO ] 2026-06-02 04:31:47.927 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21456/300s
[INFO ] 2026-06-02 04:31:50.523 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21447/300s
[WARN ] 2026-06-02 04:31:53.034 [6305 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:31:53.619 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:31:57.042 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854788},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:31:57.222 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:31:57.222 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 04:31:57.222 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:31:57.222 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:31:57.222 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:31:57.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:31:57.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:31:57.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:31:57.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:31:57.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:31:57.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:31:59.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-02 04:31:59.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428910,ok=428910,error=0, records=41
[WARN ] 2026-06-02 04:32:08.039 [6627 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:32:08.620 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:32:08.620 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21459/300s
[INFO ] 2026-06-02 04:32:14.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 04:32:14.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428911,ok=428911,error=0, records=41
[WARN ] 2026-06-02 04:32:23.044 [6643 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:32:23.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:32:29.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-02 04:32:29.956 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428912,ok=428912,error=0, records=41
[WARN ] 2026-06-02 04:32:38.049 [6659 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:32:38.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:32:44.962 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 04:32:44.962 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428913,ok=428913,error=0, records=41
[WARN ] 2026-06-02 04:32:52.555 [6677 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:32:53.622 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:32:54.814 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21457/300s
[INFO ] 2026-06-02 04:32:56.616 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21457/300s
[INFO ] 2026-06-02 04:32:59.967 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 04:32:59.967 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428914,ok=428914,error=0, records=41
[INFO ] 2026-06-02 04:33:03.648 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21457/300s
[WARN ] 2026-06-02 04:33:07.560 [6677 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:33:08.622 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:33:14.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 04:33:14.974 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428915,ok=428915,error=0, records=41
[WARN ] 2026-06-02 04:33:22.565 [6707 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:33:23.623 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:33:29.978 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-02 04:33:29.979 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428916,ok=428916,error=0, records=41
[WARN ] 2026-06-02 04:33:37.572 [6713 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:33:38.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 04:33:38.624 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 04:33:44.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 04:33:44.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428917,ok=428917,error=0, records=41
[WARN ] 2026-06-02 04:33:52.576 [6695 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:33:53.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:34:00.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-02 04:34:00.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428918,ok=428918,error=0, records=41
[WARN ] 2026-06-02 04:34:07.580 [6763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:34:08.625 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:34:15.033 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 04:34:15.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428919,ok=428919,error=0, records=41
[WARN ] 2026-06-02 04:34:22.585 [6763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:34:23.626 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:34:30.039 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-02 04:34:30.039 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428920,ok=428920,error=0, records=41
[WARN ] 2026-06-02 04:34:37.591 [6763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:34:38.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:34:45.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 04:34:45.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428921,ok=428921,error=0, records=41
[WARN ] 2026-06-02 04:34:52.597 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:34:53.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:34:57.222 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17867/300s
[INFO ] 2026-06-02 04:34:57.223 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854960},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:34:57.381 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:34:57.381 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 04:34:57.381 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:34:57.381 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:34:57.381 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:34:57.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:34:57.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:34:57.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:34:57.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:34:57.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:34:57.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:35:00.052 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-02 04:35:00.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428922,ok=428922,error=0, records=41
[INFO ] 2026-06-02 04:35:01.730 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21461/300s
[WARN ] 2026-06-02 04:35:07.601 [6763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:35:08.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:35:15.057 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 04:35:15.057 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428923,ok=428923,error=0, records=41
[WARN ] 2026-06-02 04:35:22.606 [6796 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:35:23.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:35:24.107 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21452/300s
[INFO ] 2026-06-02 04:35:30.064 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 04:35:30.064 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428924,ok=428924,error=0, records=41
[WARN ] 2026-06-02 04:35:37.614 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:35:38.629 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:35:42.739 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21461/300s
[INFO ] 2026-06-02 04:35:45.070 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 04:35:45.070 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428925,ok=428925,error=0, records=41
[WARN ] 2026-06-02 04:35:52.619 [6796 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:35:53.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:36:00.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 04:36:00.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428926,ok=428926,error=0, records=41
[INFO ] 2026-06-02 04:36:00.079 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21448/300s
[WARN ] 2026-06-02 04:36:07.625 [6816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:36:08.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:36:15.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 04:36:15.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428927,ok=428927,error=0, records=41
[WARN ] 2026-06-02 04:36:22.630 [6816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:36:23.631 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:36:30.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 04:36:30.166 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428928,ok=428928,error=0, records=41
[WARN ] 2026-06-02 04:36:37.635 [6763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:36:38.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:36:45.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 04:36:45.170 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428929,ok=428929,error=0, records=41
[INFO ] 2026-06-02 04:36:47.982 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21457/300s
[INFO ] 2026-06-02 04:36:50.685 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21448/300s
[WARN ] 2026-06-02 04:36:52.641 [6763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:36:53.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:37:00.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 04:37:00.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428930,ok=428930,error=0, records=41
[WARN ] 2026-06-02 04:37:07.646 [6816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:37:08.633 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:37:08.633 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21460/300s
[INFO ] 2026-06-02 04:37:15.183 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 04:37:15.183 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428931,ok=428931,error=0, records=41
[WARN ] 2026-06-02 04:37:22.652 [6763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:37:23.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:37:30.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 04:37:30.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428932,ok=428932,error=0, records=41
[WARN ] 2026-06-02 04:37:37.657 [6763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:37:38.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:37:45.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 04:37:45.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428933,ok=428933,error=0, records=41
[WARN ] 2026-06-02 04:37:52.662 [6816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:37:53.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:37:54.882 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21458/300s
[INFO ] 2026-06-02 04:37:56.683 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21458/300s
[INFO ] 2026-06-02 04:37:57.383 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854836},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:37:57.546 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:37:57.546 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 04:37:57.546 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:37:57.547 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:37:57.547 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:37:57.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:37:57.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:37:57.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:37:57.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:37:57.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:37:57.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:38:00.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 04:38:00.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428934,ok=428934,error=0, records=41
[INFO ] 2026-06-02 04:38:03.690 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21458/300s
[WARN ] 2026-06-02 04:38:07.666 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:38:08.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:38:15.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 04:38:15.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428935,ok=428935,error=0, records=41
[WARN ] 2026-06-02 04:38:22.671 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:38:23.636 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:38:30.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 04:38:30.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428936,ok=428936,error=0, records=41
[WARN ] 2026-06-02 04:38:37.676 [6796 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:38:38.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:38:45.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 04:38:45.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428937,ok=428937,error=0, records=41
[WARN ] 2026-06-02 04:38:52.681 [6796 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:38:53.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:38:53.637 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 04:39:00.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 04:39:00.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428938,ok=428938,error=0, records=41
[WARN ] 2026-06-02 04:39:07.686 [6811 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:39:08.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:39:15.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-02 04:39:15.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428939,ok=428939,error=0, records=41
[WARN ] 2026-06-02 04:39:22.690 [6816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:39:23.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:39:30.255 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-02 04:39:30.255 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428940,ok=428940,error=0, records=41
[WARN ] 2026-06-02 04:39:37.696 [6796 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:39:38.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:39:45.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-02 04:39:45.262 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428941,ok=428941,error=0, records=41
[WARN ] 2026-06-02 04:39:52.702 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:39:53.641 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:40:00.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 04:40:00.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428942,ok=428942,error=0, records=41
[INFO ] 2026-06-02 04:40:01.734 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21462/300s
[WARN ] 2026-06-02 04:40:07.706 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:40:08.641 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:40:15.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 04:40:15.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428943,ok=428943,error=0, records=41
[WARN ] 2026-06-02 04:40:22.710 [6811 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:40:23.642 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:40:24.211 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21453/300s
[INFO ] 2026-06-02 04:40:30.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 04:40:30.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428944,ok=428944,error=0, records=41
[WARN ] 2026-06-02 04:40:37.716 [6816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:40:38.643 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:40:42.746 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21462/300s
[INFO ] 2026-06-02 04:40:45.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 04:40:45.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428945,ok=428945,error=0, records=41
[WARN ] 2026-06-02 04:40:52.721 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:40:53.643 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:40:57.547 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17868/300s
[INFO ] 2026-06-02 04:40:57.548 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854736},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:40:57.696 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:40:57.697 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 04:40:57.697 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:40:57.697 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:40:57.697 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:40:57.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:40:57.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:40:57.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:40:57.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:40:57.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:40:57.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:41:00.296 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 04:41:00.296 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428946,ok=428946,error=0, records=41
[INFO ] 2026-06-02 04:41:00.296 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21449/300s
[WARN ] 2026-06-02 04:41:07.726 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:41:08.644 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:41:15.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-02 04:41:15.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428947,ok=428947,error=0, records=41
[WARN ] 2026-06-02 04:41:22.731 [6816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:41:23.645 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:41:30.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 04:41:30.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428948,ok=428948,error=0, records=41
[WARN ] 2026-06-02 04:41:37.736 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:41:38.645 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:41:45.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 04:41:45.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428949,ok=428949,error=0, records=41
[INFO ] 2026-06-02 04:41:48.039 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21458/300s
[INFO ] 2026-06-02 04:41:50.868 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21449/300s
[WARN ] 2026-06-02 04:41:52.740 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:41:53.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:42:00.334 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 04:42:00.334 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428950,ok=428950,error=0, records=41
[WARN ] 2026-06-02 04:42:07.746 [6816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:42:08.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:42:08.647 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21461/300s
[INFO ] 2026-06-02 04:42:15.419 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 04:42:15.419 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428951,ok=428951,error=0, records=41
[WARN ] 2026-06-02 04:42:22.751 [6811 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:42:23.647 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:42:30.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 04:42:30.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428952,ok=428952,error=0, records=41
[WARN ] 2026-06-02 04:42:37.757 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:42:38.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:42:45.431 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 04:42:45.431 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428953,ok=428953,error=0, records=41
[WARN ] 2026-06-02 04:42:52.761 [6763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:42:53.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:42:54.934 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21459/300s
[INFO ] 2026-06-02 04:42:56.736 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21459/300s
[INFO ] 2026-06-02 04:43:00.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 04:43:00.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428954,ok=428954,error=0, records=41
[INFO ] 2026-06-02 04:43:03.743 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21459/300s
[WARN ] 2026-06-02 04:43:07.765 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:43:08.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:43:15.443 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 04:43:15.443 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428955,ok=428955,error=0, records=41
[WARN ] 2026-06-02 04:43:22.771 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:43:23.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:43:30.449 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 04:43:30.449 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428956,ok=428956,error=0, records=41
[WARN ] 2026-06-02 04:43:37.776 [6796 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:43:38.650 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 04:43:38.650 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 04:43:45.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 04:43:45.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428957,ok=428957,error=0, records=41
[WARN ] 2026-06-02 04:43:52.781 [6811 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:43:53.651 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:43:57.698 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854660},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:43:57.842 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:43:57.842 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 04:43:57.842 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:43:57.842 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:43:57.842 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:43:57.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:43:57.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:43:57.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:43:57.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:43:57.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:43:57.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:44:00.461 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 04:44:00.461 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428958,ok=428958,error=0, records=41
[WARN ] 2026-06-02 04:44:07.786 [6752 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:44:08.651 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:44:15.466 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 04:44:15.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428959,ok=428959,error=0, records=41
[WARN ] 2026-06-02 04:44:22.792 [6796 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:44:23.652 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:44:30.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 04:44:30.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428960,ok=428960,error=0, records=41
[WARN ] 2026-06-02 04:44:37.797 [6763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:44:38.653 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:44:45.477 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 04:44:45.478 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428961,ok=428961,error=0, records=41
[WARN ] 2026-06-02 04:44:52.801 [6811 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:44:53.653 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:45:00.483 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 04:45:00.483 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428962,ok=428962,error=0, records=41
[INFO ] 2026-06-02 04:45:01.738 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21463/300s
[WARN ] 2026-06-02 04:45:07.808 [6763 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:45:08.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:45:15.489 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-02 04:45:15.489 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428963,ok=428963,error=0, records=41
[WARN ] 2026-06-02 04:45:22.813 [7370 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:45:23.655 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:45:24.313 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21454/300s
[INFO ] 2026-06-02 04:45:30.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-02 04:45:30.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428964,ok=428964,error=0, records=41
[WARN ] 2026-06-02 04:45:37.818 [7375 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:45:38.655 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:45:42.752 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21463/300s
[INFO ] 2026-06-02 04:45:45.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-02 04:45:45.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428965,ok=428965,error=0, records=41
[WARN ] 2026-06-02 04:45:52.823 [7404 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:45:53.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:46:00.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-02 04:46:00.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428966,ok=428966,error=0, records=41
[INFO ] 2026-06-02 04:46:00.510 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21450/300s
[WARN ] 2026-06-02 04:46:07.827 [7375 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:46:08.657 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:46:15.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 04:46:15.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428967,ok=428967,error=0, records=41
[WARN ] 2026-06-02 04:46:22.833 [7432 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:46:23.657 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:46:30.521 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 04:46:30.521 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428968,ok=428968,error=0, records=41
[WARN ] 2026-06-02 04:46:37.837 [7446 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:46:38.658 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:46:45.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 04:46:45.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428969,ok=428969,error=0, records=41
[INFO ] 2026-06-02 04:46:48.094 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21459/300s
[INFO ] 2026-06-02 04:46:51.052 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21450/300s
[WARN ] 2026-06-02 04:46:52.842 [7390 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:46:53.659 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:46:57.842 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17869/300s
[INFO ] 2026-06-02 04:46:57.844 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854584},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:46:58.001 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:46:58.001 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 04:46:58.001 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:46:58.001 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:46:58.001 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:46:58.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:46:58.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:46:58.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:46:58.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:46:58.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:46:58.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:47:00.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 04:47:00.532 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428970,ok=428970,error=0, records=41
[WARN ] 2026-06-02 04:47:07.848 [7390 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:47:08.659 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:47:08.659 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21462/300s
[INFO ] 2026-06-02 04:47:15.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 04:47:15.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428971,ok=428971,error=0, records=41
[WARN ] 2026-06-02 04:47:22.854 [7484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:47:23.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:47:30.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 04:47:30.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428972,ok=428972,error=0, records=41
[WARN ] 2026-06-02 04:47:37.859 [7484 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:47:38.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:47:45.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 04:47:45.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428973,ok=428973,error=0, records=41
[WARN ] 2026-06-02 04:47:52.863 [7390 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:47:53.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:47:55.001 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21460/300s
[INFO ] 2026-06-02 04:47:56.803 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21460/300s
[INFO ] 2026-06-02 04:48:00.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 04:48:00.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428974,ok=428974,error=0, records=41
[INFO ] 2026-06-02 04:48:03.808 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21460/300s
[WARN ] 2026-06-02 04:48:07.868 [7498 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:48:08.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:48:15.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-02 04:48:15.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428975,ok=428975,error=0, records=41
[WARN ] 2026-06-02 04:48:22.873 [7498 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:48:23.662 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:48:30.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 04:48:30.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428976,ok=428976,error=0, records=41
[WARN ] 2026-06-02 04:48:37.879 [7540 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:48:38.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:48:45.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-02 04:48:45.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428977,ok=428977,error=0, records=41
[WARN ] 2026-06-02 04:48:52.884 [7540 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:48:53.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:49:00.579 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-02 04:49:00.579 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428978,ok=428978,error=0, records=41
[WARN ] 2026-06-02 04:49:07.889 [7594 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:49:08.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:49:15.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 04:49:15.585 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428979,ok=428979,error=0, records=41
[WARN ] 2026-06-02 04:49:22.895 [7609 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:49:23.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:49:30.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 04:49:30.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428980,ok=428980,error=0, records=41
[WARN ] 2026-06-02 04:49:37.900 [7620 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:49:38.665 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:49:45.597 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 04:49:45.597 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428981,ok=428981,error=0, records=41
[WARN ] 2026-06-02 04:49:52.904 [7555 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:49:53.665 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:49:58.003 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854504},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:49:58.187 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:49:58.188 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 04:49:58.188 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:49:58.188 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:49:58.188 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:49:58.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:49:58.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:49:58.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:49:58.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:49:58.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:49:58.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:50:00.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 04:50:00.601 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428982,ok=428982,error=0, records=41
[INFO ] 2026-06-02 04:50:01.741 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21464/300s
[WARN ] 2026-06-02 04:50:07.909 [7665 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:50:08.666 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:50:15.607 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 04:50:15.607 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428983,ok=428983,error=0, records=41
[WARN ] 2026-06-02 04:50:22.915 [7665 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:50:23.667 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:50:24.415 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21455/300s
[INFO ] 2026-06-02 04:50:30.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 04:50:30.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428984,ok=428984,error=0, records=41
[WARN ] 2026-06-02 04:50:37.920 [7665 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:50:38.667 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:50:42.758 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21464/300s
[INFO ] 2026-06-02 04:50:45.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 04:50:45.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428985,ok=428985,error=0, records=41
[WARN ] 2026-06-02 04:50:52.925 [7715 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:50:53.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:51:00.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 04:51:00.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428986,ok=428986,error=0, records=41
[INFO ] 2026-06-02 04:51:00.639 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21451/300s
[WARN ] 2026-06-02 04:51:07.930 [7726 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:51:08.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:51:15.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-02 04:51:15.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428987,ok=428987,error=0, records=41
[WARN ] 2026-06-02 04:51:22.934 [7732 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:51:23.669 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:51:30.651 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-02 04:51:30.651 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428988,ok=428988,error=0, records=41
[WARN ] 2026-06-02 04:51:37.940 [7687 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:51:38.670 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:51:45.658 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-02 04:51:45.658 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428989,ok=428989,error=0, records=41
[INFO ] 2026-06-02 04:51:48.150 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21460/300s
[INFO ] 2026-06-02 04:51:51.233 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21451/300s
[WARN ] 2026-06-02 04:51:52.945 [7782 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:51:53.670 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:52:00.664 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-02 04:52:00.664 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428990,ok=428990,error=0, records=41
[WARN ] 2026-06-02 04:52:07.949 [7777 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:52:08.671 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:52:08.671 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21463/300s
[INFO ] 2026-06-02 04:52:15.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 04:52:15.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428991,ok=428991,error=0, records=41
[WARN ] 2026-06-02 04:52:22.954 [7687 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:52:23.671 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:52:30.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 04:52:30.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428992,ok=428992,error=0, records=41
[WARN ] 2026-06-02 04:52:37.958 [7720 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:52:38.672 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:52:45.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 04:52:45.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428993,ok=428993,error=0, records=41
[WARN ] 2026-06-02 04:52:52.963 [7793 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:52:53.673 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:52:55.057 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21461/300s
[INFO ] 2026-06-02 04:52:56.858 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21461/300s
[INFO ] 2026-06-02 04:52:58.188 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17870/300s
[INFO ] 2026-06-02 04:52:58.189 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854424},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:52:58.341 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:52:58.341 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 04:52:58.342 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:52:58.342 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:52:58.342 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:52:58.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:52:58.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:52:58.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:52:58.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:52:58.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:52:58.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:53:00.686 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 04:53:00.686 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428994,ok=428994,error=0, records=41
[INFO ] 2026-06-02 04:53:03.864 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21461/300s
[WARN ] 2026-06-02 04:53:07.968 [7793 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:53:08.673 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:53:15.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 04:53:15.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428995,ok=428995,error=0, records=41
[WARN ] 2026-06-02 04:53:22.973 [7821 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:53:23.674 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:53:30.699 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 04:53:30.699 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428996,ok=428996,error=0, records=41
[WARN ] 2026-06-02 04:53:37.978 [7821 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:53:38.675 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 04:53:38.675 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 04:53:45.710 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 04:53:45.710 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428997,ok=428997,error=0, records=41
[WARN ] 2026-06-02 04:53:52.984 [7777 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:53:53.676 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:53:53.676 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 04:54:00.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-02 04:54:00.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428998,ok=428998,error=0, records=41
[WARN ] 2026-06-02 04:54:07.990 [7905 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:54:08.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:54:15.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 04:54:15.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=428999,ok=428999,error=0, records=41
[WARN ] 2026-06-02 04:54:22.994 [7905 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:54:23.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:54:30.741 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 04:54:30.741 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429000,ok=429000,error=0, records=41
[WARN ] 2026-06-02 04:54:38.000 [7777 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:54:38.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:54:45.796 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 04:54:45.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429001,ok=429001,error=0, records=41
[WARN ] 2026-06-02 04:54:53.005 [7946 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:54:53.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:55:00.807 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 04:55:00.807 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429002,ok=429002,error=0, records=41
[INFO ] 2026-06-02 04:55:01.744 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21465/300s
[WARN ] 2026-06-02 04:55:08.011 [7905 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:55:08.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:55:15.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-02 04:55:15.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429003,ok=429003,error=0, records=41
[WARN ] 2026-06-02 04:55:23.016 [7821 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:55:23.680 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:55:24.516 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21456/300s
[INFO ] 2026-06-02 04:55:30.817 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 04:55:30.817 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429004,ok=429004,error=0, records=41
[WARN ] 2026-06-02 04:55:38.021 [7960 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:55:38.681 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:55:42.765 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21465/300s
[INFO ] 2026-06-02 04:55:45.822 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-02 04:55:45.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429005,ok=429005,error=0, records=41
[WARN ] 2026-06-02 04:55:53.026 [7777 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:55:53.681 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:55:58.343 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854348},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:55:58.495 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:55:58.495 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 04:55:58.495 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 04:55:58.495 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 04:55:58.495 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 04:55:58.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:55:58.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 04:55:58.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:55:58.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 04:55:58.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:55:58.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 04:56:00.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 04:56:00.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429006,ok=429006,error=0, records=41
[INFO ] 2026-06-02 04:56:00.827 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21452/300s
[WARN ] 2026-06-02 04:56:08.031 [7988 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:56:08.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:56:15.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-02 04:56:15.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429007,ok=429007,error=0, records=41
[WARN ] 2026-06-02 04:56:23.035 [7988 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:56:23.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:56:30.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 04:56:30.839 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429008,ok=429008,error=0, records=41
[WARN ] 2026-06-02 04:56:38.041 [8037 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:56:38.683 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:56:45.846 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 04:56:45.846 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429009,ok=429009,error=0, records=41
[INFO ] 2026-06-02 04:56:48.205 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21461/300s
[INFO ] 2026-06-02 04:56:51.414 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21452/300s
[WARN ] 2026-06-02 04:56:53.046 [8065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:56:53.684 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:57:00.852 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 04:57:00.852 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429010,ok=429010,error=0, records=41
[WARN ] 2026-06-02 04:57:08.051 [7777 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:57:08.684 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:57:08.684 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21464/300s
[INFO ] 2026-06-02 04:57:15.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 04:57:15.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429011,ok=429011,error=0, records=41
[WARN ] 2026-06-02 04:57:22.555 [8106 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:57:23.685 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:57:30.863 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 04:57:30.863 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429012,ok=429012,error=0, records=41
[WARN ] 2026-06-02 04:57:37.559 [8107 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:57:38.685 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:57:45.910 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 04:57:45.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429013,ok=429013,error=0, records=41
[WARN ] 2026-06-02 04:57:52.564 [8135 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:57:53.686 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:57:55.108 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21462/300s
[INFO ] 2026-06-02 04:57:56.910 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21462/300s
[INFO ] 2026-06-02 04:58:00.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-02 04:58:00.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429014,ok=429014,error=0, records=41
[INFO ] 2026-06-02 04:58:03.913 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21462/300s
[WARN ] 2026-06-02 04:58:07.568 [8135 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:58:08.686 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:58:15.921 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 04:58:15.921 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429015,ok=429015,error=0, records=41
[WARN ] 2026-06-02 04:58:22.573 [8142 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:58:23.687 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:58:30.928 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 04:58:30.929 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429016,ok=429016,error=0, records=41
[WARN ] 2026-06-02 04:58:37.577 [8188 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:58:38.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:58:45.934 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 04:58:45.934 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429017,ok=429017,error=0, records=41
[WARN ] 2026-06-02 04:58:52.583 [8212 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:58:53.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:58:58.496 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17871/300s
[INFO ] 2026-06-02 04:58:58.498 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854272},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 04:58:58.640 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 04:58:58.640 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 04:59:00.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 04:59:00.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429018,ok=429018,error=0, records=41
[WARN ] 2026-06-02 04:59:07.589 [8212 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:59:08.689 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:59:15.945 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 04:59:15.945 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429019,ok=429019,error=0, records=41
[WARN ] 2026-06-02 04:59:22.593 [8212 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:59:23.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:59:30.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 04:59:30.950 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429020,ok=429020,error=0, records=41
[WARN ] 2026-06-02 04:59:37.598 [8232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:59:38.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 04:59:45.958 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 04:59:45.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429021,ok=429021,error=0, records=41
[WARN ] 2026-06-02 04:59:52.602 [8212 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 04:59:53.691 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:00:00.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 05:00:00.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429022,ok=429022,error=0, records=41
[INFO ] 2026-06-02 05:00:01.748 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21466/300s
[WARN ] 2026-06-02 05:00:07.607 [8263 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:00:08.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:00:15.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 05:00:15.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429023,ok=429023,error=0, records=41
[WARN ] 2026-06-02 05:00:22.612 [8228 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:00:23.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:00:24.612 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21457/300s
[INFO ] 2026-06-02 05:00:30.974 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 05:00:30.974 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429024,ok=429024,error=0, records=41
[WARN ] 2026-06-02 05:00:37.617 [8232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:00:38.693 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:00:42.771 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21466/300s
[INFO ] 2026-06-02 05:00:45.983 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 05:00:45.983 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429025,ok=429025,error=0, records=41
[WARN ] 2026-06-02 05:00:52.623 [8232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:00:53.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:01:01.048 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 05:01:01.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429026,ok=429026,error=0, records=41
[INFO ] 2026-06-02 05:01:01.048 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21453/300s
[WARN ] 2026-06-02 05:01:07.627 [8263 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:01:08.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:01:16.054 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10420, records=41
[INFO ] 2026-06-02 05:01:16.054 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429027,ok=429027,error=0, records=41
[WARN ] 2026-06-02 05:01:22.632 [8212 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:01:23.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:01:31.061 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-02 05:01:31.061 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429028,ok=429028,error=0, records=41
[WARN ] 2026-06-02 05:01:37.638 [8228 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:01:38.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:01:46.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-02 05:01:46.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429029,ok=429029,error=0, records=41
[INFO ] 2026-06-02 05:01:48.261 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21462/300s
[INFO ] 2026-06-02 05:01:51.596 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21453/300s
[WARN ] 2026-06-02 05:01:52.643 [8258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:01:53.696 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:01:58.642 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854192},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:01:58.812 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:01:58.812 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 05:01:58.813 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:01:58.813 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:01:58.813 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:01:58.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:01:58.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:01:58.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:01:58.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:01:58.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:01:58.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:02:01.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-06-02 05:02:01.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429030,ok=429030,error=0, records=41
[WARN ] 2026-06-02 05:02:07.648 [8258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:02:08.696 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:02:08.696 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21465/300s
[INFO ] 2026-06-02 05:02:16.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-02 05:02:16.077 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429031,ok=429031,error=0, records=41
[WARN ] 2026-06-02 05:02:22.653 [8258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:02:23.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:02:31.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 05:02:31.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429032,ok=429032,error=0, records=41
[WARN ] 2026-06-02 05:02:37.659 [8258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:02:38.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:02:46.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 05:02:46.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429033,ok=429033,error=0, records=41
[WARN ] 2026-06-02 05:02:52.664 [8228 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:02:53.698 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:02:55.153 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21463/300s
[INFO ] 2026-06-02 05:02:56.956 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21463/300s
[INFO ] 2026-06-02 05:03:01.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 05:03:01.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429034,ok=429034,error=0, records=41
[INFO ] 2026-06-02 05:03:03.962 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21463/300s
[WARN ] 2026-06-02 05:03:07.671 [8232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:03:08.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:03:16.168 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 05:03:16.168 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429035,ok=429035,error=0, records=41
[WARN ] 2026-06-02 05:03:22.675 [8228 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:03:23.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:03:31.173 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 05:03:31.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429036,ok=429036,error=0, records=41
[WARN ] 2026-06-02 05:03:37.680 [8228 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:03:38.700 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 05:03:38.700 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 05:03:46.179 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 05:03:46.179 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429037,ok=429037,error=0, records=41
[WARN ] 2026-06-02 05:03:52.686 [8263 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:03:53.700 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:04:01.184 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 05:04:01.184 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429038,ok=429038,error=0, records=41
[WARN ] 2026-06-02 05:04:07.692 [8212 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:04:08.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:04:16.192 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 05:04:16.192 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429039,ok=429039,error=0, records=41
[WARN ] 2026-06-02 05:04:22.697 [8258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:04:23.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:04:31.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 05:04:31.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429040,ok=429040,error=0, records=41
[WARN ] 2026-06-02 05:04:37.702 [8212 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:04:38.702 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:04:46.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 05:04:46.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429041,ok=429041,error=0, records=41
[WARN ] 2026-06-02 05:04:52.708 [8228 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:04:53.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:04:58.813 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17872/300s
[INFO ] 2026-06-02 05:04:58.815 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854108},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:04:58.989 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:04:58.989 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 05:04:58.989 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:04:58.989 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:04:58.989 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:04:59.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:04:59.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:04:59.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:04:59.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:04:59.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:04:59.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:05:01.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 05:05:01.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429042,ok=429042,error=0, records=41
[INFO ] 2026-06-02 05:05:01.751 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21467/300s
[WARN ] 2026-06-02 05:05:07.714 [8212 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:05:08.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:05:16.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-02 05:05:16.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429043,ok=429043,error=0, records=41
[WARN ] 2026-06-02 05:05:22.719 [8258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:05:23.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:05:24.720 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21458/300s
[INFO ] 2026-06-02 05:05:31.257 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 05:05:31.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429044,ok=429044,error=0, records=41
[WARN ] 2026-06-02 05:05:37.725 [8263 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:05:38.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:05:42.777 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21467/300s
[INFO ] 2026-06-02 05:05:46.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 05:05:46.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429045,ok=429045,error=0, records=41
[WARN ] 2026-06-02 05:05:52.730 [8228 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:05:53.705 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:06:01.272 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 05:06:01.272 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429046,ok=429046,error=0, records=41
[INFO ] 2026-06-02 05:06:01.272 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21454/300s
[WARN ] 2026-06-02 05:06:07.735 [8258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:06:08.705 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:06:16.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 05:06:16.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429047,ok=429047,error=0, records=41
[WARN ] 2026-06-02 05:06:22.740 [8263 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:06:23.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:06:31.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 05:06:31.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429048,ok=429048,error=0, records=41
[WARN ] 2026-06-02 05:06:37.745 [8232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:06:38.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:06:46.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 05:06:46.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429049,ok=429049,error=0, records=41
[INFO ] 2026-06-02 05:06:48.308 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21463/300s
[INFO ] 2026-06-02 05:06:51.772 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21454/300s
[WARN ] 2026-06-02 05:06:52.750 [8228 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:06:53.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:07:01.294 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 05:07:01.294 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429050,ok=429050,error=0, records=41
[WARN ] 2026-06-02 05:07:07.756 [8232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:07:08.708 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:07:08.708 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21466/300s
[INFO ] 2026-06-02 05:07:16.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 05:07:16.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429051,ok=429051,error=0, records=41
[WARN ] 2026-06-02 05:07:22.761 [8263 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:07:23.708 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:07:31.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-02 05:07:31.306 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429052,ok=429052,error=0, records=41
[WARN ] 2026-06-02 05:07:37.766 [8263 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:07:38.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:07:46.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-02 05:07:46.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429053,ok=429053,error=0, records=41
[WARN ] 2026-06-02 05:07:52.771 [8232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:07:53.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:07:55.176 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21464/300s
[INFO ] 2026-06-02 05:07:56.979 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21464/300s
[INFO ] 2026-06-02 05:07:58.991 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20854032},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:07:59.139 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:07:59.139 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 05:07:59.139 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:07:59.139 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:07:59.139 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:07:59.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:07:59.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:07:59.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:07:59.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:07:59.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:07:59.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:08:01.317 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-02 05:08:01.317 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429054,ok=429054,error=0, records=41
[INFO ] 2026-06-02 05:08:03.982 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21464/300s
[WARN ] 2026-06-02 05:08:07.776 [8232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:08:08.710 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:08:16.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-02 05:08:16.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429055,ok=429055,error=0, records=41
[WARN ] 2026-06-02 05:08:22.781 [8228 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:08:23.710 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:08:31.327 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-02 05:08:31.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429056,ok=429056,error=0, records=41
[WARN ] 2026-06-02 05:08:37.787 [8228 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:08:38.711 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:08:46.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-02 05:08:46.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429057,ok=429057,error=0, records=41
[WARN ] 2026-06-02 05:08:52.792 [8232 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:08:53.712 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:08:53.712 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 05:09:01.339 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-02 05:09:01.339 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429058,ok=429058,error=0, records=41
[WARN ] 2026-06-02 05:09:07.797 [8258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:09:08.713 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:09:16.344 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-02 05:09:16.344 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429059,ok=429059,error=0, records=41
[WARN ] 2026-06-02 05:09:22.803 [8228 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:09:23.714 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:09:31.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 05:09:31.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429060,ok=429060,error=0, records=41
[WARN ] 2026-06-02 05:09:37.808 [8809 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:09:38.714 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:09:46.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 05:09:46.355 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429061,ok=429061,error=0, records=41
[WARN ] 2026-06-02 05:09:52.815 [8258 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:09:53.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:10:01.362 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-02 05:10:01.362 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429062,ok=429062,error=0, records=41
[INFO ] 2026-06-02 05:10:01.754 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21468/300s
[WARN ] 2026-06-02 05:10:07.820 [8263 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:10:08.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:10:16.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10143, records=41
[INFO ] 2026-06-02 05:10:16.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429063,ok=429063,error=0, records=41
[WARN ] 2026-06-02 05:10:22.824 [8263 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:10:23.716 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:10:24.825 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21459/300s
[INFO ] 2026-06-02 05:10:31.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-02 05:10:31.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429064,ok=429064,error=0, records=41
[WARN ] 2026-06-02 05:10:37.830 [8799 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:10:38.716 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:10:42.782 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21468/300s
[INFO ] 2026-06-02 05:10:46.377 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10134, records=41
[INFO ] 2026-06-02 05:10:46.377 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429065,ok=429065,error=0, records=41
[WARN ] 2026-06-02 05:10:52.835 [8862 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:10:53.717 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:10:59.139 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17873/300s
[INFO ] 2026-06-02 05:10:59.141 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853944},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:10:59.289 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:10:59.289 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 05:10:59.289 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:10:59.289 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:10:59.290 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:10:59.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:10:59.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:10:59.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:10:59.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:10:59.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:10:59.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:11:01.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-02 05:11:01.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429066,ok=429066,error=0, records=41
[INFO ] 2026-06-02 05:11:01.383 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21455/300s
[WARN ] 2026-06-02 05:11:07.840 [8890 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:11:08.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:11:16.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 05:11:16.390 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429067,ok=429067,error=0, records=41
[WARN ] 2026-06-02 05:11:22.844 [8914 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:11:23.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:11:31.395 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 05:11:31.395 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429068,ok=429068,error=0, records=41
[WARN ] 2026-06-02 05:11:37.849 [8890 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:11:38.719 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:11:46.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 05:11:46.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429069,ok=429069,error=0, records=41
[INFO ] 2026-06-02 05:11:48.357 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21464/300s
[INFO ] 2026-06-02 05:11:51.952 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21455/300s
[WARN ] 2026-06-02 05:11:52.854 [8829 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:11:53.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:12:01.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 05:12:01.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429070,ok=429070,error=0, records=41
[WARN ] 2026-06-02 05:12:07.859 [8914 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:12:08.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:12:08.720 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21467/300s
[INFO ] 2026-06-02 05:12:16.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10383, records=41
[INFO ] 2026-06-02 05:12:16.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429071,ok=429071,error=0, records=41
[WARN ] 2026-06-02 05:12:22.863 [8970 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:12:23.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:12:31.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-02 05:12:31.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429072,ok=429072,error=0, records=41
[WARN ] 2026-06-02 05:12:37.868 [8829 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:12:38.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:12:46.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-02 05:12:46.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429073,ok=429073,error=0, records=41
[WARN ] 2026-06-02 05:12:52.872 [8914 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:12:53.722 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:12:55.225 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21465/300s
[INFO ] 2026-06-02 05:12:57.026 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21465/300s
[INFO ] 2026-06-02 05:13:01.431 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-02 05:13:01.431 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429074,ok=429074,error=0, records=41
[INFO ] 2026-06-02 05:13:04.032 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21465/300s
[WARN ] 2026-06-02 05:13:07.877 [9019 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:13:08.722 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:13:16.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 05:13:16.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429075,ok=429075,error=0, records=41
[WARN ] 2026-06-02 05:13:22.881 [8829 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:13:23.723 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:13:31.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 05:13:31.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429076,ok=429076,error=0, records=41
[WARN ] 2026-06-02 05:13:37.888 [9051 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:13:38.724 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 05:13:38.724 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 05:13:46.449 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 05:13:46.449 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429077,ok=429077,error=0, records=41
[WARN ] 2026-06-02 05:13:52.893 [9067 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:13:53.725 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:13:59.291 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853872},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:13:59.461 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:13:59.461 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 05:13:59.461 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:13:59.461 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:13:59.461 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:13:59.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:13:59.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:13:59.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:13:59.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:13:59.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:13:59.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:14:01.454 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 05:14:01.454 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429078,ok=429078,error=0, records=41
[WARN ] 2026-06-02 05:14:07.899 [9086 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:14:08.725 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:14:16.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 05:14:16.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429079,ok=429079,error=0, records=41
[WARN ] 2026-06-02 05:14:22.905 [9096 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:14:23.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:14:31.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-02 05:14:31.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429080,ok=429080,error=0, records=41
[WARN ] 2026-06-02 05:14:37.911 [9111 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:14:38.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:14:46.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 05:14:46.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429081,ok=429081,error=0, records=41
[WARN ] 2026-06-02 05:14:52.917 [9133 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:14:53.727 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:15:01.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 05:15:01.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429082,ok=429082,error=0, records=41
[INFO ] 2026-06-02 05:15:01.757 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21469/300s
[WARN ] 2026-06-02 05:15:07.922 [9138 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:15:08.727 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:15:16.480 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 05:15:16.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429083,ok=429083,error=0, records=41
[WARN ] 2026-06-02 05:15:22.927 [9145 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:15:23.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:15:24.928 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21460/300s
[INFO ] 2026-06-02 05:15:31.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-02 05:15:31.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429084,ok=429084,error=0, records=41
[WARN ] 2026-06-02 05:15:37.933 [9175 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:15:38.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:15:42.788 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21469/300s
[INFO ] 2026-06-02 05:15:46.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 05:15:46.489 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429085,ok=429085,error=0, records=41
[WARN ] 2026-06-02 05:15:52.938 [9191 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:15:53.729 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:16:01.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 05:16:01.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429086,ok=429086,error=0, records=41
[INFO ] 2026-06-02 05:16:01.495 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21456/300s
[WARN ] 2026-06-02 05:16:07.944 [9213 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:16:08.729 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:16:16.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-02 05:16:16.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429087,ok=429087,error=0, records=41
[WARN ] 2026-06-02 05:16:22.948 [9218 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:16:23.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:16:31.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-02 05:16:31.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429088,ok=429088,error=0, records=41
[WARN ] 2026-06-02 05:16:37.953 [9225 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:16:38.731 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:16:46.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-02 05:16:46.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429089,ok=429089,error=0, records=41
[INFO ] 2026-06-02 05:16:48.409 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21465/300s
[INFO ] 2026-06-02 05:16:52.132 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21456/300s
[WARN ] 2026-06-02 05:16:52.958 [9253 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:16:53.731 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:16:59.461 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17874/300s
[INFO ] 2026-06-02 05:16:59.463 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853788},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:16:59.624 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:16:59.624 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 05:16:59.624 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:16:59.624 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:16:59.624 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:16:59.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:16:59.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:16:59.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:16:59.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:16:59.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:16:59.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:17:01.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-06-02 05:17:01.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429090,ok=429090,error=0, records=41
[WARN ] 2026-06-02 05:17:07.964 [9225 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:17:08.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:17:08.732 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21468/300s
[INFO ] 2026-06-02 05:17:16.529 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 05:17:16.529 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429091,ok=429091,error=0, records=41
[WARN ] 2026-06-02 05:17:22.968 [9224 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:17:23.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:17:31.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 05:17:31.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429092,ok=429092,error=0, records=41
[WARN ] 2026-06-02 05:17:37.972 [9207 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:17:38.733 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:17:46.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 05:17:46.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429093,ok=429093,error=0, records=41
[WARN ] 2026-06-02 05:17:52.976 [9296 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:17:53.734 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:17:55.288 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21466/300s
[INFO ] 2026-06-02 05:17:57.089 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21466/300s
[INFO ] 2026-06-02 05:18:01.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 05:18:01.545 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429094,ok=429094,error=0, records=41
[INFO ] 2026-06-02 05:18:04.069 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21466/300s
[WARN ] 2026-06-02 05:18:07.981 [9225 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:18:08.734 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:18:16.550 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 05:18:16.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429095,ok=429095,error=0, records=41
[WARN ] 2026-06-02 05:18:22.986 [9224 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:18:23.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:18:31.555 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 05:18:31.555 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429096,ok=429096,error=0, records=41
[WARN ] 2026-06-02 05:18:37.990 [9225 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:18:38.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:18:46.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 05:18:46.584 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429097,ok=429097,error=0, records=41
[WARN ] 2026-06-02 05:18:52.995 [9366 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:18:53.736 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:19:01.593 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 05:19:01.593 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429098,ok=429098,error=0, records=41
[WARN ] 2026-06-02 05:19:08.000 [9338 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:19:08.737 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:19:16.599 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 05:19:16.599 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429099,ok=429099,error=0, records=41
[WARN ] 2026-06-02 05:19:23.005 [9224 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:19:23.737 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:19:31.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 05:19:31.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429100,ok=429100,error=0, records=41
[WARN ] 2026-06-02 05:19:38.010 [9394 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:19:38.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:19:46.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 05:19:46.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429101,ok=429101,error=0, records=41
[WARN ] 2026-06-02 05:19:53.015 [9422 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:19:53.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:19:59.626 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853716},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:19:59.778 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:19:59.778 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 05:19:59.779 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:19:59.779 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:19:59.779 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:19:59.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:19:59.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:19:59.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:19:59.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:19:59.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:19:59.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:20:01.617 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 05:20:01.617 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429102,ok=429102,error=0, records=41
[INFO ] 2026-06-02 05:20:01.760 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21470/300s
[WARN ] 2026-06-02 05:20:08.021 [9380 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:20:08.739 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:20:16.623 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 05:20:16.623 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429103,ok=429103,error=0, records=41
[WARN ] 2026-06-02 05:20:23.027 [9352 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:20:23.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:20:25.027 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21461/300s
[INFO ] 2026-06-02 05:20:31.628 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 05:20:31.629 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429104,ok=429104,error=0, records=41
[WARN ] 2026-06-02 05:20:38.032 [9456 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:20:38.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:20:42.794 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21470/300s
[INFO ] 2026-06-02 05:20:46.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 05:20:46.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429105,ok=429105,error=0, records=41
[WARN ] 2026-06-02 05:20:53.037 [9380 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:20:53.741 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:21:01.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 05:21:01.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429106,ok=429106,error=0, records=41
[INFO ] 2026-06-02 05:21:01.644 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21457/300s
[WARN ] 2026-06-02 05:21:08.042 [9501 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:21:08.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:21:16.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 05:21:16.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429107,ok=429107,error=0, records=41
[WARN ] 2026-06-02 05:21:23.047 [9518 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:21:23.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:21:31.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 05:21:31.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429108,ok=429108,error=0, records=41
[WARN ] 2026-06-02 05:21:37.554 [9511 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:21:38.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:21:46.658 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 05:21:46.658 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429109,ok=429109,error=0, records=41
[INFO ] 2026-06-02 05:21:48.462 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21466/300s
[INFO ] 2026-06-02 05:21:52.315 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21457/300s
[WARN ] 2026-06-02 05:21:52.559 [9535 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:21:53.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:22:01.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 05:22:01.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429110,ok=429110,error=0, records=41
[WARN ] 2026-06-02 05:22:07.563 [9501 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:22:08.744 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:22:08.744 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21469/300s
[INFO ] 2026-06-02 05:22:16.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 05:22:16.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429111,ok=429111,error=0, records=41
[WARN ] 2026-06-02 05:22:22.567 [9571 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:22:23.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:22:31.673 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 05:22:31.673 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429112,ok=429112,error=0, records=41
[WARN ] 2026-06-02 05:22:37.571 [9606 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:22:38.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:22:46.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 05:22:46.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429113,ok=429113,error=0, records=41
[WARN ] 2026-06-02 05:22:52.576 [9623 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:22:53.746 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:22:55.346 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21467/300s
[INFO ] 2026-06-02 05:22:57.140 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21467/300s
[INFO ] 2026-06-02 05:22:59.779 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17875/300s
[INFO ] 2026-06-02 05:22:59.780 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853628},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:22:59.965 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:22:59.965 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 05:22:59.965 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:22:59.965 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:22:59.965 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:23:00.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:23:00.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:23:00.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:23:00.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:23:00.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:23:00.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:23:01.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 05:23:01.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429114,ok=429114,error=0, records=41
[INFO ] 2026-06-02 05:23:04.098 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21467/300s
[WARN ] 2026-06-02 05:23:07.580 [9628 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:23:08.746 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:23:16.688 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-02 05:23:16.688 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429115,ok=429115,error=0, records=41
[WARN ] 2026-06-02 05:23:22.585 [9646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:23:23.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:23:31.693 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 05:23:31.693 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429116,ok=429116,error=0, records=41
[WARN ] 2026-06-02 05:23:37.589 [9646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:23:38.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 05:23:38.747 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 05:23:46.699 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-02 05:23:46.699 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429117,ok=429117,error=0, records=41
[WARN ] 2026-06-02 05:23:52.593 [9693 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:23:53.748 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:23:53.748 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 05:24:01.704 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10426, records=41
[INFO ] 2026-06-02 05:24:01.704 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429118,ok=429118,error=0, records=41
[WARN ] 2026-06-02 05:24:07.598 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:24:08.749 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:24:16.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 05:24:16.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429119,ok=429119,error=0, records=41
[WARN ] 2026-06-02 05:24:22.603 [9658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:24:23.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:24:31.713 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 05:24:31.713 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429120,ok=429120,error=0, records=41
[WARN ] 2026-06-02 05:24:37.609 [9658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:24:38.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:24:46.718 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 05:24:46.718 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429121,ok=429121,error=0, records=41
[WARN ] 2026-06-02 05:24:52.615 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:24:53.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:25:01.724 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 05:25:01.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429122,ok=429122,error=0, records=41
[INFO ] 2026-06-02 05:25:01.763 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21471/300s
[WARN ] 2026-06-02 05:25:07.621 [9646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:25:08.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:25:16.730 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-02 05:25:16.730 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429123,ok=429123,error=0, records=41
[WARN ] 2026-06-02 05:25:22.626 [9646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:25:23.752 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:25:25.126 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21462/300s
[INFO ] 2026-06-02 05:25:31.734 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-02 05:25:31.734 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429124,ok=429124,error=0, records=41
[WARN ] 2026-06-02 05:25:37.630 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:25:38.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:25:42.800 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21471/300s
[INFO ] 2026-06-02 05:25:46.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-02 05:25:46.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429125,ok=429125,error=0, records=41
[WARN ] 2026-06-02 05:25:52.635 [9693 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:25:53.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:25:59.966 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853548},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:26:00.138 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:26:00.138 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 05:26:00.139 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:26:00.139 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:26:00.139 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:26:00.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:26:00.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:26:00.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:26:00.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:26:00.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:26:00.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:26:01.745 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10385, records=41
[INFO ] 2026-06-02 05:26:01.745 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429126,ok=429126,error=0, records=41
[INFO ] 2026-06-02 05:26:01.745 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21458/300s
[WARN ] 2026-06-02 05:26:07.641 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:26:08.754 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:26:16.750 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-02 05:26:16.750 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429127,ok=429127,error=0, records=41
[WARN ] 2026-06-02 05:26:22.646 [9658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:26:23.754 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:26:31.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 05:26:31.754 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429128,ok=429128,error=0, records=41
[WARN ] 2026-06-02 05:26:37.653 [9658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:26:38.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:26:46.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 05:26:46.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429129,ok=429129,error=0, records=41
[INFO ] 2026-06-02 05:26:48.510 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21467/300s
[INFO ] 2026-06-02 05:26:52.490 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21458/300s
[WARN ] 2026-06-02 05:26:52.663 [9693 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:26:53.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:27:01.769 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 05:27:01.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429130,ok=429130,error=0, records=41
[WARN ] 2026-06-02 05:27:07.668 [9675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:27:08.756 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:27:08.756 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21470/300s
[INFO ] 2026-06-02 05:27:16.774 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-02 05:27:16.774 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429131,ok=429131,error=0, records=41
[WARN ] 2026-06-02 05:27:22.673 [9658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:27:23.756 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:27:31.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10147, records=41
[INFO ] 2026-06-02 05:27:31.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429132,ok=429132,error=0, records=41
[WARN ] 2026-06-02 05:27:37.678 [9693 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:27:38.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:27:46.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 05:27:46.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429133,ok=429133,error=0, records=41
[WARN ] 2026-06-02 05:27:52.684 [9646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:27:53.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:27:55.364 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21468/300s
[INFO ] 2026-06-02 05:27:57.165 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21468/300s
[INFO ] 2026-06-02 05:28:01.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10122, records=41
[INFO ] 2026-06-02 05:28:01.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429134,ok=429134,error=0, records=41
[INFO ] 2026-06-02 05:28:04.120 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21468/300s
[WARN ] 2026-06-02 05:28:07.690 [9675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:28:08.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:28:16.796 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-02 05:28:16.796 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429135,ok=429135,error=0, records=41
[WARN ] 2026-06-02 05:28:22.695 [9693 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:28:23.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:28:31.801 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 05:28:31.801 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429136,ok=429136,error=0, records=41
[WARN ] 2026-06-02 05:28:37.700 [9675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:28:38.759 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:28:46.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-02 05:28:46.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429137,ok=429137,error=0, records=41
[WARN ] 2026-06-02 05:28:52.705 [9675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:28:53.760 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:29:00.139 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17876/300s
[INFO ] 2026-06-02 05:29:00.140 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853472},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:29:00.317 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:29:00.317 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 05:29:00.317 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:29:00.317 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:29:00.317 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:29:00.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:29:00.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:29:00.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:29:00.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:29:00.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:29:00.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:29:01.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-02 05:29:01.814 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429138,ok=429138,error=0, records=41
[WARN ] 2026-06-02 05:29:07.712 [9646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:29:08.760 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.73%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:29:16.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 05:29:16.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429139,ok=429139,error=0, records=41
[WARN ] 2026-06-02 05:29:22.717 [9675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:29:23.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:29:31.838 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 05:29:31.838 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429140,ok=429140,error=0, records=41
[WARN ] 2026-06-02 05:29:37.724 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:29:38.762 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:29:46.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 05:29:46.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429141,ok=429141,error=0, records=41
[WARN ] 2026-06-02 05:29:52.728 [9658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:29:53.762 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:30:01.766 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21472/300s
[INFO ] 2026-06-02 05:30:01.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 05:30:01.850 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429142,ok=429142,error=0, records=41
[WARN ] 2026-06-02 05:30:07.733 [9658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:30:08.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:30:16.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 05:30:16.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429143,ok=429143,error=0, records=41
[WARN ] 2026-06-02 05:30:22.738 [9658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:30:23.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:30:25.239 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21463/300s
[INFO ] 2026-06-02 05:30:31.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 05:30:31.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429144,ok=429144,error=0, records=41
[WARN ] 2026-06-02 05:30:37.743 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:30:38.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:30:42.806 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21472/300s
[INFO ] 2026-06-02 05:30:46.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 05:30:46.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429145,ok=429145,error=0, records=41
[WARN ] 2026-06-02 05:30:52.749 [9675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:30:53.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:31:01.872 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 05:31:01.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429146,ok=429146,error=0, records=41
[INFO ] 2026-06-02 05:31:01.872 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21459/300s
[WARN ] 2026-06-02 05:31:07.753 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:31:08.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:31:16.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 05:31:16.877 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429147,ok=429147,error=0, records=41
[WARN ] 2026-06-02 05:31:22.759 [9675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:31:23.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:31:31.882 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 05:31:31.882 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429148,ok=429148,error=0, records=41
[WARN ] 2026-06-02 05:31:37.763 [9658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:31:38.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:31:46.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 05:31:46.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429149,ok=429149,error=0, records=41
[INFO ] 2026-06-02 05:31:48.562 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21468/300s
[INFO ] 2026-06-02 05:31:52.672 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21459/300s
[WARN ] 2026-06-02 05:31:52.767 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:31:53.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:32:00.319 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853400},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:32:00.466 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:32:00.466 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 05:32:00.466 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:32:00.466 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:32:00.466 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:32:00.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:32:00.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:32:00.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:32:00.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:32:00.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:32:00.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:32:01.893 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 05:32:01.893 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429150,ok=429150,error=0, records=41
[WARN ] 2026-06-02 05:32:07.772 [9646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:32:08.768 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:32:08.768 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21471/300s
[INFO ] 2026-06-02 05:32:16.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 05:32:16.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429151,ok=429151,error=0, records=41
[WARN ] 2026-06-02 05:32:22.777 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:32:23.768 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:32:31.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 05:32:31.905 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429152,ok=429152,error=0, records=41
[WARN ] 2026-06-02 05:32:37.782 [9646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:32:38.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:32:46.910 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 05:32:46.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429153,ok=429153,error=0, records=41
[WARN ] 2026-06-02 05:32:52.787 [9658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:32:53.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:32:55.436 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21469/300s
[INFO ] 2026-06-02 05:32:57.238 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21469/300s
[INFO ] 2026-06-02 05:33:01.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 05:33:01.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429154,ok=429154,error=0, records=41
[INFO ] 2026-06-02 05:33:04.177 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21469/300s
[WARN ] 2026-06-02 05:33:07.792 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:33:08.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:33:16.956 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-02 05:33:16.956 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429155,ok=429155,error=0, records=41
[WARN ] 2026-06-02 05:33:22.799 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:33:23.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:33:31.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 05:33:31.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429156,ok=429156,error=0, records=41
[WARN ] 2026-06-02 05:33:37.803 [10215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:33:38.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 05:33:38.772 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 05:33:46.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-02 05:33:46.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429157,ok=429157,error=0, records=41
[WARN ] 2026-06-02 05:33:52.808 [10231] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:33:53.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:34:01.974 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-02 05:34:01.974 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429158,ok=429158,error=0, records=41
[WARN ] 2026-06-02 05:34:07.813 [10246] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:34:08.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:34:16.980 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 05:34:16.980 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429159,ok=429159,error=0, records=41
[WARN ] 2026-06-02 05:34:22.818 [10261] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:34:23.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:34:31.985 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 05:34:31.985 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429160,ok=429160,error=0, records=41
[WARN ] 2026-06-02 05:34:37.823 [10275] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:34:38.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:34:47.057 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 05:34:47.057 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429161,ok=429161,error=0, records=41
[WARN ] 2026-06-02 05:34:52.829 [10289] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:34:53.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:35:00.466 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17877/300s
[INFO ] 2026-06-02 05:35:00.468 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853316},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:35:00.621 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:35:00.621 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 05:35:00.621 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:35:00.622 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:35:00.622 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:35:00.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:35:00.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:35:00.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:35:00.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:35:00.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:35:00.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:35:01.769 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21473/300s
[INFO ] 2026-06-02 05:35:02.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 05:35:02.063 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429162,ok=429162,error=0, records=41
[WARN ] 2026-06-02 05:35:07.834 [10289] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:35:08.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:35:17.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 05:35:17.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429163,ok=429163,error=0, records=41
[WARN ] 2026-06-02 05:35:22.840 [10256] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:35:23.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:35:25.340 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21464/300s
[INFO ] 2026-06-02 05:35:32.075 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 05:35:32.076 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429164,ok=429164,error=0, records=41
[WARN ] 2026-06-02 05:35:37.845 [10256] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:35:38.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:35:42.812 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21473/300s
[INFO ] 2026-06-02 05:35:47.081 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 05:35:47.081 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429165,ok=429165,error=0, records=41
[WARN ] 2026-06-02 05:35:52.850 [10326] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:35:53.778 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:36:02.086 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-02 05:36:02.086 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429166,ok=429166,error=0, records=41
[INFO ] 2026-06-02 05:36:02.086 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21460/300s
[WARN ] 2026-06-02 05:36:07.854 [10261] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:36:08.778 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:36:17.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-02 05:36:17.144 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429167,ok=429167,error=0, records=41
[WARN ] 2026-06-02 05:36:22.860 [10215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:36:23.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:36:32.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-02 05:36:32.158 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429168,ok=429168,error=0, records=41
[WARN ] 2026-06-02 05:36:37.864 [10354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:36:38.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:36:47.166 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-02 05:36:47.166 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429169,ok=429169,error=0, records=41
[INFO ] 2026-06-02 05:36:48.620 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21469/300s
[INFO ] 2026-06-02 05:36:52.856 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21460/300s
[WARN ] 2026-06-02 05:36:52.869 [10340] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:36:53.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:37:02.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-02 05:37:02.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429170,ok=429170,error=0, records=41
[WARN ] 2026-06-02 05:37:07.874 [10396] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:37:08.781 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:37:08.781 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21472/300s
[INFO ] 2026-06-02 05:37:17.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 05:37:17.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429171,ok=429171,error=0, records=41
[WARN ] 2026-06-02 05:37:22.879 [10419] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:37:23.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:37:32.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 05:37:32.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429172,ok=429172,error=0, records=41
[WARN ] 2026-06-02 05:37:37.884 [10440] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:37:38.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:37:47.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 05:37:47.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429173,ok=429173,error=0, records=41
[WARN ] 2026-06-02 05:37:52.889 [10419] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:37:53.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:37:55.519 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21470/300s
[INFO ] 2026-06-02 05:37:57.320 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21470/300s
[INFO ] 2026-06-02 05:38:00.623 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853244},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:38:00.811 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:38:00.811 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 05:38:00.811 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:38:00.811 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:38:00.811 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:38:00.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:38:00.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:38:00.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:38:00.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:38:00.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:38:00.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:38:02.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 05:38:02.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429174,ok=429174,error=0, records=41
[INFO ] 2026-06-02 05:38:04.233 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21470/300s
[WARN ] 2026-06-02 05:38:07.894 [10440] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:38:08.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:38:17.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 05:38:17.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429175,ok=429175,error=0, records=41
[WARN ] 2026-06-02 05:38:22.901 [10419] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:38:23.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:38:32.220 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 05:38:32.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429176,ok=429176,error=0, records=41
[WARN ] 2026-06-02 05:38:37.906 [10507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:38:38.785 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:38:47.228 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 05:38:47.228 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429177,ok=429177,error=0, records=41
[WARN ] 2026-06-02 05:38:52.912 [10530] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:38:53.785 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:38:53.785 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 05:39:02.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 05:39:02.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429178,ok=429178,error=0, records=41
[WARN ] 2026-06-02 05:39:07.917 [10547] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:39:08.787 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:39:17.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 05:39:17.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429179,ok=429179,error=0, records=41
[WARN ] 2026-06-02 05:39:22.922 [10552] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:39:23.787 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:39:32.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 05:39:32.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429180,ok=429180,error=0, records=41
[WARN ] 2026-06-02 05:39:37.927 [10575] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:39:38.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:39:47.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 05:39:47.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429181,ok=429181,error=0, records=41
[WARN ] 2026-06-02 05:39:52.931 [10591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:39:53.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:40:01.774 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21474/300s
[INFO ] 2026-06-02 05:40:02.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 05:40:02.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429182,ok=429182,error=0, records=41
[WARN ] 2026-06-02 05:40:07.936 [10613] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:40:08.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:40:17.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10354, records=41
[INFO ] 2026-06-02 05:40:17.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429183,ok=429183,error=0, records=41
[WARN ] 2026-06-02 05:40:22.942 [10636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:40:23.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:40:25.442 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21465/300s
[INFO ] 2026-06-02 05:40:32.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-02 05:40:32.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429184,ok=429184,error=0, records=41
[WARN ] 2026-06-02 05:40:37.947 [10636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:40:38.791 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:40:42.819 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21474/300s
[INFO ] 2026-06-02 05:40:47.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10386, records=41
[INFO ] 2026-06-02 05:40:47.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429185,ok=429185,error=0, records=41
[WARN ] 2026-06-02 05:40:52.951 [10646] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:40:53.791 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:41:00.811 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17878/300s
[INFO ] 2026-06-02 05:41:00.813 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853156},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:41:00.975 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:41:00.975 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 05:41:00.976 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:41:00.976 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:41:00.976 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:41:01.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:41:01.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:41:01.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:41:01.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:41:01.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:41:01.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:41:02.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10383, records=41
[INFO ] 2026-06-02 05:41:02.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429186,ok=429186,error=0, records=41
[INFO ] 2026-06-02 05:41:02.274 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21461/300s
[WARN ] 2026-06-02 05:41:07.956 [10636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:41:08.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:41:17.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 05:41:17.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429187,ok=429187,error=0, records=41
[WARN ] 2026-06-02 05:41:22.961 [10646] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:41:23.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:41:32.284 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 05:41:32.284 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429188,ok=429188,error=0, records=41
[WARN ] 2026-06-02 05:41:37.966 [10647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:41:38.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:41:47.290 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 05:41:47.290 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429189,ok=429189,error=0, records=41
[INFO ] 2026-06-02 05:41:48.674 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21470/300s
[WARN ] 2026-06-02 05:41:52.971 [10636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:41:53.034 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21461/300s
[INFO ] 2026-06-02 05:41:53.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:42:02.296 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 05:42:02.296 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429190,ok=429190,error=0, records=41
[WARN ] 2026-06-02 05:42:07.975 [10646] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:42:08.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:42:08.794 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21473/300s
[INFO ] 2026-06-02 05:42:17.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 05:42:17.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429191,ok=429191,error=0, records=41
[WARN ] 2026-06-02 05:42:22.981 [10646] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:42:23.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:42:32.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-02 05:42:32.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429192,ok=429192,error=0, records=41
[WARN ] 2026-06-02 05:42:37.985 [10761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:42:38.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:42:47.404 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-02 05:42:47.404 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429193,ok=429193,error=0, records=41
[WARN ] 2026-06-02 05:42:52.990 [10761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:42:53.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:42:55.564 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21471/300s
[INFO ] 2026-06-02 05:42:57.364 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21471/300s
[INFO ] 2026-06-02 05:43:02.409 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 05:43:02.409 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429194,ok=429194,error=0, records=41
[INFO ] 2026-06-02 05:43:04.272 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21471/300s
[WARN ] 2026-06-02 05:43:07.995 [10647] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:43:08.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:43:17.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-02 05:43:17.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429195,ok=429195,error=0, records=41
[WARN ] 2026-06-02 05:43:23.000 [10646] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:43:23.797 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:43:32.434 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 05:43:32.434 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429196,ok=429196,error=0, records=41
[WARN ] 2026-06-02 05:43:32.504 [10733] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6380/stat), No such file or directory
[WARN ] 2026-06-02 05:43:32.505 [10733] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6520/stat), No such file or directory
[WARN ] 2026-06-02 05:43:38.005 [10733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:43:38.797 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 05:43:38.797 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 05:43:47.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 05:43:47.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429197,ok=429197,error=0, records=41
[WARN ] 2026-06-02 05:43:47.509 [10733] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6380/stat), No such file or directory
[WARN ] 2026-06-02 05:43:47.509 [10733] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6531/stat), No such file or directory
[WARN ] 2026-06-02 05:43:47.509 [10733] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/6520/stat), No such file or directory
[WARN ] 2026-06-02 05:43:53.011 [10733] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:43:53.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:44:00.978 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20853060},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:44:01.142 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:44:01.142 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 05:44:01.142 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:44:01.142 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:44:01.142 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:44:01.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:44:01.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:44:01.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:44:01.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:44:01.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:44:01.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:44:02.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-02 05:44:02.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429198,ok=429198,error=0, records=41
[WARN ] 2026-06-02 05:44:08.016 [10719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:44:08.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:44:17.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-02 05:44:17.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429199,ok=429199,error=0, records=41
[WARN ] 2026-06-02 05:44:23.021 [10719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:44:23.799 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:44:32.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-02 05:44:32.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429200,ok=429200,error=0, records=41
[WARN ] 2026-06-02 05:44:38.026 [10945] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:44:38.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:44:47.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-02 05:44:47.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429201,ok=429201,error=0, records=41
[WARN ] 2026-06-02 05:44:53.031 [10719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:44:53.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:45:01.777 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21475/300s
[INFO ] 2026-06-02 05:45:02.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 05:45:02.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429202,ok=429202,error=0, records=41
[WARN ] 2026-06-02 05:45:08.036 [10889] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:45:08.801 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:45:17.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 05:45:17.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429203,ok=429203,error=0, records=41
[WARN ] 2026-06-02 05:45:23.040 [10994] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:45:23.801 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:45:25.541 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21466/300s
[INFO ] 2026-06-02 05:45:32.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 05:45:32.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429204,ok=429204,error=0, records=41
[WARN ] 2026-06-02 05:45:38.045 [11011] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:45:38.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:45:42.824 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21475/300s
[INFO ] 2026-06-02 05:45:47.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-02 05:45:47.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429205,ok=429205,error=0, records=41
[WARN ] 2026-06-02 05:45:53.049 [11011] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:45:53.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:46:02.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 05:46:02.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429206,ok=429206,error=0, records=41
[INFO ] 2026-06-02 05:46:02.486 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21462/300s
[WARN ] 2026-06-02 05:46:08.054 [11038] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:46:08.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:46:17.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-02 05:46:17.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429207,ok=429207,error=0, records=41
[WARN ] 2026-06-02 05:46:22.558 [11059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:46:23.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:46:32.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-02 05:46:32.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429208,ok=429208,error=0, records=41
[WARN ] 2026-06-02 05:46:37.562 [11038] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:46:38.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:46:47.505 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-02 05:46:47.505 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429209,ok=429209,error=0, records=41
[INFO ] 2026-06-02 05:46:48.718 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21471/300s
[WARN ] 2026-06-02 05:46:52.568 [11094] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:46:53.210 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21462/300s
[INFO ] 2026-06-02 05:46:53.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:47:01.142 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17879/300s
[INFO ] 2026-06-02 05:47:01.144 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852988},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:47:01.323 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:47:01.324 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 05:47:01.324 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:47:01.324 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:47:01.324 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:47:01.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:47:01.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:47:01.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:47:01.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:47:01.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:47:01.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:47:02.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-02 05:47:02.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429210,ok=429210,error=0, records=41
[WARN ] 2026-06-02 05:47:07.572 [11106] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:47:08.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:47:08.805 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21474/300s
[INFO ] 2026-06-02 05:47:17.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 05:47:17.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429211,ok=429211,error=0, records=41
[WARN ] 2026-06-02 05:47:22.576 [11130] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:47:23.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:47:32.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 05:47:32.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429212,ok=429212,error=0, records=41
[WARN ] 2026-06-02 05:47:37.581 [11124] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:47:38.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:47:47.531 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 05:47:47.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429213,ok=429213,error=0, records=41
[WARN ] 2026-06-02 05:47:52.586 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:47:53.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:47:55.587 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21472/300s
[INFO ] 2026-06-02 05:47:57.388 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21472/300s
[INFO ] 2026-06-02 05:48:02.538 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 05:48:02.538 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429214,ok=429214,error=0, records=41
[INFO ] 2026-06-02 05:48:04.295 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21472/300s
[WARN ] 2026-06-02 05:48:07.592 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:48:08.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:48:17.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 05:48:17.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429215,ok=429215,error=0, records=41
[WARN ] 2026-06-02 05:48:22.596 [11196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:48:23.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:48:32.560 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 05:48:32.560 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429216,ok=429216,error=0, records=41
[WARN ] 2026-06-02 05:48:37.602 [11196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:48:38.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:48:47.566 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 05:48:47.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429217,ok=429217,error=0, records=41
[WARN ] 2026-06-02 05:48:52.607 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:48:53.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:49:02.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 05:49:02.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429218,ok=429218,error=0, records=41
[WARN ] 2026-06-02 05:49:07.612 [11176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:49:08.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:49:17.585 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 05:49:17.586 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429219,ok=429219,error=0, records=41
[WARN ] 2026-06-02 05:49:22.617 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:49:23.811 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:49:32.593 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 05:49:32.593 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429220,ok=429220,error=0, records=41
[WARN ] 2026-06-02 05:49:37.622 [11158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:49:38.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:49:47.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 05:49:47.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429221,ok=429221,error=0, records=41
[WARN ] 2026-06-02 05:49:52.627 [11158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:49:53.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:50:01.325 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852912},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:50:01.476 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:50:01.476 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 05:50:01.476 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:50:01.476 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:50:01.476 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:50:01.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:50:01.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:50:01.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:50:01.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:50:01.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:50:01.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:50:01.780 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21476/300s
[INFO ] 2026-06-02 05:50:02.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 05:50:02.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429222,ok=429222,error=0, records=41
[WARN ] 2026-06-02 05:50:07.632 [11076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:50:08.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:50:17.612 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 05:50:17.612 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429223,ok=429223,error=0, records=41
[WARN ] 2026-06-02 05:50:22.638 [11158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:50:23.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:50:25.639 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21467/300s
[INFO ] 2026-06-02 05:50:32.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 05:50:32.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429224,ok=429224,error=0, records=41
[WARN ] 2026-06-02 05:50:37.643 [11076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:50:38.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:50:42.830 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21476/300s
[INFO ] 2026-06-02 05:50:47.629 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 05:50:47.629 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429225,ok=429225,error=0, records=41
[WARN ] 2026-06-02 05:50:52.648 [11076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:50:53.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:51:02.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 05:51:02.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429226,ok=429226,error=0, records=41
[INFO ] 2026-06-02 05:51:02.638 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21463/300s
[WARN ] 2026-06-02 05:51:07.653 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:51:08.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:51:17.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-02 05:51:17.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429227,ok=429227,error=0, records=41
[WARN ] 2026-06-02 05:51:22.659 [11176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:51:23.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:51:32.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-02 05:51:32.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429228,ok=429228,error=0, records=41
[WARN ] 2026-06-02 05:51:37.663 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:51:38.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:51:47.749 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-02 05:51:47.749 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429229,ok=429229,error=0, records=41
[INFO ] 2026-06-02 05:51:48.774 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21472/300s
[WARN ] 2026-06-02 05:51:52.669 [11196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:51:53.391 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21463/300s
[INFO ] 2026-06-02 05:51:53.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:52:02.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=14398, records=52
[INFO ] 2026-06-02 05:52:02.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429230,ok=429230,error=0, records=52
[WARN ] 2026-06-02 05:52:07.674 [11196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:52:08.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:52:08.818 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21475/300s
[INFO ] 2026-06-02 05:52:17.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 05:52:17.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429231,ok=429231,error=0, records=41
[WARN ] 2026-06-02 05:52:22.678 [11176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:52:23.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:52:32.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 05:52:32.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429232,ok=429232,error=0, records=41
[WARN ] 2026-06-02 05:52:37.683 [11076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:52:38.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:52:47.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 05:52:47.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429233,ok=429233,error=0, records=41
[WARN ] 2026-06-02 05:52:52.688 [11076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:52:53.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:52:55.655 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21473/300s
[INFO ] 2026-06-02 05:52:57.456 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21473/300s
[INFO ] 2026-06-02 05:53:01.476 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17880/300s
[INFO ] 2026-06-02 05:53:01.478 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852852},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:53:01.655 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:53:01.655 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 05:53:01.655 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:53:01.655 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:53:01.655 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:53:01.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:53:01.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:53:01.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:53:01.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:53:01.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:53:01.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:53:02.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 05:53:02.783 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429234,ok=429234,error=0, records=41
[INFO ] 2026-06-02 05:53:04.353 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21473/300s
[WARN ] 2026-06-02 05:53:07.695 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:53:08.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:53:17.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 05:53:17.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429235,ok=429235,error=0, records=41
[WARN ] 2026-06-02 05:53:22.700 [11158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:53:23.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:53:32.796 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 05:53:32.796 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429236,ok=429236,error=0, records=41
[WARN ] 2026-06-02 05:53:37.704 [11076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:53:38.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 05:53:38.822 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 05:53:47.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 05:53:47.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429237,ok=429237,error=0, records=41
[WARN ] 2026-06-02 05:53:52.709 [11076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:53:53.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:53:53.823 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 05:54:02.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 05:54:02.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429238,ok=429238,error=0, records=41
[WARN ] 2026-06-02 05:54:07.714 [11158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:54:08.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:54:17.818 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 05:54:17.818 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429239,ok=429239,error=0, records=41
[WARN ] 2026-06-02 05:54:22.720 [11176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:54:23.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:54:32.823 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-02 05:54:32.823 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429240,ok=429240,error=0, records=41
[WARN ] 2026-06-02 05:54:37.725 [11196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:54:38.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:54:47.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 05:54:47.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429241,ok=429241,error=0, records=41
[WARN ] 2026-06-02 05:54:52.730 [11196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:54:53.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:55:01.783 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21477/300s
[INFO ] 2026-06-02 05:55:02.833 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 05:55:02.833 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429242,ok=429242,error=0, records=41
[WARN ] 2026-06-02 05:55:07.736 [11176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:55:08.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:55:17.838 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 05:55:17.838 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429243,ok=429243,error=0, records=41
[WARN ] 2026-06-02 05:55:22.741 [11158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:55:23.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:55:25.742 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21468/300s
[INFO ] 2026-06-02 05:55:32.844 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10146, records=41
[INFO ] 2026-06-02 05:55:32.844 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429244,ok=429244,error=0, records=41
[WARN ] 2026-06-02 05:55:37.746 [11196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:55:38.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:55:42.837 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21477/300s
[INFO ] 2026-06-02 05:55:47.849 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-02 05:55:47.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429245,ok=429245,error=0, records=41
[WARN ] 2026-06-02 05:55:52.752 [11158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:55:53.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:56:01.657 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852784},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:56:01.805 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:56:01.805 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 05:56:01.805 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:56:01.805 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:56:01.805 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:56:01.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:56:01.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:56:01.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:56:01.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:56:01.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:56:01.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:56:02.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 05:56:02.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429246,ok=429246,error=0, records=41
[INFO ] 2026-06-02 05:56:02.856 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21464/300s
[WARN ] 2026-06-02 05:56:07.757 [11176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:56:08.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:56:17.862 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 05:56:17.862 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429247,ok=429247,error=0, records=41
[WARN ] 2026-06-02 05:56:22.762 [11176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:56:23.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:56:32.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 05:56:32.868 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429248,ok=429248,error=0, records=41
[WARN ] 2026-06-02 05:56:37.767 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:56:38.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:56:47.876 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 05:56:47.876 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429249,ok=429249,error=0, records=41
[INFO ] 2026-06-02 05:56:48.828 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21473/300s
[WARN ] 2026-06-02 05:56:52.772 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:56:53.575 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21464/300s
[INFO ] 2026-06-02 05:56:53.832 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:57:02.883 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 05:57:02.883 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429250,ok=429250,error=0, records=41
[WARN ] 2026-06-02 05:57:07.776 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:57:08.832 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:57:08.832 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21476/300s
[INFO ] 2026-06-02 05:57:17.888 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-02 05:57:17.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429251,ok=429251,error=0, records=41
[WARN ] 2026-06-02 05:57:22.780 [11176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:57:23.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:57:32.893 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 05:57:32.894 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429252,ok=429252,error=0, records=41
[WARN ] 2026-06-02 05:57:37.784 [11196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:57:38.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:57:47.899 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 05:57:47.899 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429253,ok=429253,error=0, records=41
[WARN ] 2026-06-02 05:57:52.789 [11076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:57:53.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:57:55.705 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21474/300s
[INFO ] 2026-06-02 05:57:57.506 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21474/300s
[INFO ] 2026-06-02 05:58:02.906 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 05:58:02.906 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429254,ok=429254,error=0, records=41
[INFO ] 2026-06-02 05:58:04.403 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21474/300s
[WARN ] 2026-06-02 05:58:07.795 [11176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:58:08.835 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:58:17.912 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 05:58:17.912 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429255,ok=429255,error=0, records=41
[WARN ] 2026-06-02 05:58:22.800 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:58:23.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:58:32.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 05:58:32.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429256,ok=429256,error=0, records=41
[WARN ] 2026-06-02 05:58:37.805 [11196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:58:38.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:58:47.929 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 05:58:47.929 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429257,ok=429257,error=0, records=41
[WARN ] 2026-06-02 05:58:52.811 [11158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:58:53.837 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:59:01.805 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17881/300s
[INFO ] 2026-06-02 05:59:01.807 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852716},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 05:59:01.989 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 05:59:01.989 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 05:59:01.989 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 05:59:01.989 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 05:59:01.989 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 05:59:02.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:59:02.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 05:59:02.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:59:02.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 05:59:02.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:59:02.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 05:59:02.934 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 05:59:02.934 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429258,ok=429258,error=0, records=41
[WARN ] 2026-06-02 05:59:07.816 [11158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:59:08.837 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:59:17.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 05:59:17.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429259,ok=429259,error=0, records=41
[WARN ] 2026-06-02 05:59:22.821 [11787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:59:23.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:59:32.947 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 05:59:32.947 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429260,ok=429260,error=0, records=41
[WARN ] 2026-06-02 05:59:37.826 [11773] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:59:38.839 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 05:59:47.952 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 05:59:47.952 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429261,ok=429261,error=0, records=41
[WARN ] 2026-06-02 05:59:52.830 [11815] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 05:59:53.839 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:00:01.787 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21478/300s
[INFO ] 2026-06-02 06:00:02.957 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 06:00:02.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429262,ok=429262,error=0, records=41
[WARN ] 2026-06-02 06:00:07.836 [11801] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:00:08.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:00:17.962 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 06:00:17.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429263,ok=429263,error=0, records=41
[WARN ] 2026-06-02 06:00:22.840 [11767] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:00:23.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:00:25.841 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21469/300s
[INFO ] 2026-06-02 06:00:32.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 06:00:32.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429264,ok=429264,error=0, records=41
[WARN ] 2026-06-02 06:00:37.845 [11843] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:00:38.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:00:42.843 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21478/300s
[INFO ] 2026-06-02 06:00:47.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 06:00:47.973 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429265,ok=429265,error=0, records=41
[WARN ] 2026-06-02 06:00:52.850 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:00:53.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:01:02.978 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-02 06:01:02.978 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429266,ok=429266,error=0, records=41
[INFO ] 2026-06-02 06:01:02.978 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21465/300s
[WARN ] 2026-06-02 06:01:07.855 [11843] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:01:08.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:01:17.983 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-02 06:01:17.983 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429267,ok=429267,error=0, records=41
[WARN ] 2026-06-02 06:01:22.861 [11896] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:01:23.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:01:32.988 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-02 06:01:32.988 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429268,ok=429268,error=0, records=41
[WARN ] 2026-06-02 06:01:37.866 [11164] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:01:38.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:01:47.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10122, records=41
[INFO ] 2026-06-02 06:01:47.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429269,ok=429269,error=0, records=41
[INFO ] 2026-06-02 06:01:48.882 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21474/300s
[WARN ] 2026-06-02 06:01:52.870 [11923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:01:53.759 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21465/300s
[INFO ] 2026-06-02 06:01:53.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:02:01.991 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852660},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:02:02.148 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:02:02.148 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 06:02:02.148 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:02:02.148 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:02:02.148 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:02:02.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:02:02.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:02:02.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:02:02.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:02:02.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:02:02.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:02:02.999 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 06:02:02.999 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429270,ok=429270,error=0, records=41
[WARN ] 2026-06-02 06:02:07.877 [11787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:02:08.845 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:02:08.845 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21477/300s
[INFO ] 2026-06-02 06:02:18.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 06:02:18.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429271,ok=429271,error=0, records=41
[WARN ] 2026-06-02 06:02:22.883 [11975] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:02:23.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:02:33.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 06:02:33.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429272,ok=429272,error=0, records=41
[WARN ] 2026-06-02 06:02:37.889 [11980] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:02:38.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:02:48.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 06:02:48.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429273,ok=429273,error=0, records=41
[WARN ] 2026-06-02 06:02:52.894 [12007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:02:53.847 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:02:55.759 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21475/300s
[INFO ] 2026-06-02 06:02:57.561 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21475/300s
[INFO ] 2026-06-02 06:03:03.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 06:03:03.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429274,ok=429274,error=0, records=41
[INFO ] 2026-06-02 06:03:04.460 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21475/300s
[WARN ] 2026-06-02 06:03:07.900 [12007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:03:08.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:03:18.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 06:03:18.090 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429275,ok=429275,error=0, records=41
[WARN ] 2026-06-02 06:03:22.907 [12007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:03:23.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:03:33.095 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 06:03:33.095 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429276,ok=429276,error=0, records=41
[WARN ] 2026-06-02 06:03:37.912 [12056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:03:38.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 06:03:38.849 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 06:03:48.100 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 06:03:48.100 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429277,ok=429277,error=0, records=41
[WARN ] 2026-06-02 06:03:52.918 [12071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:03:53.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:04:03.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10407, records=41
[INFO ] 2026-06-02 06:04:03.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429278,ok=429278,error=0, records=41
[WARN ] 2026-06-02 06:04:07.923 [12087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:04:08.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:04:18.112 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10383, records=41
[INFO ] 2026-06-02 06:04:18.112 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429279,ok=429279,error=0, records=41
[WARN ] 2026-06-02 06:04:22.927 [12061] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:04:23.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:04:33.118 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10381, records=41
[INFO ] 2026-06-02 06:04:33.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429280,ok=429280,error=0, records=41
[WARN ] 2026-06-02 06:04:37.932 [12120] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:04:38.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:04:48.124 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10374, records=41
[INFO ] 2026-06-02 06:04:48.124 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429281,ok=429281,error=0, records=41
[WARN ] 2026-06-02 06:04:52.938 [12113] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:04:53.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:05:01.790 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21479/300s
[INFO ] 2026-06-02 06:05:02.148 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17882/300s
[INFO ] 2026-06-02 06:05:02.150 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852644},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:05:02.317 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:05:02.317 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 06:05:02.317 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:05:02.317 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:05:02.317 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:05:02.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:05:02.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:05:02.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:05:02.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:05:02.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:05:02.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:05:03.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 06:05:03.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429282,ok=429282,error=0, records=41
[WARN ] 2026-06-02 06:05:07.943 [12113] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:05:08.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:05:18.134 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 06:05:18.134 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429283,ok=429283,error=0, records=41
[WARN ] 2026-06-02 06:05:22.949 [12162] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:05:23.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:05:25.950 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21470/300s
[INFO ] 2026-06-02 06:05:33.140 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 06:05:33.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429284,ok=429284,error=0, records=41
[WARN ] 2026-06-02 06:05:37.954 [12162] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:05:38.854 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:05:42.849 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21479/300s
[INFO ] 2026-06-02 06:05:48.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 06:05:48.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429285,ok=429285,error=0, records=41
[WARN ] 2026-06-02 06:05:52.959 [12146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:05:53.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:06:03.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-02 06:06:03.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429286,ok=429286,error=0, records=41
[INFO ] 2026-06-02 06:06:03.154 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21466/300s
[WARN ] 2026-06-02 06:06:07.964 [12162] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:06:08.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:06:18.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-02 06:06:18.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429287,ok=429287,error=0, records=41
[WARN ] 2026-06-02 06:06:22.968 [12146] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:06:23.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:06:33.166 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-02 06:06:33.166 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429288,ok=429288,error=0, records=41
[WARN ] 2026-06-02 06:06:37.973 [12232] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:06:38.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:06:48.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-02 06:06:48.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429289,ok=429289,error=0, records=41
[INFO ] 2026-06-02 06:06:48.938 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21475/300s
[WARN ] 2026-06-02 06:06:52.977 [12156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:06:53.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:06:53.942 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21466/300s
[INFO ] 2026-06-02 06:07:03.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10149, records=41
[INFO ] 2026-06-02 06:07:03.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429290,ok=429290,error=0, records=41
[WARN ] 2026-06-02 06:07:07.982 [12260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:07:08.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:07:08.858 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21478/300s
[INFO ] 2026-06-02 06:07:18.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10130, records=41
[INFO ] 2026-06-02 06:07:18.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429291,ok=429291,error=0, records=41
[WARN ] 2026-06-02 06:07:22.987 [12162] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:07:23.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:07:33.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10123, records=41
[INFO ] 2026-06-02 06:07:33.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429292,ok=429292,error=0, records=41
[WARN ] 2026-06-02 06:07:37.991 [12260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:07:38.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:07:48.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-02 06:07:48.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429293,ok=429293,error=0, records=41
[WARN ] 2026-06-02 06:07:52.996 [12288] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:07:53.860 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:07:55.813 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21476/300s
[INFO ] 2026-06-02 06:07:57.615 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21476/300s
[INFO ] 2026-06-02 06:08:02.319 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852632},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:08:02.484 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:08:02.484 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 06:08:02.484 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:08:02.484 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:08:02.484 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:08:02.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:08:02.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:08:02.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:08:02.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:08:02.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:08:02.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:08:03.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10391, records=41
[INFO ] 2026-06-02 06:08:03.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429294,ok=429294,error=0, records=41
[INFO ] 2026-06-02 06:08:04.521 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21476/300s
[WARN ] 2026-06-02 06:08:08.001 [12302] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:08:08.860 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:08:18.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-02 06:08:18.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429295,ok=429295,error=0, records=41
[WARN ] 2026-06-02 06:08:23.006 [12333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:08:23.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:08:33.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10398, records=41
[INFO ] 2026-06-02 06:08:33.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429296,ok=429296,error=0, records=41
[WARN ] 2026-06-02 06:08:38.010 [12333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:08:38.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:08:48.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10380, records=41
[INFO ] 2026-06-02 06:08:48.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429297,ok=429297,error=0, records=41
[WARN ] 2026-06-02 06:08:53.015 [12333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:08:53.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:08:53.862 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 06:09:03.223 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-02 06:09:03.223 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429298,ok=429298,error=0, records=41
[WARN ] 2026-06-02 06:09:08.020 [12361] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:09:08.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:09:18.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10144, records=41
[INFO ] 2026-06-02 06:09:18.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429299,ok=429299,error=0, records=41
[WARN ] 2026-06-02 06:09:23.025 [12333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:09:23.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:09:33.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10122, records=41
[INFO ] 2026-06-02 06:09:33.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429300,ok=429300,error=0, records=41
[WARN ] 2026-06-02 06:09:38.031 [12125] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:09:38.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:09:48.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10151, records=41
[INFO ] 2026-06-02 06:09:48.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429301,ok=429301,error=0, records=41
[WARN ] 2026-06-02 06:09:53.036 [12375] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:09:53.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:10:01.794 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21480/300s
[INFO ] 2026-06-02 06:10:03.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-02 06:10:03.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429302,ok=429302,error=0, records=41
[WARN ] 2026-06-02 06:10:08.042 [12420] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:10:08.866 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:10:18.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-02 06:10:18.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429303,ok=429303,error=0, records=41
[WARN ] 2026-06-02 06:10:23.047 [12456] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:10:23.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:10:26.047 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21471/300s
[INFO ] 2026-06-02 06:10:33.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-02 06:10:33.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429304,ok=429304,error=0, records=41
[WARN ] 2026-06-02 06:10:38.052 [12420] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:10:38.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:10:42.856 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21480/300s
[INFO ] 2026-06-02 06:10:48.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-02 06:10:48.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429305,ok=429305,error=0, records=41
[WARN ] 2026-06-02 06:10:52.558 [12439] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:10:53.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:11:02.484 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17883/300s
[INFO ] 2026-06-02 06:11:02.486 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852612},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:11:02.642 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:11:02.642 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 06:11:02.642 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:11:02.642 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:11:02.642 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:11:02.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:11:02.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:11:02.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:11:02.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:11:02.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:11:02.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:11:03.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 06:11:03.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429306,ok=429306,error=0, records=41
[INFO ] 2026-06-02 06:11:03.275 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21467/300s
[WARN ] 2026-06-02 06:11:07.563 [12515] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:11:08.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:11:18.359 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 06:11:18.359 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429307,ok=429307,error=0, records=41
[WARN ] 2026-06-02 06:11:22.567 [12525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:11:23.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:11:33.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 06:11:33.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429308,ok=429308,error=0, records=41
[WARN ] 2026-06-02 06:11:37.572 [12542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:11:38.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:11:48.372 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 06:11:48.372 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429309,ok=429309,error=0, records=41
[INFO ] 2026-06-02 06:11:48.993 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21476/300s
[WARN ] 2026-06-02 06:11:52.578 [12554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:11:53.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:11:54.115 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21467/300s
[INFO ] 2026-06-02 06:12:03.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-02 06:12:03.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429310,ok=429310,error=0, records=41
[WARN ] 2026-06-02 06:12:07.583 [12575] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:12:08.871 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:12:08.871 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21479/300s
[INFO ] 2026-06-02 06:12:18.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-02 06:12:18.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429311,ok=429311,error=0, records=41
[WARN ] 2026-06-02 06:12:22.588 [12584] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:12:23.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:12:33.389 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-02 06:12:33.389 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429312,ok=429312,error=0, records=41
[WARN ] 2026-06-02 06:12:37.593 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:12:38.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:12:48.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-02 06:12:48.395 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429313,ok=429313,error=0, records=41
[WARN ] 2026-06-02 06:12:52.599 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:12:53.873 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:12:55.885 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21477/300s
[INFO ] 2026-06-02 06:12:57.687 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21477/300s
[INFO ] 2026-06-02 06:13:03.401 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 06:13:03.401 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429314,ok=429314,error=0, records=41
[INFO ] 2026-06-02 06:13:04.594 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21477/300s
[WARN ] 2026-06-02 06:13:07.604 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:13:08.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:13:18.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-02 06:13:18.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429315,ok=429315,error=0, records=41
[WARN ] 2026-06-02 06:13:22.608 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:13:23.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:13:33.419 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-02 06:13:33.419 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429316,ok=429316,error=0, records=41
[WARN ] 2026-06-02 06:13:37.614 [12599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:13:38.875 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 06:13:38.875 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 06:13:48.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-02 06:13:48.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429317,ok=429317,error=0, records=41
[WARN ] 2026-06-02 06:13:52.618 [12600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:13:53.876 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:14:02.644 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852596},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:14:02.804 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:14:02.804 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 06:14:02.804 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:14:02.804 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:14:02.804 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:14:02.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:14:02.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:14:02.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:14:02.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:14:02.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:14:02.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:14:03.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 06:14:03.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429318,ok=429318,error=0, records=41
[WARN ] 2026-06-02 06:14:07.624 [12625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:14:08.876 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:14:18.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 06:14:18.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429319,ok=429319,error=0, records=41
[WARN ] 2026-06-02 06:14:22.628 [12600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:14:23.877 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:14:33.445 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 06:14:33.445 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429320,ok=429320,error=0, records=41
[WARN ] 2026-06-02 06:14:37.633 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:14:38.877 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:14:48.451 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 06:14:48.451 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429321,ok=429321,error=0, records=41
[WARN ] 2026-06-02 06:14:52.638 [12599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:14:53.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:15:01.797 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21481/300s
[INFO ] 2026-06-02 06:15:03.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 06:15:03.456 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429322,ok=429322,error=0, records=41
[WARN ] 2026-06-02 06:15:07.644 [12600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:15:08.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:15:18.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 06:15:18.462 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429323,ok=429323,error=0, records=41
[WARN ] 2026-06-02 06:15:22.648 [12625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:15:23.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:15:26.149 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21472/300s
[INFO ] 2026-06-02 06:15:33.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 06:15:33.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429324,ok=429324,error=0, records=41
[WARN ] 2026-06-02 06:15:37.653 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:15:38.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:15:42.862 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21481/300s
[INFO ] 2026-06-02 06:15:48.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 06:15:48.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429325,ok=429325,error=0, records=41
[WARN ] 2026-06-02 06:15:52.659 [12593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:15:53.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:16:03.576 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 06:16:03.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429326,ok=429326,error=0, records=41
[INFO ] 2026-06-02 06:16:03.577 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21468/300s
[WARN ] 2026-06-02 06:16:07.664 [12593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:16:08.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:16:18.583 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 06:16:18.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429327,ok=429327,error=0, records=41
[WARN ] 2026-06-02 06:16:22.672 [12625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:16:23.882 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:16:33.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 06:16:33.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429328,ok=429328,error=0, records=41
[WARN ] 2026-06-02 06:16:37.676 [12600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:16:38.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:16:48.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 06:16:48.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429329,ok=429329,error=0, records=41
[INFO ] 2026-06-02 06:16:49.053 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21477/300s
[WARN ] 2026-06-02 06:16:52.681 [12600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:16:53.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:16:54.300 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21468/300s
[INFO ] 2026-06-02 06:17:02.804 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17884/300s
[INFO ] 2026-06-02 06:17:02.806 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852580},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:17:02.990 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:17:02.990 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 06:17:02.990 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:17:02.990 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:17:02.990 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:17:03.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:17:03.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:17:03.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:17:03.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:17:03.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:17:03.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:17:03.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 06:17:03.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429330,ok=429330,error=0, records=41
[WARN ] 2026-06-02 06:17:07.686 [12599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:17:08.884 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:17:08.884 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21480/300s
[INFO ] 2026-06-02 06:17:18.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 06:17:18.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429331,ok=429331,error=0, records=41
[WARN ] 2026-06-02 06:17:22.692 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:17:23.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:17:33.614 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 06:17:33.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429332,ok=429332,error=0, records=41
[WARN ] 2026-06-02 06:17:37.697 [12600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:17:38.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:17:48.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 06:17:48.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429333,ok=429333,error=0, records=41
[WARN ] 2026-06-02 06:17:52.703 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:17:53.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:17:55.963 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21478/300s
[INFO ] 2026-06-02 06:17:57.765 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21478/300s
[INFO ] 2026-06-02 06:18:03.626 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 06:18:03.626 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429334,ok=429334,error=0, records=41
[INFO ] 2026-06-02 06:18:04.671 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21478/300s
[WARN ] 2026-06-02 06:18:07.708 [12599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:18:08.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:18:18.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-02 06:18:18.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429335,ok=429335,error=0, records=41
[WARN ] 2026-06-02 06:18:22.713 [12599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:18:23.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:18:33.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-02 06:18:33.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429336,ok=429336,error=0, records=41
[WARN ] 2026-06-02 06:18:37.718 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:18:38.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:18:48.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-02 06:18:48.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429337,ok=429337,error=0, records=41
[WARN ] 2026-06-02 06:18:52.722 [12600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:18:53.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:19:03.648 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-02 06:19:03.648 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429338,ok=429338,error=0, records=41
[WARN ] 2026-06-02 06:19:07.727 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:19:08.890 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:19:18.654 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-02 06:19:18.654 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429339,ok=429339,error=0, records=41
[WARN ] 2026-06-02 06:19:22.732 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:19:23.890 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:19:33.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 06:19:33.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429340,ok=429340,error=0, records=41
[WARN ] 2026-06-02 06:19:37.737 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:19:38.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:19:48.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-02 06:19:48.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429341,ok=429341,error=0, records=41
[WARN ] 2026-06-02 06:19:52.743 [12593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:19:53.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:20:01.801 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21482/300s
[INFO ] 2026-06-02 06:20:02.992 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852568},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:20:03.149 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:20:03.149 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 06:20:03.149 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:20:03.149 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:20:03.149 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:20:03.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:20:03.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:20:03.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:20:03.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:20:03.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:20:03.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:20:03.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-02 06:20:03.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429342,ok=429342,error=0, records=41
[WARN ] 2026-06-02 06:20:07.748 [12625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:20:08.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:20:18.688 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-02 06:20:18.688 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429343,ok=429343,error=0, records=41
[WARN ] 2026-06-02 06:20:22.753 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:20:23.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:20:26.255 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21473/300s
[INFO ] 2026-06-02 06:20:33.696 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-02 06:20:33.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429344,ok=429344,error=0, records=41
[WARN ] 2026-06-02 06:20:37.759 [12599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:20:38.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:20:42.869 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21482/300s
[INFO ] 2026-06-02 06:20:48.702 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-02 06:20:48.702 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429345,ok=429345,error=0, records=41
[WARN ] 2026-06-02 06:20:52.764 [12599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:20:53.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:21:03.708 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 06:21:03.708 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429346,ok=429346,error=0, records=41
[INFO ] 2026-06-02 06:21:03.708 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21469/300s
[WARN ] 2026-06-02 06:21:07.769 [12625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:21:08.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:21:18.717 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 06:21:18.717 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429347,ok=429347,error=0, records=41
[WARN ] 2026-06-02 06:21:22.774 [12593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:21:23.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:21:33.722 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 06:21:33.722 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429348,ok=429348,error=0, records=41
[WARN ] 2026-06-02 06:21:37.778 [12593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:21:38.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:21:48.728 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 06:21:48.728 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429349,ok=429349,error=0, records=41
[INFO ] 2026-06-02 06:21:49.108 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21478/300s
[WARN ] 2026-06-02 06:21:52.782 [12600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:21:53.897 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:21:54.486 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21469/300s
[INFO ] 2026-06-02 06:22:03.735 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-02 06:22:03.735 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429350,ok=429350,error=0, records=41
[WARN ] 2026-06-02 06:22:07.788 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:22:08.897 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:22:08.897 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21481/300s
[INFO ] 2026-06-02 06:22:18.742 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 06:22:18.742 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429351,ok=429351,error=0, records=41
[WARN ] 2026-06-02 06:22:22.794 [12600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:22:23.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:22:33.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10344, records=41
[INFO ] 2026-06-02 06:22:33.748 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429352,ok=429352,error=0, records=41
[WARN ] 2026-06-02 06:22:37.799 [12625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:22:38.899 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:22:48.755 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 06:22:48.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429353,ok=429353,error=0, records=41
[WARN ] 2026-06-02 06:22:52.804 [12616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:22:53.899 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:22:56.040 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21479/300s
[INFO ] 2026-06-02 06:22:57.842 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21479/300s
[INFO ] 2026-06-02 06:23:03.149 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17885/300s
[INFO ] 2026-06-02 06:23:03.151 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852484},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:23:03.326 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:23:03.326 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 06:23:03.326 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:23:03.326 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:23:03.326 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:23:03.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:23:03.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:23:03.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:23:03.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:23:03.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:23:03.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:23:03.761 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 06:23:03.761 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429354,ok=429354,error=0, records=41
[INFO ] 2026-06-02 06:23:04.748 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21479/300s
[WARN ] 2026-06-02 06:23:07.809 [12625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:23:08.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:23:18.766 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 06:23:18.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429355,ok=429355,error=0, records=41
[WARN ] 2026-06-02 06:23:22.814 [13198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:23:23.901 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:23:33.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 06:23:33.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429356,ok=429356,error=0, records=41
[WARN ] 2026-06-02 06:23:37.819 [13219] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:23:38.901 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 06:23:38.901 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 06:23:48.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 06:23:48.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429357,ok=429357,error=0, records=41
[WARN ] 2026-06-02 06:23:52.824 [13198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:23:53.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:23:53.902 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 06:24:03.782 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 06:24:03.782 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429358,ok=429358,error=0, records=41
[WARN ] 2026-06-02 06:24:07.829 [13228] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:24:08.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:24:18.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 06:24:18.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429359,ok=429359,error=0, records=41
[WARN ] 2026-06-02 06:24:22.835 [13262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:24:23.904 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:24:33.795 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 06:24:33.795 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429360,ok=429360,error=0, records=41
[WARN ] 2026-06-02 06:24:37.839 [13276] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:24:38.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:24:48.800 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 06:24:48.800 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429361,ok=429361,error=0, records=41
[WARN ] 2026-06-02 06:24:52.844 [12599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:24:53.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:25:01.804 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21483/300s
[INFO ] 2026-06-02 06:25:03.881 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 06:25:03.881 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429362,ok=429362,error=0, records=41
[WARN ] 2026-06-02 06:25:07.849 [13262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:25:08.906 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:25:18.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 06:25:18.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429363,ok=429363,error=0, records=41
[WARN ] 2026-06-02 06:25:22.854 [13312] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:25:23.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:25:26.355 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21474/300s
[INFO ] 2026-06-02 06:25:33.894 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 06:25:33.894 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429364,ok=429364,error=0, records=41
[WARN ] 2026-06-02 06:25:37.859 [13262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:25:38.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:25:42.875 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21483/300s
[INFO ] 2026-06-02 06:25:48.971 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 06:25:48.971 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429365,ok=429365,error=0, records=41
[WARN ] 2026-06-02 06:25:52.864 [13312] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:25:53.908 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:26:03.328 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852416},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:26:03.471 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:26:03.471 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 06:26:03.472 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:26:03.472 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:26:03.472 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:26:03.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:26:03.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:26:03.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:26:03.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:26:03.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:26:03.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:26:03.976 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 06:26:03.976 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429366,ok=429366,error=0, records=41
[INFO ] 2026-06-02 06:26:03.976 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21470/300s
[WARN ] 2026-06-02 06:26:07.868 [13312] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:26:08.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:26:18.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 06:26:18.981 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429367,ok=429367,error=0, records=41
[WARN ] 2026-06-02 06:26:22.875 [13368] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:26:23.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:26:33.986 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 06:26:33.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429368,ok=429368,error=0, records=41
[WARN ] 2026-06-02 06:26:37.880 [13383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:26:38.910 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:26:48.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 06:26:48.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429369,ok=429369,error=0, records=41
[INFO ] 2026-06-02 06:26:49.167 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21479/300s
[WARN ] 2026-06-02 06:26:52.886 [13393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:26:53.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:26:54.672 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21470/300s
[INFO ] 2026-06-02 06:27:03.998 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 06:27:03.998 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429370,ok=429370,error=0, records=41
[WARN ] 2026-06-02 06:27:07.892 [13415] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:27:08.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:27:08.911 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21482/300s
[INFO ] 2026-06-02 06:27:19.005 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 06:27:19.005 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429371,ok=429371,error=0, records=41
[WARN ] 2026-06-02 06:27:22.899 [13438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:27:23.912 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:27:34.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-02 06:27:34.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429372,ok=429372,error=0, records=41
[WARN ] 2026-06-02 06:27:37.905 [13470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:27:38.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:27:49.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 06:27:49.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429373,ok=429373,error=0, records=41
[WARN ] 2026-06-02 06:27:52.910 [13448] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:27:53.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:27:56.133 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21480/300s
[INFO ] 2026-06-02 06:27:57.934 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21480/300s
[INFO ] 2026-06-02 06:28:04.022 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-02 06:28:04.022 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429374,ok=429374,error=0, records=41
[INFO ] 2026-06-02 06:28:04.842 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21480/300s
[WARN ] 2026-06-02 06:28:07.915 [13497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:28:08.914 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:28:19.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 06:28:19.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429375,ok=429375,error=0, records=41
[WARN ] 2026-06-02 06:28:22.922 [13432] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:28:23.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:28:34.033 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 06:28:34.033 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429376,ok=429376,error=0, records=41
[WARN ] 2026-06-02 06:28:37.927 [13432] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:28:38.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:28:49.043 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 06:28:49.043 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429377,ok=429377,error=0, records=41
[WARN ] 2026-06-02 06:28:52.933 [13555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:28:53.916 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:29:03.472 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17886/300s
[INFO ] 2026-06-02 06:29:03.473 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852344},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:29:03.644 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:29:03.644 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 06:29:03.644 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:29:03.644 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:29:03.644 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:29:03.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:29:03.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:29:03.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:29:03.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:29:03.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:29:03.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:29:04.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 06:29:04.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429378,ok=429378,error=0, records=41
[WARN ] 2026-06-02 06:29:07.939 [13514] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:29:08.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:29:19.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 06:29:19.055 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429379,ok=429379,error=0, records=41
[WARN ] 2026-06-02 06:29:22.944 [13589] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:29:23.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:29:34.061 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 06:29:34.061 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429380,ok=429380,error=0, records=41
[WARN ] 2026-06-02 06:29:37.949 [13577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:29:38.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:29:49.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 06:29:49.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429381,ok=429381,error=0, records=41
[WARN ] 2026-06-02 06:29:52.953 [13589] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:29:53.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:30:01.808 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21484/300s
[INFO ] 2026-06-02 06:30:04.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 06:30:04.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429382,ok=429382,error=0, records=41
[WARN ] 2026-06-02 06:30:07.957 [13600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:30:08.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:30:19.080 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10145, records=41
[INFO ] 2026-06-02 06:30:19.080 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429383,ok=429383,error=0, records=41
[WARN ] 2026-06-02 06:30:22.962 [13555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:30:23.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:30:26.462 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21475/300s
[INFO ] 2026-06-02 06:30:34.086 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-02 06:30:34.086 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429384,ok=429384,error=0, records=41
[WARN ] 2026-06-02 06:30:37.966 [13589] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:30:38.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:30:42.882 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21484/300s
[INFO ] 2026-06-02 06:30:49.094 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 06:30:49.094 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429385,ok=429385,error=0, records=41
[WARN ] 2026-06-02 06:30:52.971 [13600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:30:53.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:31:04.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 06:31:04.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429386,ok=429386,error=0, records=41
[INFO ] 2026-06-02 06:31:04.101 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21471/300s
[WARN ] 2026-06-02 06:31:07.977 [13600] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:31:08.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:31:19.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 06:31:19.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429387,ok=429387,error=0, records=41
[WARN ] 2026-06-02 06:31:22.982 [13555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:31:23.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:31:34.112 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 06:31:34.112 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429388,ok=429388,error=0, records=41
[WARN ] 2026-06-02 06:31:37.987 [13633] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:31:38.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:31:49.120 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 06:31:49.120 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429389,ok=429389,error=0, records=41
[INFO ] 2026-06-02 06:31:49.223 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21480/300s
[WARN ] 2026-06-02 06:31:52.992 [13555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:31:53.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:31:54.856 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21471/300s
[INFO ] 2026-06-02 06:32:03.646 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852276},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:32:03.804 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:32:03.804 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 06:32:03.804 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:32:03.804 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:32:03.804 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:32:03.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:32:03.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:32:03.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:32:03.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:32:03.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:32:03.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:32:04.126 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 06:32:04.126 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429390,ok=429390,error=0, records=41
[WARN ] 2026-06-02 06:32:08.000 [13555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:32:08.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:32:08.924 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21483/300s
[INFO ] 2026-06-02 06:32:19.132 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 06:32:19.132 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429391,ok=429391,error=0, records=41
[WARN ] 2026-06-02 06:32:23.006 [13689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:32:23.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:32:34.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 06:32:34.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429392,ok=429392,error=0, records=41
[WARN ] 2026-06-02 06:32:38.010 [13633] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:32:38.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:32:49.145 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 06:32:49.145 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429393,ok=429393,error=0, records=41
[WARN ] 2026-06-02 06:32:53.015 [13772] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:32:53.926 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:32:56.203 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21481/300s
[INFO ] 2026-06-02 06:32:58.005 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21481/300s
[INFO ] 2026-06-02 06:33:04.150 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 06:33:04.150 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429394,ok=429394,error=0, records=41
[INFO ] 2026-06-02 06:33:04.912 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21481/300s
[WARN ] 2026-06-02 06:33:08.021 [13689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:33:08.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:33:19.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 06:33:19.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429395,ok=429395,error=0, records=41
[WARN ] 2026-06-02 06:33:23.026 [13772] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:33:23.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:33:34.163 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 06:33:34.163 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429396,ok=429396,error=0, records=41
[WARN ] 2026-06-02 06:33:38.031 [13555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:33:38.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 06:33:38.928 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 06:33:49.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 06:33:49.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429397,ok=429397,error=0, records=41
[WARN ] 2026-06-02 06:33:53.035 [13847] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:33:53.929 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:34:04.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 06:34:04.175 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429398,ok=429398,error=0, records=41
[WARN ] 2026-06-02 06:34:08.040 [13852] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:34:08.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:34:19.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 06:34:19.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429399,ok=429399,error=0, records=41
[WARN ] 2026-06-02 06:34:23.048 [13858] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:34:23.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:34:34.188 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 06:34:34.188 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429400,ok=429400,error=0, records=41
[WARN ] 2026-06-02 06:34:38.053 [13888] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:34:38.931 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:34:49.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 06:34:49.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429401,ok=429401,error=0, records=41
[WARN ] 2026-06-02 06:34:52.558 [13905] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:34:53.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:35:01.811 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21485/300s
[INFO ] 2026-06-02 06:35:03.805 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17887/300s
[INFO ] 2026-06-02 06:35:03.806 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852208},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:35:03.944 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:35:03.944 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 06:35:03.944 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:35:03.944 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:35:03.944 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:35:03.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:35:03.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:35:03.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:35:03.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:35:03.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:35:03.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:35:04.201 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 06:35:04.201 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429402,ok=429402,error=0, records=41
[WARN ] 2026-06-02 06:35:07.563 [13911] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:35:08.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:35:19.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 06:35:19.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429403,ok=429403,error=0, records=41
[WARN ] 2026-06-02 06:35:22.568 [13929] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:35:23.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:35:26.569 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21476/300s
[INFO ] 2026-06-02 06:35:34.213 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 06:35:34.213 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429404,ok=429404,error=0, records=41
[WARN ] 2026-06-02 06:35:37.573 [13940] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:35:38.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:35:42.889 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21485/300s
[INFO ] 2026-06-02 06:35:49.218 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 06:35:49.218 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429405,ok=429405,error=0, records=41
[WARN ] 2026-06-02 06:35:52.579 [13962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:35:53.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:36:04.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 06:36:04.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429406,ok=429406,error=0, records=41
[INFO ] 2026-06-02 06:36:04.224 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21472/300s
[WARN ] 2026-06-02 06:36:07.584 [13996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:36:08.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:36:19.228 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-02 06:36:19.228 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429407,ok=429407,error=0, records=41
[WARN ] 2026-06-02 06:36:22.589 [14010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:36:23.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:36:34.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-02 06:36:34.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429408,ok=429408,error=0, records=41
[WARN ] 2026-06-02 06:36:37.594 [14010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:36:38.937 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:36:49.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 06:36:49.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429409,ok=429409,error=0, records=41
[INFO ] 2026-06-02 06:36:49.277 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21481/300s
[WARN ] 2026-06-02 06:36:52.598 [13999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:36:53.937 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:36:55.038 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21472/300s
[INFO ] 2026-06-02 06:37:04.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10365, records=41
[INFO ] 2026-06-02 06:37:04.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429410,ok=429410,error=0, records=41
[WARN ] 2026-06-02 06:37:07.604 [14048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:37:08.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:37:08.938 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21484/300s
[INFO ] 2026-06-02 06:37:19.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-02 06:37:19.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429411,ok=429411,error=0, records=41
[WARN ] 2026-06-02 06:37:22.610 [14017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:37:23.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:37:34.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-02 06:37:34.262 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429412,ok=429412,error=0, records=41
[WARN ] 2026-06-02 06:37:37.614 [14017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:37:38.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:37:49.267 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-02 06:37:49.267 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429413,ok=429413,error=0, records=41
[WARN ] 2026-06-02 06:37:52.620 [14042] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:37:53.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:37:56.256 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21482/300s
[INFO ] 2026-06-02 06:37:58.057 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21482/300s
[INFO ] 2026-06-02 06:38:03.945 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852148},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:38:04.109 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:38:04.109 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 06:38:04.109 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:38:04.109 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:38:04.109 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:38:04.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:38:04.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:38:04.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:38:04.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:38:04.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:38:04.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:38:04.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 06:38:04.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429414,ok=429414,error=0, records=41
[INFO ] 2026-06-02 06:38:04.963 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21482/300s
[WARN ] 2026-06-02 06:38:07.626 [14010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:38:08.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:38:19.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 06:38:19.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429415,ok=429415,error=0, records=41
[WARN ] 2026-06-02 06:38:22.633 [13999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:38:23.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:38:34.284 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 06:38:34.284 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429416,ok=429416,error=0, records=41
[WARN ] 2026-06-02 06:38:37.637 [14048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:38:38.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:38:49.290 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 06:38:49.290 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429417,ok=429417,error=0, records=41
[WARN ] 2026-06-02 06:38:52.643 [14017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:38:53.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:38:53.942 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 06:39:04.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 06:39:04.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429418,ok=429418,error=0, records=41
[WARN ] 2026-06-02 06:39:07.649 [14048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:39:08.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:39:19.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 06:39:19.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429419,ok=429419,error=0, records=41
[WARN ] 2026-06-02 06:39:22.654 [14048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:39:23.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:39:34.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 06:39:34.306 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429420,ok=429420,error=0, records=41
[WARN ] 2026-06-02 06:39:37.661 [14010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:39:38.945 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:39:49.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 06:39:49.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429421,ok=429421,error=0, records=41
[WARN ] 2026-06-02 06:39:52.666 [14010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:39:53.945 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:40:01.814 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21486/300s
[INFO ] 2026-06-02 06:40:04.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10396, records=41
[INFO ] 2026-06-02 06:40:04.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429422,ok=429422,error=0, records=41
[WARN ] 2026-06-02 06:40:07.670 [14010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:40:08.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:40:19.323 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10385, records=41
[INFO ] 2026-06-02 06:40:19.323 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429423,ok=429423,error=0, records=41
[WARN ] 2026-06-02 06:40:22.675 [14017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:40:23.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:40:26.677 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21477/300s
[INFO ] 2026-06-02 06:40:34.332 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-02 06:40:34.332 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429424,ok=429424,error=0, records=41
[WARN ] 2026-06-02 06:40:37.681 [14042] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:40:38.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:40:42.895 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21486/300s
[INFO ] 2026-06-02 06:40:49.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-02 06:40:49.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429425,ok=429425,error=0, records=41
[WARN ] 2026-06-02 06:40:52.686 [14042] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:40:53.948 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:41:04.110 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17888/300s
[INFO ] 2026-06-02 06:41:04.111 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852064},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:41:04.312 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:41:04.312 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 06:41:04.312 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:41:04.312 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:41:04.312 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:41:04.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-02 06:41:04.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429426,ok=429426,error=0, records=41
[INFO ] 2026-06-02 06:41:04.348 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21473/300s
[INFO ] 2026-06-02 06:41:04.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:41:04.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:41:04.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:41:04.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:41:04.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:41:04.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 06:41:07.692 [14017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:41:08.948 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:41:19.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-02 06:41:19.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429427,ok=429427,error=0, records=41
[WARN ] 2026-06-02 06:41:22.697 [14048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:41:23.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:41:34.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 06:41:34.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429428,ok=429428,error=0, records=41
[WARN ] 2026-06-02 06:41:37.703 [14010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:41:38.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:41:49.329 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21482/300s
[INFO ] 2026-06-02 06:41:49.362 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 06:41:49.362 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429429,ok=429429,error=0, records=41
[WARN ] 2026-06-02 06:41:52.708 [14042] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:41:53.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:41:55.219 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21473/300s
[INFO ] 2026-06-02 06:42:04.369 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 06:42:04.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429430,ok=429430,error=0, records=41
[WARN ] 2026-06-02 06:42:07.713 [14048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:42:08.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:42:08.951 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21485/300s
[INFO ] 2026-06-02 06:42:19.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 06:42:19.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429431,ok=429431,error=0, records=41
[WARN ] 2026-06-02 06:42:22.719 [13999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:42:23.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:42:34.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 06:42:34.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429432,ok=429432,error=0, records=41
[WARN ] 2026-06-02 06:42:37.724 [13999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:42:38.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:42:49.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 06:42:49.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429433,ok=429433,error=0, records=41
[WARN ] 2026-06-02 06:42:52.730 [14042] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:42:53.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:42:56.295 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21483/300s
[INFO ] 2026-06-02 06:42:58.097 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21483/300s
[INFO ] 2026-06-02 06:43:04.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 06:43:04.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429434,ok=429434,error=0, records=41
[INFO ] 2026-06-02 06:43:05.004 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21483/300s
[WARN ] 2026-06-02 06:43:07.735 [14010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:43:08.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:43:19.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 06:43:19.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429435,ok=429435,error=0, records=41
[WARN ] 2026-06-02 06:43:22.740 [13999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:43:23.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:43:34.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 06:43:34.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429436,ok=429436,error=0, records=41
[WARN ] 2026-06-02 06:43:37.745 [14048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:43:38.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 06:43:38.955 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 06:43:49.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 06:43:49.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429437,ok=429437,error=0, records=41
[WARN ] 2026-06-02 06:43:52.749 [13999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:43:53.955 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:44:04.314 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20852000},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:44:04.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-02 06:44:04.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429438,ok=429438,error=0, records=41
[INFO ] 2026-06-02 06:44:04.459 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:44:04.459 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 06:44:04.459 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:44:04.459 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:44:04.459 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:44:04.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:44:04.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:44:04.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:44:04.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:44:04.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:44:04.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 06:44:07.755 [14048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:44:08.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:44:19.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 06:44:19.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429439,ok=429439,error=0, records=41
[WARN ] 2026-06-02 06:44:22.760 [13999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:44:23.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:44:34.429 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 06:44:34.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429440,ok=429440,error=0, records=41
[WARN ] 2026-06-02 06:44:37.765 [14010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:44:38.957 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:44:49.435 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-02 06:44:49.435 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429441,ok=429441,error=0, records=41
[WARN ] 2026-06-02 06:44:52.769 [14017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:44:53.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:45:01.818 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21487/300s
[INFO ] 2026-06-02 06:45:04.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 06:45:04.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429442,ok=429442,error=0, records=41
[WARN ] 2026-06-02 06:45:07.774 [13999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:45:08.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:45:19.453 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 06:45:19.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429443,ok=429443,error=0, records=41
[WARN ] 2026-06-02 06:45:22.779 [14017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:45:23.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:45:26.780 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21478/300s
[INFO ] 2026-06-02 06:45:34.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 06:45:34.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429444,ok=429444,error=0, records=41
[WARN ] 2026-06-02 06:45:37.785 [14010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:45:38.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:45:42.901 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21487/300s
[INFO ] 2026-06-02 06:45:49.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 06:45:49.465 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429445,ok=429445,error=0, records=41
[WARN ] 2026-06-02 06:45:52.790 [14017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:45:53.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:46:04.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 06:46:04.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429446,ok=429446,error=0, records=41
[INFO ] 2026-06-02 06:46:04.471 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21474/300s
[WARN ] 2026-06-02 06:46:07.796 [14048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:46:08.961 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:46:19.477 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 06:46:19.477 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429447,ok=429447,error=0, records=41
[WARN ] 2026-06-02 06:46:22.801 [14017] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:46:23.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:46:34.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 06:46:34.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429448,ok=429448,error=0, records=41
[WARN ] 2026-06-02 06:46:37.805 [14048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:46:38.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:46:49.382 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21483/300s
[INFO ] 2026-06-02 06:46:49.487 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 06:46:49.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429449,ok=429449,error=0, records=41
[WARN ] 2026-06-02 06:46:52.811 [14048] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:46:53.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:46:55.400 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21474/300s
[INFO ] 2026-06-02 06:47:04.459 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17889/300s
[INFO ] 2026-06-02 06:47:04.461 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851932},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:47:04.493 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-02 06:47:04.493 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429450,ok=429450,error=0, records=41
[INFO ] 2026-06-02 06:47:04.608 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:47:04.608 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 06:47:04.608 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:47:04.608 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:47:04.608 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:47:04.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:47:04.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:47:04.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:47:04.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:47:04.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:47:04.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 06:47:07.817 [14608] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:47:08.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:47:08.963 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21486/300s
[INFO ] 2026-06-02 06:47:19.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 06:47:19.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429451,ok=429451,error=0, records=41
[WARN ] 2026-06-02 06:47:22.823 [13999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:47:23.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=28.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:47:34.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-02 06:47:34.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429452,ok=429452,error=0, records=41
[WARN ] 2026-06-02 06:47:37.828 [14608] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:47:38.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:47:49.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-02 06:47:49.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429453,ok=429453,error=0, records=41
[WARN ] 2026-06-02 06:47:52.835 [14629] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:47:53.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:47:56.339 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21484/300s
[INFO ] 2026-06-02 06:47:58.141 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21484/300s
[INFO ] 2026-06-02 06:48:04.531 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-02 06:48:04.531 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429454,ok=429454,error=0, records=41
[INFO ] 2026-06-02 06:48:05.042 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21484/300s
[WARN ] 2026-06-02 06:48:07.841 [13999] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:48:08.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:48:19.538 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10145, records=41
[INFO ] 2026-06-02 06:48:19.538 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429455,ok=429455,error=0, records=41
[WARN ] 2026-06-02 06:48:22.847 [14608] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:48:23.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:48:34.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10141, records=41
[INFO ] 2026-06-02 06:48:34.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429456,ok=429456,error=0, records=41
[WARN ] 2026-06-02 06:48:37.852 [14608] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:48:38.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:48:49.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 06:48:49.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429457,ok=429457,error=0, records=41
[WARN ] 2026-06-02 06:48:52.858 [14608] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:48:53.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:49:04.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 06:49:04.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429458,ok=429458,error=0, records=41
[WARN ] 2026-06-02 06:49:07.864 [14629] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:49:08.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:49:19.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 06:49:19.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429459,ok=429459,error=0, records=41
[WARN ] 2026-06-02 06:49:22.868 [14629] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:49:23.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:49:34.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-02 06:49:34.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429460,ok=429460,error=0, records=41
[WARN ] 2026-06-02 06:49:37.874 [14629] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:49:38.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:49:49.571 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-02 06:49:49.571 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429461,ok=429461,error=0, records=41
[WARN ] 2026-06-02 06:49:52.878 [14776] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:49:53.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:50:01.821 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21488/300s
[INFO ] 2026-06-02 06:50:04.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 06:50:04.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429462,ok=429462,error=0, records=41
[INFO ] 2026-06-02 06:50:04.610 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851856},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:50:04.794 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:50:04.794 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 06:50:04.794 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:50:04.794 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:50:04.794 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:50:04.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:50:04.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:50:04.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:50:04.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:50:04.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:50:04.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 06:50:07.883 [14788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:50:08.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:50:19.583 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 06:50:19.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429463,ok=429463,error=0, records=41
[WARN ] 2026-06-02 06:50:22.890 [14657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:50:23.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:50:26.891 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21479/300s
[INFO ] 2026-06-02 06:50:34.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 06:50:34.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429464,ok=429464,error=0, records=41
[WARN ] 2026-06-02 06:50:37.895 [14832] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:50:38.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:50:42.907 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21488/300s
[INFO ] 2026-06-02 06:50:49.596 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 06:50:49.596 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429465,ok=429465,error=0, records=41
[WARN ] 2026-06-02 06:50:52.900 [14838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:50:53.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:51:04.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-02 06:51:04.601 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429466,ok=429466,error=0, records=41
[INFO ] 2026-06-02 06:51:04.601 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21475/300s
[WARN ] 2026-06-02 06:51:07.907 [14871] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:51:08.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:51:19.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-02 06:51:19.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429467,ok=429467,error=0, records=41
[WARN ] 2026-06-02 06:51:22.913 [14838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:51:23.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:51:34.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 06:51:34.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429468,ok=429468,error=0, records=41
[WARN ] 2026-06-02 06:51:37.918 [14905] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:51:38.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:51:49.429 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21484/300s
[INFO ] 2026-06-02 06:51:49.623 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-02 06:51:49.623 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429469,ok=429469,error=0, records=41
[WARN ] 2026-06-02 06:51:52.924 [14922] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:51:53.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:51:55.574 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21475/300s
[INFO ] 2026-06-02 06:52:04.628 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 06:52:04.628 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429470,ok=429470,error=0, records=41
[WARN ] 2026-06-02 06:52:07.929 [14927] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:52:08.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:52:08.975 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21487/300s
[INFO ] 2026-06-02 06:52:19.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 06:52:19.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429471,ok=429471,error=0, records=41
[WARN ] 2026-06-02 06:52:22.934 [14932] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:52:23.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:52:34.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-02 06:52:34.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429472,ok=429472,error=0, records=41
[WARN ] 2026-06-02 06:52:37.940 [14922] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:52:38.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:52:49.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 06:52:49.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429473,ok=429473,error=0, records=41
[WARN ] 2026-06-02 06:52:52.946 [14986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:52:53.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:52:56.372 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21485/300s
[INFO ] 2026-06-02 06:52:58.173 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21485/300s
[INFO ] 2026-06-02 06:53:04.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 06:53:04.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429474,ok=429474,error=0, records=41
[INFO ] 2026-06-02 06:53:04.794 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17890/300s
[INFO ] 2026-06-02 06:53:04.796 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851796},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:53:05.006 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:53:05.006 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 06:53:05.006 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:53:05.006 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:53:05.006 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:53:05.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:53:05.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:53:05.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:53:05.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:53:05.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:53:05.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:53:05.079 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21485/300s
[WARN ] 2026-06-02 06:53:07.950 [14986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:53:08.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:53:19.654 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 06:53:19.654 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429475,ok=429475,error=0, records=41
[WARN ] 2026-06-02 06:53:22.954 [14996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:53:23.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:53:34.693 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 06:53:34.693 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429476,ok=429476,error=0, records=41
[WARN ] 2026-06-02 06:53:37.959 [14991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:53:38.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 06:53:38.978 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 06:53:49.698 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 06:53:49.698 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429477,ok=429477,error=0, records=41
[WARN ] 2026-06-02 06:53:52.963 [14986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:53:53.979 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:53:53.979 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 06:54:04.705 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 06:54:04.705 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429478,ok=429478,error=0, records=41
[WARN ] 2026-06-02 06:54:07.967 [14991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:54:08.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:54:19.710 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 06:54:19.710 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429479,ok=429479,error=0, records=41
[WARN ] 2026-06-02 06:54:22.972 [14991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:54:23.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:54:34.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 06:54:34.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429480,ok=429480,error=0, records=41
[WARN ] 2026-06-02 06:54:37.977 [14922] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:54:38.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:54:49.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 06:54:49.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429481,ok=429481,error=0, records=41
[WARN ] 2026-06-02 06:54:52.982 [14986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:54:53.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:55:01.824 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21489/300s
[INFO ] 2026-06-02 06:55:04.726 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-02 06:55:04.726 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429482,ok=429482,error=0, records=41
[WARN ] 2026-06-02 06:55:07.987 [15066] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:55:08.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:55:19.731 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10797, records=44
[INFO ] 2026-06-02 06:55:19.731 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429483,ok=429483,error=0, records=44
[WARN ] 2026-06-02 06:55:22.992 [15066] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:55:23.983 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:55:26.993 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21480/300s
[INFO ] 2026-06-02 06:55:34.736 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 06:55:34.736 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429484,ok=429484,error=0, records=41
[WARN ] 2026-06-02 06:55:37.997 [15120] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:55:38.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:55:42.913 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21489/300s
[INFO ] 2026-06-02 06:55:49.742 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-02 06:55:49.742 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429485,ok=429485,error=0, records=41
[WARN ] 2026-06-02 06:55:53.003 [14991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:55:53.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:56:04.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 06:56:04.748 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429486,ok=429486,error=0, records=41
[INFO ] 2026-06-02 06:56:04.748 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21476/300s
[INFO ] 2026-06-02 06:56:05.008 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851732},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:56:05.164 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:56:05.164 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 06:56:05.165 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:56:05.165 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:56:05.165 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:56:05.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:56:05.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:56:05.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:56:05.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:56:05.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:56:05.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 06:56:08.008 [14922] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:56:08.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:56:19.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 06:56:19.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429487,ok=429487,error=0, records=41
[WARN ] 2026-06-02 06:56:23.012 [15161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:56:23.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:56:34.759 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 06:56:34.759 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429488,ok=429488,error=0, records=41
[WARN ] 2026-06-02 06:56:38.016 [15190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:56:38.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:56:49.484 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21485/300s
[INFO ] 2026-06-02 06:56:49.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 06:56:49.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429489,ok=429489,error=0, records=41
[WARN ] 2026-06-02 06:56:53.022 [15204] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:56:53.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:56:55.754 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21476/300s
[INFO ] 2026-06-02 06:57:04.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 06:57:04.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429490,ok=429490,error=0, records=41
[WARN ] 2026-06-02 06:57:08.026 [15176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:57:08.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:57:08.987 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21488/300s
[INFO ] 2026-06-02 06:57:19.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 06:57:19.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429491,ok=429491,error=0, records=41
[WARN ] 2026-06-02 06:57:23.031 [15176] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:57:23.988 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:57:34.807 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 06:57:34.807 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429492,ok=429492,error=0, records=41
[WARN ] 2026-06-02 06:57:38.036 [15161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:57:38.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:57:49.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 06:57:49.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429493,ok=429493,error=0, records=41
[WARN ] 2026-06-02 06:57:53.040 [15204] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:57:53.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:57:56.429 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21486/300s
[INFO ] 2026-06-02 06:57:58.230 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21486/300s
[INFO ] 2026-06-02 06:58:04.816 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 06:58:04.817 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429494,ok=429494,error=0, records=41
[INFO ] 2026-06-02 06:58:05.137 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21486/300s
[WARN ] 2026-06-02 06:58:08.045 [15281] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:58:08.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:58:19.822 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 06:58:19.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429495,ok=429495,error=0, records=41
[WARN ] 2026-06-02 06:58:23.050 [15297] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:58:23.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:58:34.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 06:58:34.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429496,ok=429496,error=0, records=41
[WARN ] 2026-06-02 06:58:37.555 [15313] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:58:38.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:58:49.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 06:58:49.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429497,ok=429497,error=0, records=41
[WARN ] 2026-06-02 06:58:52.561 [15328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:58:53.992 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:59:04.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 06:59:04.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429498,ok=429498,error=0, records=41
[INFO ] 2026-06-02 06:59:05.165 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17891/300s
[INFO ] 2026-06-02 06:59:05.166 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851668},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 06:59:05.332 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 06:59:05.332 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 06:59:05.332 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 06:59:05.333 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 06:59:05.333 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 06:59:05.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:59:05.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 06:59:05.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:59:05.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 06:59:05.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 06:59:05.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 06:59:07.567 [15346] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:59:08.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:59:19.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 06:59:19.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429499,ok=429499,error=0, records=41
[WARN ] 2026-06-02 06:59:22.573 [15372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:59:23.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:59:34.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 06:59:34.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429500,ok=429500,error=0, records=41
[WARN ] 2026-06-02 06:59:37.578 [15382] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:59:38.994 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 06:59:49.876 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 06:59:49.876 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429501,ok=429501,error=0, records=41
[WARN ] 2026-06-02 06:59:52.582 [15401] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 06:59:53.994 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:00:01.828 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21490/300s
[INFO ] 2026-06-02 07:00:04.881 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 07:00:04.881 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429502,ok=429502,error=0, records=41
[WARN ] 2026-06-02 07:00:07.588 [15412] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:00:08.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:00:19.886 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 07:00:19.886 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429503,ok=429503,error=0, records=41
[WARN ] 2026-06-02 07:00:22.593 [15445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:00:23.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:00:27.094 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21481/300s
[INFO ] 2026-06-02 07:00:34.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 07:00:34.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429504,ok=429504,error=0, records=41
[WARN ] 2026-06-02 07:00:37.598 [15439] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:00:38.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:00:42.920 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21490/300s
[INFO ] 2026-06-02 07:00:49.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 07:00:49.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429505,ok=429505,error=0, records=41
[WARN ] 2026-06-02 07:00:52.603 [15430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:00:53.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:01:04.903 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 07:01:04.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429506,ok=429506,error=0, records=41
[INFO ] 2026-06-02 07:01:04.903 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21477/300s
[WARN ] 2026-06-02 07:01:07.607 [15430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:01:08.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:01:19.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 07:01:19.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429507,ok=429507,error=0, records=41
[WARN ] 2026-06-02 07:01:22.611 [15429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:01:23.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:01:34.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 07:01:34.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429508,ok=429508,error=0, records=41
[WARN ] 2026-06-02 07:01:37.616 [15445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:01:38.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:01:49.542 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21486/300s
[INFO ] 2026-06-02 07:01:49.922 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 07:01:49.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429509,ok=429509,error=0, records=41
[WARN ] 2026-06-02 07:01:52.621 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:01:54.000 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:01:55.937 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21477/300s
[INFO ] 2026-06-02 07:02:04.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 07:02:04.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429510,ok=429510,error=0, records=41
[INFO ] 2026-06-02 07:02:05.334 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851600},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:02:05.485 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:02:05.485 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 07:02:05.485 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:02:05.485 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:02:05.485 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:02:05.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:02:05.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:02:05.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:02:05.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:02:05.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:02:05.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:02:07.626 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:02:09.000 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:02:09.000 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21489/300s
[INFO ] 2026-06-02 07:02:19.939 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 07:02:19.939 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429511,ok=429511,error=0, records=41
[WARN ] 2026-06-02 07:02:22.632 [15430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:02:24.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:02:34.947 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 07:02:34.947 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429512,ok=429512,error=0, records=41
[WARN ] 2026-06-02 07:02:37.637 [15460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:02:39.002 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:02:49.952 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 07:02:49.952 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429513,ok=429513,error=0, records=41
[WARN ] 2026-06-02 07:02:52.642 [15460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:02:54.002 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:02:56.491 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21487/300s
[INFO ] 2026-06-02 07:02:58.292 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21487/300s
[INFO ] 2026-06-02 07:03:04.959 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 07:03:04.959 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429514,ok=429514,error=0, records=41
[INFO ] 2026-06-02 07:03:05.199 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21487/300s
[WARN ] 2026-06-02 07:03:07.648 [15429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:03:09.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:03:20.070 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 07:03:20.070 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429515,ok=429515,error=0, records=41
[WARN ] 2026-06-02 07:03:22.652 [15430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:03:24.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:03:35.075 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 07:03:35.075 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429516,ok=429516,error=0, records=41
[WARN ] 2026-06-02 07:03:37.657 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:03:39.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 07:03:39.004 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 07:03:50.080 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 07:03:50.080 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429517,ok=429517,error=0, records=41
[WARN ] 2026-06-02 07:03:52.663 [15429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:03:54.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:04:05.138 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-02 07:04:05.138 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429518,ok=429518,error=0, records=41
[WARN ] 2026-06-02 07:04:07.667 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:04:09.006 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:04:20.145 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 07:04:20.145 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429519,ok=429519,error=0, records=41
[WARN ] 2026-06-02 07:04:22.672 [15460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:04:24.006 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:04:35.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 07:04:35.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429520,ok=429520,error=0, records=41
[WARN ] 2026-06-02 07:04:37.676 [15429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:04:39.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:04:50.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 07:04:50.158 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429521,ok=429521,error=0, records=41
[WARN ] 2026-06-02 07:04:52.680 [15429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:04:54.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:05:01.831 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21491/300s
[INFO ] 2026-06-02 07:05:05.163 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-02 07:05:05.163 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429522,ok=429522,error=0, records=41
[INFO ] 2026-06-02 07:05:05.486 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17892/300s
[INFO ] 2026-06-02 07:05:05.487 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851532},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:05:05.650 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:05:05.651 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 07:05:05.651 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:05:05.651 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:05:05.651 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:05:05.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:05:05.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:05:05.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:05:05.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:05:05.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:05:05.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:05:07.686 [15430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:05:09.008 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:05:20.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 07:05:20.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429523,ok=429523,error=0, records=41
[WARN ] 2026-06-02 07:05:22.692 [15429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:05:24.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:05:27.194 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21482/300s
[INFO ] 2026-06-02 07:05:35.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 07:05:35.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429524,ok=429524,error=0, records=41
[WARN ] 2026-06-02 07:05:37.698 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:05:39.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:05:42.926 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21491/300s
[INFO ] 2026-06-02 07:05:50.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 07:05:50.186 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429525,ok=429525,error=0, records=41
[WARN ] 2026-06-02 07:05:52.703 [15429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:05:54.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:06:05.191 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-02 07:06:05.191 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429526,ok=429526,error=0, records=41
[INFO ] 2026-06-02 07:06:05.191 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21478/300s
[WARN ] 2026-06-02 07:06:07.708 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:06:09.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:06:20.198 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-02 07:06:20.198 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429527,ok=429527,error=0, records=41
[WARN ] 2026-06-02 07:06:22.714 [15460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:06:24.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:06:35.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-02 07:06:35.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429528,ok=429528,error=0, records=41
[WARN ] 2026-06-02 07:06:37.720 [15429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:06:39.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:06:49.595 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21487/300s
[INFO ] 2026-06-02 07:06:50.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-02 07:06:50.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429529,ok=429529,error=0, records=41
[WARN ] 2026-06-02 07:06:52.725 [15430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:06:54.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:06:56.120 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21478/300s
[INFO ] 2026-06-02 07:07:05.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 07:07:05.226 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429530,ok=429530,error=0, records=41
[WARN ] 2026-06-02 07:07:07.729 [15445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:07:09.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:07:09.013 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21490/300s
[INFO ] 2026-06-02 07:07:20.232 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-02 07:07:20.232 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429531,ok=429531,error=0, records=41
[WARN ] 2026-06-02 07:07:22.733 [15460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:07:24.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:07:35.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 07:07:35.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429532,ok=429532,error=0, records=41
[WARN ] 2026-06-02 07:07:37.739 [15430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:07:39.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:07:50.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 07:07:50.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429533,ok=429533,error=0, records=41
[WARN ] 2026-06-02 07:07:52.744 [15460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:07:54.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:07:56.549 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21488/300s
[INFO ] 2026-06-02 07:07:58.350 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21488/300s
[INFO ] 2026-06-02 07:08:05.250 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 07:08:05.250 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429534,ok=429534,error=0, records=41
[INFO ] 2026-06-02 07:08:05.257 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21488/300s
[INFO ] 2026-06-02 07:08:05.652 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851468},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:08:05.821 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:08:05.821 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 07:08:05.821 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:08:05.821 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:08:05.821 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:08:05.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:08:05.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:08:05.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:08:05.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:08:05.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:08:05.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:08:07.748 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:08:09.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:08:20.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 07:08:20.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429535,ok=429535,error=0, records=41
[WARN ] 2026-06-02 07:08:22.755 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:08:24.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:08:35.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 07:08:35.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429536,ok=429536,error=0, records=41
[WARN ] 2026-06-02 07:08:37.760 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:08:39.017 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:08:50.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 07:08:50.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429537,ok=429537,error=0, records=41
[WARN ] 2026-06-02 07:08:52.765 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:08:54.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:08:54.018 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 07:09:05.271 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 07:09:05.271 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429538,ok=429538,error=0, records=41
[WARN ] 2026-06-02 07:09:07.769 [15460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:09:09.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:09:20.276 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 07:09:20.276 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429539,ok=429539,error=0, records=41
[WARN ] 2026-06-02 07:09:22.775 [15430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:09:24.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:09:35.283 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 07:09:35.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429540,ok=429540,error=0, records=41
[WARN ] 2026-06-02 07:09:37.780 [15460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:09:39.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:09:50.289 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 07:09:50.289 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429541,ok=429541,error=0, records=41
[WARN ] 2026-06-02 07:09:52.785 [15460] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:09:54.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:10:01.835 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21492/300s
[INFO ] 2026-06-02 07:10:05.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=13032, records=54
[INFO ] 2026-06-02 07:10:05.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429542,ok=429542,error=0, records=54
[WARN ] 2026-06-02 07:10:07.789 [15430] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:10:09.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:10:20.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 07:10:20.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429543,ok=429543,error=0, records=41
[WARN ] 2026-06-02 07:10:22.793 [15445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:10:24.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:10:27.295 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21483/300s
[INFO ] 2026-06-02 07:10:35.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 07:10:35.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429544,ok=429544,error=0, records=41
[WARN ] 2026-06-02 07:10:37.799 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:10:39.023 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:10:42.933 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21492/300s
[INFO ] 2026-06-02 07:10:50.312 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 07:10:50.312 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429545,ok=429545,error=0, records=41
[WARN ] 2026-06-02 07:10:52.804 [15429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:10:54.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:11:05.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 07:11:05.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429546,ok=429546,error=0, records=41
[INFO ] 2026-06-02 07:11:05.318 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21479/300s
[INFO ] 2026-06-02 07:11:05.821 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17893/300s
[INFO ] 2026-06-02 07:11:05.823 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851404},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:11:05.973 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:11:05.973 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 07:11:05.974 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:11:05.974 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:11:05.974 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:11:06.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:11:06.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:11:06.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:11:06.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:11:06.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:11:06.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:11:07.809 [15429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:11:09.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:11:20.324 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 07:11:20.324 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429547,ok=429547,error=0, records=41
[WARN ] 2026-06-02 07:11:22.815 [16049] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:11:24.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:11:35.329 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 07:11:35.329 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429548,ok=429548,error=0, records=41
[WARN ] 2026-06-02 07:11:37.820 [15445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:11:39.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:11:49.654 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21488/300s
[INFO ] 2026-06-02 07:11:50.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 07:11:50.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429549,ok=429549,error=0, records=41
[WARN ] 2026-06-02 07:11:52.825 [16035] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:11:54.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:11:56.302 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21479/300s
[INFO ] 2026-06-02 07:12:05.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-02 07:12:05.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429550,ok=429550,error=0, records=41
[WARN ] 2026-06-02 07:12:07.830 [15445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:12:09.027 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:12:09.027 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21491/300s
[INFO ] 2026-06-02 07:12:20.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-02 07:12:20.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429551,ok=429551,error=0, records=41
[WARN ] 2026-06-02 07:12:22.835 [16091] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:12:24.027 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:12:35.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10389, records=41
[INFO ] 2026-06-02 07:12:35.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429552,ok=429552,error=0, records=41
[WARN ] 2026-06-02 07:12:37.842 [16091] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:12:39.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:12:50.396 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-02 07:12:50.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429553,ok=429553,error=0, records=41
[WARN ] 2026-06-02 07:12:52.847 [16106] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:12:54.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=28.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:12:56.604 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21489/300s
[INFO ] 2026-06-02 07:12:58.405 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21489/300s
[INFO ] 2026-06-02 07:13:05.309 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21489/300s
[INFO ] 2026-06-02 07:13:05.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 07:13:05.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429554,ok=429554,error=0, records=41
[WARN ] 2026-06-02 07:13:07.853 [16106] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:13:09.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:13:20.405 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 07:13:20.405 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429555,ok=429555,error=0, records=41
[WARN ] 2026-06-02 07:13:22.860 [16115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:13:24.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:13:35.419 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 07:13:35.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429556,ok=429556,error=0, records=41
[WARN ] 2026-06-02 07:13:37.867 [16156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:13:39.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 07:13:39.030 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 07:13:50.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 07:13:50.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429557,ok=429557,error=0, records=41
[WARN ] 2026-06-02 07:13:52.873 [16142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:13:54.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:14:05.431 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-02 07:14:05.431 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429558,ok=429558,error=0, records=41
[INFO ] 2026-06-02 07:14:05.975 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851328},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:14:06.130 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:14:06.130 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 07:14:06.130 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:14:06.130 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:14:06.130 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:14:06.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:14:06.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:14:06.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:14:06.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:14:06.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:14:06.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:14:07.878 [16206] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:14:09.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:14:20.541 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-02 07:14:20.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429559,ok=429559,error=0, records=41
[WARN ] 2026-06-02 07:14:22.882 [16223] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:14:24.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:14:35.546 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 07:14:35.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429560,ok=429560,error=0, records=41
[WARN ] 2026-06-02 07:14:37.887 [16240] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:14:39.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:14:50.551 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 07:14:50.551 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429561,ok=429561,error=0, records=41
[WARN ] 2026-06-02 07:14:52.891 [16250] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:14:54.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:15:01.838 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21493/300s
[INFO ] 2026-06-02 07:15:05.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 07:15:05.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429562,ok=429562,error=0, records=41
[WARN ] 2026-06-02 07:15:07.895 [16250] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:15:09.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:15:20.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 07:15:20.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429563,ok=429563,error=0, records=41
[WARN ] 2026-06-02 07:15:22.900 [16228] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:15:24.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:15:27.402 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21484/300s
[INFO ] 2026-06-02 07:15:35.567 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 07:15:35.567 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429564,ok=429564,error=0, records=41
[WARN ] 2026-06-02 07:15:37.906 [16306] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:15:39.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:15:42.939 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21493/300s
[INFO ] 2026-06-02 07:15:50.573 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 07:15:50.573 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429565,ok=429565,error=0, records=41
[WARN ] 2026-06-02 07:15:52.912 [16299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:15:54.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:16:05.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-02 07:16:05.579 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429566,ok=429566,error=0, records=41
[INFO ] 2026-06-02 07:16:05.579 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21480/300s
[WARN ] 2026-06-02 07:16:07.918 [16338] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:16:09.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:16:20.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-02 07:16:20.584 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429567,ok=429567,error=0, records=41
[WARN ] 2026-06-02 07:16:22.923 [16348] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:16:24.037 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:16:35.591 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-02 07:16:35.591 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429568,ok=429568,error=0, records=41
[WARN ] 2026-06-02 07:16:37.929 [16366] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:16:39.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:16:49.706 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21489/300s
[INFO ] 2026-06-02 07:16:50.597 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-02 07:16:50.597 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429569,ok=429569,error=0, records=41
[WARN ] 2026-06-02 07:16:52.934 [16383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:16:54.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:16:56.479 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21480/300s
[INFO ] 2026-06-02 07:17:05.602 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 07:17:05.602 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429570,ok=429570,error=0, records=41
[INFO ] 2026-06-02 07:17:06.130 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17894/300s
[INFO ] 2026-06-02 07:17:06.132 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851264},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:17:06.305 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:17:06.306 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 07:17:06.306 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:17:06.306 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:17:06.306 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:17:06.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:17:06.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:17:06.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:17:06.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:17:06.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:17:06.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:17:07.940 [16406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:17:09.039 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:17:09.039 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21492/300s
[INFO ] 2026-06-02 07:17:20.609 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 07:17:20.609 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429571,ok=429571,error=0, records=41
[WARN ] 2026-06-02 07:17:22.946 [16417] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:17:24.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:17:35.623 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 07:17:35.623 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429572,ok=429572,error=0, records=41
[WARN ] 2026-06-02 07:17:37.952 [16416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:17:39.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:17:50.629 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 07:17:50.629 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429573,ok=429573,error=0, records=41
[WARN ] 2026-06-02 07:17:52.957 [16428] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:17:54.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:17:56.655 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21490/300s
[INFO ] 2026-06-02 07:17:58.456 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21490/300s
[INFO ] 2026-06-02 07:18:05.363 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21490/300s
[INFO ] 2026-06-02 07:18:05.636 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 07:18:05.636 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429574,ok=429574,error=0, records=41
[WARN ] 2026-06-02 07:18:07.961 [16399] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:18:09.042 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:18:20.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 07:18:20.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429575,ok=429575,error=0, records=41
[WARN ] 2026-06-02 07:18:22.966 [16428] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:18:24.042 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:18:35.654 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 07:18:35.654 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429576,ok=429576,error=0, records=41
[WARN ] 2026-06-02 07:18:37.970 [16428] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:18:39.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:18:50.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 07:18:50.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429577,ok=429577,error=0, records=41
[WARN ] 2026-06-02 07:18:52.976 [16399] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:18:54.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:19:05.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 07:19:05.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429578,ok=429578,error=0, records=41
[WARN ] 2026-06-02 07:19:07.980 [16399] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:19:09.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:19:20.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 07:19:20.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429579,ok=429579,error=0, records=41
[WARN ] 2026-06-02 07:19:22.985 [16447] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:19:24.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:19:35.677 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 07:19:35.678 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429580,ok=429580,error=0, records=41
[WARN ] 2026-06-02 07:19:37.990 [16530] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:19:39.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:19:50.682 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 07:19:50.682 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429581,ok=429581,error=0, records=41
[WARN ] 2026-06-02 07:19:52.994 [16502] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:19:54.046 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:20:01.842 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21494/300s
[INFO ] 2026-06-02 07:20:05.687 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 07:20:05.687 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429582,ok=429582,error=0, records=41
[INFO ] 2026-06-02 07:20:06.307 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851196},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:20:06.473 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:20:06.473 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 07:20:06.473 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:20:06.473 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:20:06.473 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:20:06.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:20:06.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:20:06.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:20:06.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:20:06.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:20:06.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:20:07.998 [16530] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:20:09.046 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:20:20.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 07:20:20.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429583,ok=429583,error=0, records=41
[WARN ] 2026-06-02 07:20:23.003 [16592] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:20:24.047 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:20:27.504 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21485/300s
[INFO ] 2026-06-02 07:20:35.697 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 07:20:35.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429584,ok=429584,error=0, records=41
[WARN ] 2026-06-02 07:20:38.007 [16502] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:20:39.047 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:20:42.945 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21494/300s
[INFO ] 2026-06-02 07:20:50.701 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 07:20:50.701 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429585,ok=429585,error=0, records=41
[WARN ] 2026-06-02 07:20:53.012 [16544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:20:54.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:21:05.707 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 07:21:05.707 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429586,ok=429586,error=0, records=41
[INFO ] 2026-06-02 07:21:05.707 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21481/300s
[WARN ] 2026-06-02 07:21:08.017 [16606] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:21:09.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:21:20.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-02 07:21:20.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429587,ok=429587,error=0, records=41
[WARN ] 2026-06-02 07:21:23.021 [16399] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:21:24.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:21:35.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-02 07:21:35.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429588,ok=429588,error=0, records=41
[WARN ] 2026-06-02 07:21:38.027 [16502] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:21:39.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:21:49.763 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21490/300s
[INFO ] 2026-06-02 07:21:50.724 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-02 07:21:50.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429589,ok=429589,error=0, records=41
[WARN ] 2026-06-02 07:21:53.031 [16606] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:21:54.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:21:56.663 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21481/300s
[INFO ] 2026-06-02 07:22:05.729 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 07:22:05.729 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429590,ok=429590,error=0, records=41
[WARN ] 2026-06-02 07:22:08.038 [16690] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:22:09.051 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:22:09.051 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21493/300s
[INFO ] 2026-06-02 07:22:20.742 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 07:22:20.742 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429591,ok=429591,error=0, records=41
[WARN ] 2026-06-02 07:22:23.044 [16714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:22:24.051 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:22:35.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 07:22:35.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429592,ok=429592,error=0, records=41
[WARN ] 2026-06-02 07:22:38.049 [16714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:22:39.052 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:22:50.752 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 07:22:50.752 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429593,ok=429593,error=0, records=41
[WARN ] 2026-06-02 07:22:52.554 [16742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:22:54.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:22:56.706 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21491/300s
[INFO ] 2026-06-02 07:22:58.507 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21491/300s
[INFO ] 2026-06-02 07:23:05.413 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21491/300s
[INFO ] 2026-06-02 07:23:05.763 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 07:23:05.763 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429594,ok=429594,error=0, records=41
[INFO ] 2026-06-02 07:23:06.473 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17895/300s
[INFO ] 2026-06-02 07:23:06.475 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851128},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:23:06.639 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:23:06.640 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 07:23:06.640 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:23:06.640 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:23:06.640 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:23:06.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:23:06.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:23:06.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:23:06.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:23:06.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:23:06.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:23:07.559 [16759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:23:09.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:23:20.769 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 07:23:20.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429595,ok=429595,error=0, records=41
[WARN ] 2026-06-02 07:23:22.565 [16778] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:23:24.054 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:23:35.774 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 07:23:35.774 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429596,ok=429596,error=0, records=41
[WARN ] 2026-06-02 07:23:37.569 [16802] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:23:39.054 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 07:23:39.055 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 07:23:50.779 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 07:23:50.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429597,ok=429597,error=0, records=41
[WARN ] 2026-06-02 07:23:52.575 [16785] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:23:54.055 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:23:54.055 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 07:24:05.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 07:24:05.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429598,ok=429598,error=0, records=41
[WARN ] 2026-06-02 07:24:07.580 [16837] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:24:09.057 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:24:20.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 07:24:20.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429599,ok=429599,error=0, records=41
[WARN ] 2026-06-02 07:24:22.585 [16848] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:24:24.057 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:24:35.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 07:24:35.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429600,ok=429600,error=0, records=41
[WARN ] 2026-06-02 07:24:37.591 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:24:39.058 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:24:50.801 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 07:24:50.801 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429601,ok=429601,error=0, records=41
[WARN ] 2026-06-02 07:24:52.596 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:24:54.059 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:25:01.845 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21495/300s
[INFO ] 2026-06-02 07:25:05.807 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-02 07:25:05.807 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429602,ok=429602,error=0, records=41
[WARN ] 2026-06-02 07:25:07.601 [16838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:25:09.059 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:25:20.813 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-02 07:25:20.813 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429603,ok=429603,error=0, records=41
[WARN ] 2026-06-02 07:25:22.606 [16838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:25:24.060 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:25:27.607 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21486/300s
[INFO ] 2026-06-02 07:25:35.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-02 07:25:35.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429604,ok=429604,error=0, records=41
[WARN ] 2026-06-02 07:25:37.611 [16865] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:25:39.060 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:25:42.952 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21495/300s
[INFO ] 2026-06-02 07:25:50.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 07:25:50.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429605,ok=429605,error=0, records=41
[WARN ] 2026-06-02 07:25:52.616 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:25:54.061 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:26:05.836 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-02 07:26:05.836 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429606,ok=429606,error=0, records=41
[INFO ] 2026-06-02 07:26:05.836 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21482/300s
[INFO ] 2026-06-02 07:26:06.641 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851068},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:26:06.805 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:26:06.805 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 07:26:06.805 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:26:06.805 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:26:06.805 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:26:06.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:26:06.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:26:06.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:26:06.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:26:06.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:26:06.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:26:07.621 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:26:09.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:26:20.842 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 07:26:20.842 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429607,ok=429607,error=0, records=41
[WARN ] 2026-06-02 07:26:22.626 [16838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:26:24.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:26:35.848 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 07:26:35.848 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429608,ok=429608,error=0, records=41
[WARN ] 2026-06-02 07:26:37.631 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:26:39.063 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:26:49.817 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21491/300s
[INFO ] 2026-06-02 07:26:50.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 07:26:50.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429609,ok=429609,error=0, records=41
[WARN ] 2026-06-02 07:26:52.636 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:26:54.064 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:26:56.846 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21482/300s
[INFO ] 2026-06-02 07:27:05.860 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 07:27:05.860 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429610,ok=429610,error=0, records=41
[WARN ] 2026-06-02 07:27:07.641 [16865] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:27:09.064 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:27:09.064 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21494/300s
[INFO ] 2026-06-02 07:27:20.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 07:27:20.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429611,ok=429611,error=0, records=41
[WARN ] 2026-06-02 07:27:22.647 [16838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:27:24.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:27:35.872 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 07:27:35.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429612,ok=429612,error=0, records=41
[WARN ] 2026-06-02 07:27:37.652 [16859] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:27:39.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:27:50.883 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 07:27:50.883 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429613,ok=429613,error=0, records=41
[WARN ] 2026-06-02 07:27:52.657 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:27:54.066 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:27:56.771 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21492/300s
[INFO ] 2026-06-02 07:27:58.572 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21492/300s
[INFO ] 2026-06-02 07:28:05.478 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21492/300s
[INFO ] 2026-06-02 07:28:05.888 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 07:28:05.888 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429614,ok=429614,error=0, records=41
[WARN ] 2026-06-02 07:28:07.665 [16865] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:28:09.067 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:28:20.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 07:28:20.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429615,ok=429615,error=0, records=41
[WARN ] 2026-06-02 07:28:22.670 [16859] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:28:24.067 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:28:35.901 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 07:28:35.901 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429616,ok=429616,error=0, records=41
[WARN ] 2026-06-02 07:28:37.675 [16885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:28:39.068 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:28:50.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-02 07:28:50.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429617,ok=429617,error=0, records=41
[WARN ] 2026-06-02 07:28:52.679 [16838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:28:54.068 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:29:05.912 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 07:29:05.912 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429618,ok=429618,error=0, records=41
[INFO ] 2026-06-02 07:29:06.806 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17896/300s
[INFO ] 2026-06-02 07:29:06.807 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20851004},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:29:06.979 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:29:06.979 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 07:29:06.980 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:29:06.980 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:29:06.980 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:29:07.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:29:07.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:29:07.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:29:07.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:29:07.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:29:07.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:29:07.684 [16865] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:29:09.069 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:29:20.917 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 07:29:20.917 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429619,ok=429619,error=0, records=41
[WARN ] 2026-06-02 07:29:22.689 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:29:24.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:29:35.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 07:29:35.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429620,ok=429620,error=0, records=41
[WARN ] 2026-06-02 07:29:37.693 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:29:39.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:29:50.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 07:29:50.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429621,ok=429621,error=0, records=41
[WARN ] 2026-06-02 07:29:52.699 [16885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:29:54.071 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:30:01.848 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21496/300s
[INFO ] 2026-06-02 07:30:05.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-02 07:30:05.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429622,ok=429622,error=0, records=41
[WARN ] 2026-06-02 07:30:07.706 [16885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:30:09.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:30:20.945 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-02 07:30:20.945 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429623,ok=429623,error=0, records=41
[WARN ] 2026-06-02 07:30:22.710 [16885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:30:24.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:30:27.711 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21487/300s
[INFO ] 2026-06-02 07:30:35.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-02 07:30:35.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429624,ok=429624,error=0, records=41
[WARN ] 2026-06-02 07:30:37.714 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:30:39.073 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:30:42.959 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21496/300s
[INFO ] 2026-06-02 07:30:50.965 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-02 07:30:50.965 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429625,ok=429625,error=0, records=41
[WARN ] 2026-06-02 07:30:52.720 [16865] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:30:54.074 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:31:05.971 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 07:31:05.971 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429626,ok=429626,error=0, records=41
[INFO ] 2026-06-02 07:31:05.971 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21483/300s
[WARN ] 2026-06-02 07:31:07.725 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:31:09.074 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:31:20.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 07:31:20.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429627,ok=429627,error=0, records=41
[WARN ] 2026-06-02 07:31:22.730 [16859] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:31:24.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:31:35.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-02 07:31:35.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429628,ok=429628,error=0, records=41
[WARN ] 2026-06-02 07:31:37.736 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:31:39.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:31:49.873 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21492/300s
[INFO ] 2026-06-02 07:31:50.987 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 07:31:50.987 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429629,ok=429629,error=0, records=41
[WARN ] 2026-06-02 07:31:52.741 [16859] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:31:54.076 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:31:57.029 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21483/300s
[INFO ] 2026-06-02 07:32:05.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 07:32:05.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429630,ok=429630,error=0, records=41
[INFO ] 2026-06-02 07:32:06.981 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850920},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:32:07.134 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:32:07.134 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 07:32:07.135 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:32:07.135 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:32:07.135 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:32:07.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:32:07.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:32:07.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:32:07.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:32:07.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:32:07.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:32:07.746 [16885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:32:09.076 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:32:09.076 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21495/300s
[WARN ] 2026-06-02 07:32:17.751 [16838] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/12327/stat), No such file or directory
[INFO ] 2026-06-02 07:32:21.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 07:32:21.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429631,ok=429631,error=0, records=41
[WARN ] 2026-06-02 07:32:22.752 [16859] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:32:24.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 07:32:32.757 [16838] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/12327/stat), No such file or directory
[WARN ] 2026-06-02 07:32:32.758 [16838] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/14135/stat), No such file or directory
[INFO ] 2026-06-02 07:32:36.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-02 07:32:36.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429632,ok=429632,error=0, records=41
[WARN ] 2026-06-02 07:32:37.758 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:32:39.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 07:32:47.762 [16885] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10852/stat), No such file or directory
[WARN ] 2026-06-02 07:32:47.762 [16885] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/12327/stat), No such file or directory
[WARN ] 2026-06-02 07:32:47.763 [16885] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/14128/stat), No such file or directory
[WARN ] 2026-06-02 07:32:47.763 [16885] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/14135/stat), No such file or directory
[INFO ] 2026-06-02 07:32:51.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 07:32:51.077 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429633,ok=429633,error=0, records=41
[WARN ] 2026-06-02 07:32:52.764 [16838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:32:54.078 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:32:56.822 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21493/300s
[INFO ] 2026-06-02 07:32:58.623 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21493/300s
[INFO ] 2026-06-02 07:33:05.528 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21493/300s
[INFO ] 2026-06-02 07:33:06.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 07:33:06.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429634,ok=429634,error=0, records=41
[WARN ] 2026-06-02 07:33:07.770 [16865] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:33:09.078 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:33:21.087 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 07:33:21.087 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429635,ok=429635,error=0, records=41
[WARN ] 2026-06-02 07:33:22.775 [16859] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:33:24.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:33:36.092 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 07:33:36.092 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429636,ok=429636,error=0, records=41
[WARN ] 2026-06-02 07:33:37.780 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:33:39.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 07:33:39.080 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 07:33:51.097 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 07:33:51.097 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429637,ok=429637,error=0, records=41
[WARN ] 2026-06-02 07:33:52.785 [16830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:33:54.080 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:34:06.104 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 07:34:06.104 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429638,ok=429638,error=0, records=41
[WARN ] 2026-06-02 07:34:07.790 [16865] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:34:09.081 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:34:21.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 07:34:21.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429639,ok=429639,error=0, records=41
[WARN ] 2026-06-02 07:34:22.796 [16885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:34:24.081 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:34:36.115 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 07:34:36.115 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429640,ok=429640,error=0, records=41
[WARN ] 2026-06-02 07:34:37.803 [17446] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:34:39.082 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:34:51.120 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 07:34:51.120 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429641,ok=429641,error=0, records=41
[WARN ] 2026-06-02 07:34:52.807 [17455] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:34:54.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:35:01.852 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21497/300s
[INFO ] 2026-06-02 07:35:06.126 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 07:35:06.126 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429642,ok=429642,error=0, records=41
[INFO ] 2026-06-02 07:35:07.135 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17897/300s
[INFO ] 2026-06-02 07:35:07.136 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850844},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:35:07.321 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:35:07.321 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 07:35:07.322 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:35:07.322 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:35:07.322 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:35:07.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:35:07.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:35:07.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:35:07.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:35:07.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:35:07.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:35:07.813 [16838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:35:09.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:35:21.133 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 07:35:21.133 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429643,ok=429643,error=0, records=41
[WARN ] 2026-06-02 07:35:22.819 [17446] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:35:24.084 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:35:27.820 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21488/300s
[INFO ] 2026-06-02 07:35:36.139 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 07:35:36.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429644,ok=429644,error=0, records=41
[WARN ] 2026-06-02 07:35:37.825 [17446] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:35:39.084 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:35:42.965 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21497/300s
[INFO ] 2026-06-02 07:35:51.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 07:35:51.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429645,ok=429645,error=0, records=41
[WARN ] 2026-06-02 07:35:52.830 [16838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:35:54.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:36:06.152 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-02 07:36:06.152 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429646,ok=429646,error=0, records=41
[INFO ] 2026-06-02 07:36:06.152 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21484/300s
[WARN ] 2026-06-02 07:36:07.836 [17520] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:36:09.086 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:36:21.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-02 07:36:21.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429647,ok=429647,error=0, records=41
[WARN ] 2026-06-02 07:36:22.841 [17492] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:36:24.086 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:36:36.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 07:36:36.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429648,ok=429648,error=0, records=41
[WARN ] 2026-06-02 07:36:37.846 [17534] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:36:39.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:36:49.922 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21493/300s
[INFO ] 2026-06-02 07:36:51.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-06-02 07:36:51.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429649,ok=429649,error=0, records=41
[WARN ] 2026-06-02 07:36:52.852 [17534] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:36:54.088 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:36:57.214 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21484/300s
[INFO ] 2026-06-02 07:37:06.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 07:37:06.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429650,ok=429650,error=0, records=41
[WARN ] 2026-06-02 07:37:07.857 [17585] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:37:09.088 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:37:09.088 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21496/300s
[INFO ] 2026-06-02 07:37:21.257 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 07:37:21.257 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429651,ok=429651,error=0, records=41
[WARN ] 2026-06-02 07:37:22.863 [17543] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:37:24.089 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:37:36.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-02 07:37:36.262 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429652,ok=429652,error=0, records=41
[WARN ] 2026-06-02 07:37:37.870 [17599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:37:39.090 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:37:51.267 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 07:37:51.267 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429653,ok=429653,error=0, records=41
[WARN ] 2026-06-02 07:37:52.875 [17599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:37:54.090 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:37:56.849 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21494/300s
[INFO ] 2026-06-02 07:37:58.650 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21494/300s
[INFO ] 2026-06-02 07:38:05.554 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21494/300s
[INFO ] 2026-06-02 07:38:06.272 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-02 07:38:06.272 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429654,ok=429654,error=0, records=41
[INFO ] 2026-06-02 07:38:07.323 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850776},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:38:07.496 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:38:07.497 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 07:38:07.497 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:38:07.497 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:38:07.497 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:38:07.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:38:07.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:38:07.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:38:07.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:38:07.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:38:07.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:38:07.881 [17633] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:38:09.091 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:38:21.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-02 07:38:21.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429655,ok=429655,error=0, records=41
[WARN ] 2026-06-02 07:38:22.887 [17627] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:38:24.091 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:38:36.282 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-02 07:38:36.282 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429656,ok=429656,error=0, records=41
[WARN ] 2026-06-02 07:38:37.893 [17644] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:38:39.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:38:51.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10145, records=41
[INFO ] 2026-06-02 07:38:51.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429657,ok=429657,error=0, records=41
[WARN ] 2026-06-02 07:38:52.899 [17644] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:38:54.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:38:54.092 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 07:39:06.299 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 07:39:06.299 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429658,ok=429658,error=0, records=41
[WARN ] 2026-06-02 07:39:07.905 [17633] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:39:09.094 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:39:21.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 07:39:21.306 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429659,ok=429659,error=0, records=41
[WARN ] 2026-06-02 07:39:22.911 [17676] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:39:24.094 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 07:39:32.415 [17835] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17302/stat), No such file or directory
[INFO ] 2026-06-02 07:39:36.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 07:39:36.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429660,ok=429660,error=0, records=41
[WARN ] 2026-06-02 07:39:37.915 [17830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:39:39.095 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 07:39:47.419 [17852] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17302/stat), No such file or directory
[WARN ] 2026-06-02 07:39:47.419 [17852] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17277/stat), No such file or directory
[INFO ] 2026-06-02 07:39:51.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-02 07:39:51.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429661,ok=429661,error=0, records=41
[WARN ] 2026-06-02 07:39:52.921 [17852] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:39:54.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:40:01.855 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21498/300s
[INFO ] 2026-06-02 07:40:06.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-02 07:40:06.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429662,ok=429662,error=0, records=41
[WARN ] 2026-06-02 07:40:07.927 [17882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:40:09.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:40:21.327 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 07:40:21.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429663,ok=429663,error=0, records=41
[WARN ] 2026-06-02 07:40:22.933 [17882] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:40:24.097 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:40:27.934 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21489/300s
[INFO ] 2026-06-02 07:40:36.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 07:40:36.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429664,ok=429664,error=0, records=41
[WARN ] 2026-06-02 07:40:37.937 [17910] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:40:39.098 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:40:42.970 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21498/300s
[INFO ] 2026-06-02 07:40:51.416 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-02 07:40:51.417 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429665,ok=429665,error=0, records=41
[WARN ] 2026-06-02 07:40:52.943 [17921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:40:54.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:41:06.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 07:41:06.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429666,ok=429666,error=0, records=41
[INFO ] 2026-06-02 07:41:06.425 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21485/300s
[INFO ] 2026-06-02 07:41:07.497 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17898/300s
[INFO ] 2026-06-02 07:41:07.499 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850656},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:41:07.657 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:41:07.657 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 07:41:07.658 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:41:07.658 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:41:07.658 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:41:07.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:41:07.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:41:07.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:41:07.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:41:07.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:41:07.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:41:07.948 [17948] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:41:09.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:41:21.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 07:41:21.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429667,ok=429667,error=0, records=41
[WARN ] 2026-06-02 07:41:22.954 [17958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:41:24.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:41:36.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11327, records=45
[INFO ] 2026-06-02 07:41:36.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429668,ok=429668,error=0, records=45
[WARN ] 2026-06-02 07:41:37.959 [17948] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:41:39.101 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:41:49.970 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21494/300s
[INFO ] 2026-06-02 07:41:51.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-02 07:41:51.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429669,ok=429669,error=0, records=41
[WARN ] 2026-06-02 07:41:52.963 [17958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:41:54.101 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:41:57.403 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21485/300s
[INFO ] 2026-06-02 07:42:06.495 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 07:42:06.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429670,ok=429670,error=0, records=41
[WARN ] 2026-06-02 07:42:07.967 [17958] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:42:09.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:42:09.102 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21497/300s
[INFO ] 2026-06-02 07:42:21.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 07:42:21.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429671,ok=429671,error=0, records=41
[WARN ] 2026-06-02 07:42:22.972 [18000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:42:24.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:42:36.607 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 07:42:36.607 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429672,ok=429672,error=0, records=41
[WARN ] 2026-06-02 07:42:37.977 [18031] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:42:39.103 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:42:51.612 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 07:42:51.612 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429673,ok=429673,error=0, records=41
[WARN ] 2026-06-02 07:42:52.981 [17921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:42:54.104 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:42:56.886 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21495/300s
[INFO ] 2026-06-02 07:42:58.688 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21495/300s
[INFO ] 2026-06-02 07:43:05.593 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21495/300s
[INFO ] 2026-06-02 07:43:06.617 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10134, records=41
[INFO ] 2026-06-02 07:43:06.617 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429674,ok=429674,error=0, records=41
[WARN ] 2026-06-02 07:43:07.986 [18045] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:43:09.104 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:43:21.628 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10093, records=41
[INFO ] 2026-06-02 07:43:21.628 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429675,ok=429675,error=0, records=41
[WARN ] 2026-06-02 07:43:22.991 [18031] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:43:24.105 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:43:36.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10096, records=41
[INFO ] 2026-06-02 07:43:36.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429676,ok=429676,error=0, records=41
[WARN ] 2026-06-02 07:43:37.995 [18031] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:43:39.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 07:43:39.106 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 07:43:51.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10080, records=41
[INFO ] 2026-06-02 07:43:51.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429677,ok=429677,error=0, records=41
[WARN ] 2026-06-02 07:43:53.000 [18059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:43:54.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:44:06.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 07:44:06.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429678,ok=429678,error=0, records=41
[INFO ] 2026-06-02 07:44:07.659 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850588},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:44:07.843 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:44:07.843 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 07:44:07.843 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:44:07.843 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:44:07.843 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:44:07.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:44:07.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:44:07.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:44:07.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:44:07.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:44:07.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 07:44:08.004 [18059] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:44:09.107 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:44:21.651 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 07:44:21.651 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429679,ok=429679,error=0, records=41
[WARN ] 2026-06-02 07:44:23.008 [18073] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:44:24.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:44:36.657 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 07:44:36.657 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429680,ok=429680,error=0, records=41
[WARN ] 2026-06-02 07:44:38.013 [18045] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:44:39.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:44:51.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 07:44:51.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429681,ok=429681,error=0, records=41
[WARN ] 2026-06-02 07:44:53.018 [18101] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:44:54.109 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:45:01.858 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21499/300s
[WARN ] 2026-06-02 07:45:02.532 [18045] cloudMonitor/base_collect.cpp:241: SicGetProcessState failed, err: FeadFileContent(/proc/18653/stat), No such file or directory
[INFO ] 2026-06-02 07:45:06.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 07:45:06.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429682,ok=429682,error=0, records=41
[WARN ] 2026-06-02 07:45:08.024 [18158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:45:09.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:45:21.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 07:45:21.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429683,ok=429683,error=0, records=41
[WARN ] 2026-06-02 07:45:23.029 [18158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:45:24.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:45:28.030 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21490/300s
[INFO ] 2026-06-02 07:45:36.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 07:45:36.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429684,ok=429684,error=0, records=41
[WARN ] 2026-06-02 07:45:38.034 [18045] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:45:39.111 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:45:42.977 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21499/300s
[INFO ] 2026-06-02 07:45:51.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 07:45:51.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429685,ok=429685,error=0, records=41
[WARN ] 2026-06-02 07:45:53.039 [18215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:45:54.111 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:46:06.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 07:46:06.691 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429686,ok=429686,error=0, records=41
[INFO ] 2026-06-02 07:46:06.691 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21486/300s
[WARN ] 2026-06-02 07:46:08.046 [18219] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:46:09.112 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:46:21.710 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 07:46:21.710 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429687,ok=429687,error=0, records=41
[WARN ] 2026-06-02 07:46:23.051 [18216] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:46:24.113 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:46:36.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 07:46:36.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429688,ok=429688,error=0, records=41
[WARN ] 2026-06-02 07:46:37.556 [18266] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:46:39.113 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:46:50.025 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21495/300s
[INFO ] 2026-06-02 07:46:51.722 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 07:46:51.722 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429689,ok=429689,error=0, records=41
[WARN ] 2026-06-02 07:46:52.561 [18291] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:46:54.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:46:57.570 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21486/300s
[INFO ] 2026-06-02 07:47:06.726 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 07:47:06.726 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429690,ok=429690,error=0, records=41
[WARN ] 2026-06-02 07:47:07.567 [18296] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:47:07.843 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17899/300s
[INFO ] 2026-06-02 07:47:07.845 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850520},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:47:08.007 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:47:08.007 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 07:47:08.008 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:47:08.008 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:47:08.008 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:47:08.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:47:08.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:47:08.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:47:08.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:47:08.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:47:08.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:47:09.115 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:47:09.115 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21498/300s
[INFO ] 2026-06-02 07:47:21.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 07:47:21.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429691,ok=429691,error=0, records=41
[WARN ] 2026-06-02 07:47:22.573 [18309] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:47:24.115 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:47:36.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 07:47:36.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429692,ok=429692,error=0, records=41
[WARN ] 2026-06-02 07:47:37.578 [18332] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:47:39.116 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:47:51.745 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 07:47:51.745 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429693,ok=429693,error=0, records=41
[WARN ] 2026-06-02 07:47:52.582 [18344] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:47:54.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:47:56.956 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21496/300s
[INFO ] 2026-06-02 07:47:58.758 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21496/300s
[INFO ] 2026-06-02 07:48:05.663 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21496/300s
[INFO ] 2026-06-02 07:48:06.751 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 07:48:06.751 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429694,ok=429694,error=0, records=41
[WARN ] 2026-06-02 07:48:07.587 [18373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:48:09.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:48:21.757 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 07:48:21.757 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429695,ok=429695,error=0, records=41
[WARN ] 2026-06-02 07:48:22.592 [18354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:48:24.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:48:36.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 07:48:36.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429696,ok=429696,error=0, records=41
[WARN ] 2026-06-02 07:48:37.598 [18394] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:48:39.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:48:51.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 07:48:51.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429697,ok=429697,error=0, records=41
[WARN ] 2026-06-02 07:48:52.603 [18363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:48:54.119 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:49:06.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 07:49:06.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429698,ok=429698,error=0, records=41
[WARN ] 2026-06-02 07:49:07.608 [18408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:49:09.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:49:21.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 07:49:21.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429699,ok=429699,error=0, records=41
[WARN ] 2026-06-02 07:49:22.613 [18394] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:49:24.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:49:36.878 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 07:49:36.878 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429700,ok=429700,error=0, records=41
[WARN ] 2026-06-02 07:49:37.618 [18363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:49:39.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:49:51.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 07:49:51.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429701,ok=429701,error=0, records=41
[WARN ] 2026-06-02 07:49:52.623 [18363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:49:54.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:50:01.862 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21500/300s
[INFO ] 2026-06-02 07:50:06.899 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 07:50:06.899 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429702,ok=429702,error=0, records=41
[WARN ] 2026-06-02 07:50:07.628 [18408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:50:08.009 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850456},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:50:08.164 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:50:08.164 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 07:50:08.164 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:50:08.164 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:50:08.164 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:50:08.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:50:08.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:50:08.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:50:08.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:50:08.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:50:08.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:50:09.122 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:50:21.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 07:50:21.905 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429703,ok=429703,error=0, records=41
[WARN ] 2026-06-02 07:50:22.633 [18408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:50:24.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:50:28.134 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21491/300s
[INFO ] 2026-06-02 07:50:36.910 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 07:50:36.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429704,ok=429704,error=0, records=41
[WARN ] 2026-06-02 07:50:37.638 [18363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:50:39.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:50:42.984 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21500/300s
[INFO ] 2026-06-02 07:50:51.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 07:50:51.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429705,ok=429705,error=0, records=41
[WARN ] 2026-06-02 07:50:52.643 [18373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:50:54.124 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:51:06.921 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 07:51:06.921 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429706,ok=429706,error=0, records=41
[INFO ] 2026-06-02 07:51:06.921 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21487/300s
[WARN ] 2026-06-02 07:51:07.648 [18373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:51:09.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:51:21.927 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 07:51:21.927 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429707,ok=429707,error=0, records=41
[WARN ] 2026-06-02 07:51:22.653 [18373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:51:24.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:51:36.935 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 07:51:36.935 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429708,ok=429708,error=0, records=41
[WARN ] 2026-06-02 07:51:37.658 [18408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:51:39.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:51:50.082 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21496/300s
[INFO ] 2026-06-02 07:51:51.940 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 07:51:51.940 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429709,ok=429709,error=0, records=41
[WARN ] 2026-06-02 07:51:52.662 [18408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:51:54.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:51:57.754 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21487/300s
[INFO ] 2026-06-02 07:52:06.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 07:52:06.950 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429710,ok=429710,error=0, records=41
[WARN ] 2026-06-02 07:52:07.666 [18394] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:52:09.127 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:52:09.127 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21499/300s
[INFO ] 2026-06-02 07:52:21.956 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 07:52:21.956 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429711,ok=429711,error=0, records=41
[WARN ] 2026-06-02 07:52:22.671 [18363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:52:24.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:52:36.961 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 07:52:36.961 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429712,ok=429712,error=0, records=41
[WARN ] 2026-06-02 07:52:37.677 [18394] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:52:39.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:52:51.997 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 07:52:51.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429713,ok=429713,error=0, records=41
[WARN ] 2026-06-02 07:52:52.683 [18363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:52:54.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:52:57.016 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21497/300s
[INFO ] 2026-06-02 07:52:58.817 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21497/300s
[INFO ] 2026-06-02 07:53:05.723 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21497/300s
[INFO ] 2026-06-02 07:53:07.004 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 07:53:07.004 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429714,ok=429714,error=0, records=41
[WARN ] 2026-06-02 07:53:07.688 [18373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:53:08.165 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17900/300s
[INFO ] 2026-06-02 07:53:08.166 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850392},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:53:08.347 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:53:08.347 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 07:53:08.347 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:53:08.347 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:53:08.347 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:53:08.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:53:08.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:53:08.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:53:08.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:53:08.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:53:08.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:53:09.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:53:22.034 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 07:53:22.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429715,ok=429715,error=0, records=41
[WARN ] 2026-06-02 07:53:22.693 [18363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:53:24.130 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:53:37.041 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 07:53:37.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429716,ok=429716,error=0, records=41
[WARN ] 2026-06-02 07:53:37.697 [18363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:53:39.131 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 07:53:39.131 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 07:53:52.048 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 07:53:52.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429717,ok=429717,error=0, records=41
[WARN ] 2026-06-02 07:53:52.702 [18408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:53:54.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:53:54.132 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 07:54:07.054 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 07:54:07.054 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429718,ok=429718,error=0, records=41
[WARN ] 2026-06-02 07:54:07.708 [18373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:54:09.133 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:54:22.059 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 07:54:22.059 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429719,ok=429719,error=0, records=41
[WARN ] 2026-06-02 07:54:22.712 [18363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:54:24.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:54:37.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 07:54:37.063 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429720,ok=429720,error=0, records=41
[WARN ] 2026-06-02 07:54:37.717 [18373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:54:39.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:54:52.071 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 07:54:52.071 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429721,ok=429721,error=0, records=41
[WARN ] 2026-06-02 07:54:52.721 [18394] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:54:54.135 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:55:01.865 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21501/300s
[INFO ] 2026-06-02 07:55:07.076 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10312, records=41
[INFO ] 2026-06-02 07:55:07.076 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429722,ok=429722,error=0, records=41
[WARN ] 2026-06-02 07:55:07.727 [18408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:55:09.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:55:22.080 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-02 07:55:22.080 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429723,ok=429723,error=0, records=41
[WARN ] 2026-06-02 07:55:22.732 [18354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:55:24.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:55:28.234 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21492/300s
[INFO ] 2026-06-02 07:55:37.087 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-02 07:55:37.087 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429724,ok=429724,error=0, records=41
[WARN ] 2026-06-02 07:55:37.738 [18394] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:55:39.137 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:55:42.990 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21501/300s
[INFO ] 2026-06-02 07:55:52.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-02 07:55:52.158 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429725,ok=429725,error=0, records=41
[WARN ] 2026-06-02 07:55:52.743 [18373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:55:54.137 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:56:07.232 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 07:56:07.232 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429726,ok=429726,error=0, records=41
[INFO ] 2026-06-02 07:56:07.232 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21488/300s
[WARN ] 2026-06-02 07:56:07.749 [18373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:56:08.349 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850328},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:56:08.526 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:56:08.526 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 07:56:08.526 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:56:08.526 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:56:08.526 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:56:08.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:56:08.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:56:08.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:56:08.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:56:08.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:56:08.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:56:09.138 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:56:22.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 07:56:22.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429727,ok=429727,error=0, records=41
[WARN ] 2026-06-02 07:56:22.754 [18408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:56:24.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:56:37.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 07:56:37.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429728,ok=429728,error=0, records=41
[WARN ] 2026-06-02 07:56:37.759 [18394] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:56:39.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:56:50.136 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21497/300s
[INFO ] 2026-06-02 07:56:52.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 07:56:52.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429729,ok=429729,error=0, records=41
[WARN ] 2026-06-02 07:56:52.764 [18373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:56:54.140 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:56:57.937 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21488/300s
[INFO ] 2026-06-02 07:57:07.255 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 07:57:07.255 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429730,ok=429730,error=0, records=41
[WARN ] 2026-06-02 07:57:07.769 [18408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:57:09.140 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:57:09.141 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21500/300s
[INFO ] 2026-06-02 07:57:22.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 07:57:22.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429731,ok=429731,error=0, records=41
[WARN ] 2026-06-02 07:57:22.774 [18363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:57:24.141 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:57:37.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 07:57:37.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429732,ok=429732,error=0, records=41
[WARN ] 2026-06-02 07:57:37.779 [18354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:57:39.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:57:52.273 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 07:57:52.273 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429733,ok=429733,error=0, records=41
[WARN ] 2026-06-02 07:57:52.784 [18394] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:57:54.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:57:57.066 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21498/300s
[INFO ] 2026-06-02 07:57:58.867 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21498/300s
[INFO ] 2026-06-02 07:58:05.772 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21498/300s
[INFO ] 2026-06-02 07:58:07.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 07:58:07.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429734,ok=429734,error=0, records=41
[WARN ] 2026-06-02 07:58:07.789 [18373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:58:09.143 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:58:22.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 07:58:22.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429735,ok=429735,error=0, records=41
[WARN ] 2026-06-02 07:58:22.793 [18408] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:58:24.143 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:58:37.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 07:58:37.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429736,ok=429736,error=0, records=41
[WARN ] 2026-06-02 07:58:37.798 [18363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:58:39.144 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:58:52.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 07:58:52.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429737,ok=429737,error=0, records=41
[WARN ] 2026-06-02 07:58:52.804 [18354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:58:54.145 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:59:07.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 07:59:07.306 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429738,ok=429738,error=0, records=41
[WARN ] 2026-06-02 07:59:07.809 [18960] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:59:08.526 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17901/300s
[INFO ] 2026-06-02 07:59:08.528 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850260},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 07:59:08.686 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 07:59:08.686 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 07:59:08.686 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 07:59:08.686 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 07:59:08.686 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 07:59:08.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:59:08.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 07:59:08.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:59:08.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 07:59:08.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:59:08.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 07:59:09.145 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:59:22.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 07:59:22.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429739,ok=429739,error=0, records=41
[WARN ] 2026-06-02 07:59:22.815 [18976] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:59:24.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:59:37.317 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 07:59:37.317 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429740,ok=429740,error=0, records=41
[WARN ] 2026-06-02 07:59:37.820 [18966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:59:39.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 07:59:52.323 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 07:59:52.323 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429741,ok=429741,error=0, records=41
[WARN ] 2026-06-02 07:59:52.825 [18996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 07:59:54.147 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:00:01.868 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21502/300s
[INFO ] 2026-06-02 08:00:07.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-02 08:00:07.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429742,ok=429742,error=0, records=41
[WARN ] 2026-06-02 08:00:07.833 [18394] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:00:09.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:00:22.359 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-02 08:00:22.359 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429743,ok=429743,error=0, records=41
[WARN ] 2026-06-02 08:00:22.839 [18394] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:00:24.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:00:28.341 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21493/300s
[INFO ] 2026-06-02 08:00:37.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 08:00:37.465 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429744,ok=429744,error=0, records=41
[WARN ] 2026-06-02 08:00:37.845 [18982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:00:39.149 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:00:42.996 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21502/300s
[INFO ] 2026-06-02 08:00:52.473 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 08:00:52.473 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429745,ok=429745,error=0, records=41
[WARN ] 2026-06-02 08:00:52.850 [19071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:00:54.149 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:01:07.478 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-02 08:01:07.478 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429746,ok=429746,error=0, records=41
[INFO ] 2026-06-02 08:01:07.478 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21489/300s
[WARN ] 2026-06-02 08:01:07.854 [19057] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:01:09.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:01:22.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 08:01:22.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429747,ok=429747,error=0, records=41
[WARN ] 2026-06-02 08:01:22.859 [18982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:01:24.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:01:37.489 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 08:01:37.489 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429748,ok=429748,error=0, records=41
[WARN ] 2026-06-02 08:01:37.864 [19097] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:01:39.151 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:01:50.188 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21498/300s
[INFO ] 2026-06-02 08:01:52.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 08:01:52.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429749,ok=429749,error=0, records=41
[WARN ] 2026-06-02 08:01:52.870 [18982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:01:54.152 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:01:58.119 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21489/300s
[INFO ] 2026-06-02 08:02:07.502 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-02 08:02:07.502 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429750,ok=429750,error=0, records=41
[WARN ] 2026-06-02 08:02:07.874 [19097] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:02:08.688 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850192},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:02:08.826 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:02:08.827 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 08:02:08.827 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:02:08.827 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:02:08.827 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:02:08.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:02:08.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:02:08.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:02:08.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:02:08.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:02:08.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:02:09.152 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:02:09.152 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21501/300s
[INFO ] 2026-06-02 08:02:22.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-02 08:02:22.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429751,ok=429751,error=0, records=41
[WARN ] 2026-06-02 08:02:22.879 [19170] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:02:24.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:02:37.513 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10365, records=41
[INFO ] 2026-06-02 08:02:37.513 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429752,ok=429752,error=0, records=41
[WARN ] 2026-06-02 08:02:37.885 [19192] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:02:39.154 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:02:52.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-02 08:02:52.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429753,ok=429753,error=0, records=41
[WARN ] 2026-06-02 08:02:52.891 [19208] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:02:54.154 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:02:57.115 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21499/300s
[INFO ] 2026-06-02 08:02:58.917 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21499/300s
[INFO ] 2026-06-02 08:03:05.823 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21499/300s
[INFO ] 2026-06-02 08:03:07.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 08:03:07.526 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429754,ok=429754,error=0, records=41
[WARN ] 2026-06-02 08:03:07.897 [19225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:03:09.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:03:22.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 08:03:22.532 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429755,ok=429755,error=0, records=41
[WARN ] 2026-06-02 08:03:22.903 [19236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:03:24.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:03:37.538 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 08:03:37.538 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429756,ok=429756,error=0, records=41
[WARN ] 2026-06-02 08:03:37.908 [19220] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:03:39.156 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 08:03:39.156 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 08:03:52.544 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 08:03:52.544 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429757,ok=429757,error=0, records=41
[WARN ] 2026-06-02 08:03:52.913 [19273] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:03:54.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:04:07.550 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 08:04:07.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429758,ok=429758,error=0, records=41
[WARN ] 2026-06-02 08:04:07.920 [19225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:04:09.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:04:22.556 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 08:04:22.556 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429759,ok=429759,error=0, records=41
[WARN ] 2026-06-02 08:04:22.924 [19225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:04:24.158 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:04:37.596 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 08:04:37.596 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429760,ok=429760,error=0, records=41
[WARN ] 2026-06-02 08:04:37.932 [19252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:04:39.159 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:04:52.602 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 08:04:52.602 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429761,ok=429761,error=0, records=41
[WARN ] 2026-06-02 08:04:52.939 [19252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:04:54.159 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:05:01.871 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21503/300s
[INFO ] 2026-06-02 08:05:07.607 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 08:05:07.607 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429762,ok=429762,error=0, records=41
[WARN ] 2026-06-02 08:05:07.945 [19352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:05:08.827 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17902/300s
[INFO ] 2026-06-02 08:05:08.829 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850120},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:05:08.984 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:05:08.984 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 08:05:08.984 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:05:08.984 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:05:08.984 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:05:09.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:05:09.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:05:09.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:05:09.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:05:09.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:05:09.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:05:09.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:05:22.614 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-02 08:05:22.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429763,ok=429763,error=0, records=41
[WARN ] 2026-06-02 08:05:22.951 [19252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:05:24.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:05:28.452 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21494/300s
[INFO ] 2026-06-02 08:05:37.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-02 08:05:37.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429764,ok=429764,error=0, records=41
[WARN ] 2026-06-02 08:05:37.956 [19378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:05:39.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:05:43.003 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21503/300s
[INFO ] 2026-06-02 08:05:52.626 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-02 08:05:52.626 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429765,ok=429765,error=0, records=41
[WARN ] 2026-06-02 08:05:52.961 [19359] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:05:54.162 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:06:07.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-02 08:06:07.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429766,ok=429766,error=0, records=41
[INFO ] 2026-06-02 08:06:07.630 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21490/300s
[WARN ] 2026-06-02 08:06:07.967 [19406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:06:09.162 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:06:22.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-02 08:06:22.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429767,ok=429767,error=0, records=41
[WARN ] 2026-06-02 08:06:22.971 [19392] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:06:24.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:06:37.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-02 08:06:37.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429768,ok=429768,error=0, records=41
[WARN ] 2026-06-02 08:06:37.975 [19392] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:06:39.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:06:50.246 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21499/300s
[INFO ] 2026-06-02 08:06:52.648 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-02 08:06:52.648 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429769,ok=429769,error=0, records=41
[WARN ] 2026-06-02 08:06:52.980 [19341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:06:54.164 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:06:58.300 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21490/300s
[INFO ] 2026-06-02 08:07:07.654 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-02 08:07:07.654 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429770,ok=429770,error=0, records=41
[WARN ] 2026-06-02 08:07:07.986 [19359] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:07:09.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:07:09.165 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21502/300s
[INFO ] 2026-06-02 08:07:22.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 08:07:22.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429771,ok=429771,error=0, records=41
[WARN ] 2026-06-02 08:07:22.990 [19341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:07:24.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:07:37.666 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 08:07:37.666 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429772,ok=429772,error=0, records=41
[WARN ] 2026-06-02 08:07:37.996 [19392] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:07:39.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:07:52.671 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 08:07:52.671 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429773,ok=429773,error=0, records=41
[WARN ] 2026-06-02 08:07:53.001 [19503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:07:54.167 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:07:57.174 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21500/300s
[INFO ] 2026-06-02 08:07:58.976 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21500/300s
[INFO ] 2026-06-02 08:08:05.881 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21500/300s
[INFO ] 2026-06-02 08:08:07.676 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 08:08:07.676 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429774,ok=429774,error=0, records=41
[WARN ] 2026-06-02 08:08:08.006 [19517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:08:08.986 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20850060},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:08:09.149 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:08:09.149 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 08:08:09.150 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:08:09.150 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:08:09.150 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:08:09.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:08:09.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:08:09.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:08:09.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:08:09.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:08:09.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:08:09.167 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:08:22.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 08:08:22.683 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429775,ok=429775,error=0, records=41
[WARN ] 2026-06-02 08:08:23.010 [19503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:08:24.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:08:37.688 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 08:08:37.688 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429776,ok=429776,error=0, records=41
[WARN ] 2026-06-02 08:08:38.015 [19341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:08:39.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:08:52.694 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 08:08:52.694 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429777,ok=429777,error=0, records=41
[WARN ] 2026-06-02 08:08:53.021 [19559] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:08:54.169 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:08:54.169 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 08:09:07.699 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 08:09:07.699 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429778,ok=429778,error=0, records=41
[WARN ] 2026-06-02 08:09:08.025 [19559] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:09:09.170 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:09:22.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 08:09:22.712 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429779,ok=429779,error=0, records=41
[WARN ] 2026-06-02 08:09:23.031 [19587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:09:24.171 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:09:37.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 08:09:37.717 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429780,ok=429780,error=0, records=41
[WARN ] 2026-06-02 08:09:38.035 [19532] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:09:39.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:09:52.722 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 08:09:52.722 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429781,ok=429781,error=0, records=41
[WARN ] 2026-06-02 08:09:53.040 [19601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:09:54.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:10:01.875 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21504/300s
[INFO ] 2026-06-02 08:10:07.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 08:10:07.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429782,ok=429782,error=0, records=41
[WARN ] 2026-06-02 08:10:08.046 [19623] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:10:09.173 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:10:22.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 08:10:22.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429783,ok=429783,error=0, records=41
[WARN ] 2026-06-02 08:10:23.051 [19639] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:10:24.173 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:10:28.552 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21495/300s
[WARN ] 2026-06-02 08:10:37.554 [19638] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:10:37.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 08:10:37.737 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429784,ok=429784,error=0, records=41
[INFO ] 2026-06-02 08:10:39.174 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:10:43.009 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21504/300s
[WARN ] 2026-06-02 08:10:52.559 [19660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:10:52.742 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 08:10:52.742 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429785,ok=429785,error=0, records=41
[INFO ] 2026-06-02 08:10:54.174 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:11:07.564 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:11:07.751 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-02 08:11:07.751 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429786,ok=429786,error=0, records=41
[INFO ] 2026-06-02 08:11:07.751 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21491/300s
[INFO ] 2026-06-02 08:11:09.150 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17903/300s
[INFO ] 2026-06-02 08:11:09.151 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849992},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:11:09.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-02 08:11:09.291 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:11:09.291 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 08:11:09.291 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:11:09.291 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:11:09.291 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:11:09.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:11:09.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:11:09.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:11:09.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:11:09.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:11:09.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:11:22.569 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:11:22.757 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-02 08:11:22.757 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429787,ok=429787,error=0, records=41
[INFO ] 2026-06-02 08:11:24.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:11:37.575 [19744] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:11:37.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-02 08:11:37.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429788,ok=429788,error=0, records=41
[INFO ] 2026-06-02 08:11:39.176 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:11:50.303 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21500/300s
[WARN ] 2026-06-02 08:11:52.580 [19744] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:11:52.768 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 08:11:52.768 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429789,ok=429789,error=0, records=41
[INFO ] 2026-06-02 08:11:54.177 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:11:58.484 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21491/300s
[WARN ] 2026-06-02 08:12:07.585 [19744] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:12:07.773 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-02 08:12:07.773 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429790,ok=429790,error=0, records=41
[INFO ] 2026-06-02 08:12:09.177 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:12:09.177 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21503/300s
[WARN ] 2026-06-02 08:12:22.591 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:12:22.778 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-02 08:12:22.778 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429791,ok=429791,error=0, records=41
[INFO ] 2026-06-02 08:12:24.178 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:12:37.596 [19804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:12:37.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-02 08:12:37.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429792,ok=429792,error=0, records=41
[INFO ] 2026-06-02 08:12:39.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:12:52.601 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:12:52.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 08:12:52.789 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429793,ok=429793,error=0, records=41
[INFO ] 2026-06-02 08:12:54.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:12:57.232 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21501/300s
[INFO ] 2026-06-02 08:12:59.033 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21501/300s
[INFO ] 2026-06-02 08:13:05.938 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21501/300s
[WARN ] 2026-06-02 08:13:07.607 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:13:07.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 08:13:07.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429794,ok=429794,error=0, records=41
[INFO ] 2026-06-02 08:13:09.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:13:22.613 [19788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:13:22.819 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 08:13:22.819 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429795,ok=429795,error=0, records=41
[INFO ] 2026-06-02 08:13:24.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:13:37.618 [19804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:13:37.823 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 08:13:37.823 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429796,ok=429796,error=0, records=41
[INFO ] 2026-06-02 08:13:39.181 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 08:13:39.181 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 08:13:52.624 [19788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:13:52.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 08:13:52.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429797,ok=429797,error=0, records=41
[INFO ] 2026-06-02 08:13:54.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:14:07.632 [19809] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:14:07.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-02 08:14:07.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429798,ok=429798,error=0, records=41
[INFO ] 2026-06-02 08:14:09.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:14:09.292 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849924},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:14:09.452 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:14:09.452 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 08:14:09.452 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:14:09.452 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:14:09.452 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:14:09.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:14:09.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:14:09.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:14:09.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:14:09.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:14:09.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:14:22.636 [19788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:14:22.843 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 08:14:22.843 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429799,ok=429799,error=0, records=41
[INFO ] 2026-06-02 08:14:24.183 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:14:37.641 [19804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:14:37.849 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-02 08:14:37.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429800,ok=429800,error=0, records=41
[INFO ] 2026-06-02 08:14:39.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:14:52.646 [19789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:14:52.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 08:14:52.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429801,ok=429801,error=0, records=41
[INFO ] 2026-06-02 08:14:54.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:15:01.878 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21505/300s
[WARN ] 2026-06-02 08:15:07.650 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:15:07.860 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 08:15:07.860 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429802,ok=429802,error=0, records=41
[INFO ] 2026-06-02 08:15:09.185 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:15:22.655 [19788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:15:22.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 08:15:22.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429803,ok=429803,error=0, records=41
[INFO ] 2026-06-02 08:15:24.185 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:15:28.657 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21496/300s
[WARN ] 2026-06-02 08:15:37.660 [19809] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:15:37.875 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 08:15:37.875 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429804,ok=429804,error=0, records=41
[INFO ] 2026-06-02 08:15:39.186 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:15:43.015 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21505/300s
[WARN ] 2026-06-02 08:15:52.664 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:15:52.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 08:15:52.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429805,ok=429805,error=0, records=41
[INFO ] 2026-06-02 08:15:54.186 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:16:07.669 [19809] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:16:07.886 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 08:16:07.886 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429806,ok=429806,error=0, records=41
[INFO ] 2026-06-02 08:16:07.886 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21492/300s
[INFO ] 2026-06-02 08:16:09.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:16:22.673 [19789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:16:22.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 08:16:22.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429807,ok=429807,error=0, records=41
[INFO ] 2026-06-02 08:16:24.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:16:37.678 [19788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:16:37.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 08:16:37.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429808,ok=429808,error=0, records=41
[INFO ] 2026-06-02 08:16:39.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:16:50.359 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21501/300s
[WARN ] 2026-06-02 08:16:52.683 [19809] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:16:52.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 08:16:52.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429809,ok=429809,error=0, records=41
[INFO ] 2026-06-02 08:16:54.189 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:16:58.666 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21492/300s
[WARN ] 2026-06-02 08:17:07.690 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:17:07.921 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 08:17:07.921 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429810,ok=429810,error=0, records=41
[INFO ] 2026-06-02 08:17:09.189 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:17:09.189 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21504/300s
[INFO ] 2026-06-02 08:17:09.452 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17904/300s
[INFO ] 2026-06-02 08:17:09.454 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849856},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:17:09.617 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:17:09.617 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 08:17:09.618 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:17:09.618 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:17:09.618 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:17:09.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:17:09.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:17:09.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:17:09.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:17:09.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:17:09.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:17:22.696 [19789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:17:22.929 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 08:17:22.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429811,ok=429811,error=0, records=41
[INFO ] 2026-06-02 08:17:24.190 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:17:37.701 [19788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:17:37.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 08:17:37.936 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429812,ok=429812,error=0, records=41
[INFO ] 2026-06-02 08:17:39.191 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:17:52.707 [19789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:17:52.945 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 08:17:52.945 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429813,ok=429813,error=0, records=41
[INFO ] 2026-06-02 08:17:54.191 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:17:57.285 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21502/300s
[INFO ] 2026-06-02 08:17:59.086 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21502/300s
[INFO ] 2026-06-02 08:18:05.993 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21502/300s
[WARN ] 2026-06-02 08:18:07.711 [19804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:18:07.950 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 08:18:07.950 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429814,ok=429814,error=0, records=41
[INFO ] 2026-06-02 08:18:09.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:18:22.717 [19788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:18:22.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 08:18:22.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429815,ok=429815,error=0, records=41
[INFO ] 2026-06-02 08:18:24.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:18:37.723 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:18:37.961 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 08:18:37.961 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429816,ok=429816,error=0, records=41
[INFO ] 2026-06-02 08:18:39.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:18:52.729 [19789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:18:52.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 08:18:52.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429817,ok=429817,error=0, records=41
[INFO ] 2026-06-02 08:18:54.194 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:19:07.733 [19809] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:19:07.974 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 08:19:07.974 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429818,ok=429818,error=0, records=41
[INFO ] 2026-06-02 08:19:09.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:19:22.739 [19789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:19:22.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 08:19:22.979 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429819,ok=429819,error=0, records=41
[INFO ] 2026-06-02 08:19:24.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:19:37.745 [19788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:19:37.984 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 08:19:37.984 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429820,ok=429820,error=0, records=41
[INFO ] 2026-06-02 08:19:39.196 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:19:52.750 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:19:52.989 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 08:19:52.989 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429821,ok=429821,error=0, records=41
[INFO ] 2026-06-02 08:19:54.196 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:20:01.881 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21506/300s
[WARN ] 2026-06-02 08:20:07.755 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:20:07.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 08:20:07.996 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429822,ok=429822,error=0, records=41
[INFO ] 2026-06-02 08:20:09.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:20:09.619 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849788},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:20:09.799 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:20:09.799 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 08:20:09.799 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:20:09.799 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:20:09.799 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:20:09.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:20:09.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:20:09.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:20:09.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:20:09.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:20:09.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:20:22.761 [19804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:20:23.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 08:20:23.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429823,ok=429823,error=0, records=41
[INFO ] 2026-06-02 08:20:24.198 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:20:28.762 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21497/300s
[WARN ] 2026-06-02 08:20:37.765 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:20:38.008 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 08:20:38.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429824,ok=429824,error=0, records=41
[INFO ] 2026-06-02 08:20:39.198 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:20:43.022 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21506/300s
[WARN ] 2026-06-02 08:20:52.769 [19809] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:20:53.016 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 08:20:53.016 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429825,ok=429825,error=0, records=41
[INFO ] 2026-06-02 08:20:54.199 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:21:07.774 [19789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:21:08.022 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 08:21:08.022 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429826,ok=429826,error=0, records=41
[INFO ] 2026-06-02 08:21:08.022 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21493/300s
[INFO ] 2026-06-02 08:21:09.200 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:21:22.779 [19804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:21:23.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 08:21:23.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429827,ok=429827,error=0, records=41
[INFO ] 2026-06-02 08:21:24.200 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:21:37.783 [19788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:21:38.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 08:21:38.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429828,ok=429828,error=0, records=41
[INFO ] 2026-06-02 08:21:39.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:21:50.416 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21502/300s
[WARN ] 2026-06-02 08:21:52.788 [19809] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:21:53.040 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 08:21:53.040 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429829,ok=429829,error=0, records=41
[INFO ] 2026-06-02 08:21:54.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:21:58.849 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21493/300s
[WARN ] 2026-06-02 08:22:07.794 [19788] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:22:08.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 08:22:08.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429830,ok=429830,error=0, records=41
[INFO ] 2026-06-02 08:22:09.202 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:22:09.202 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21505/300s
[WARN ] 2026-06-02 08:22:22.799 [19804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:22:23.056 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 08:22:23.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429831,ok=429831,error=0, records=41
[INFO ] 2026-06-02 08:22:24.203 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:22:37.804 [19666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:22:38.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 08:22:38.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429832,ok=429832,error=0, records=41
[INFO ] 2026-06-02 08:22:39.203 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:22:52.809 [20351] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:22:53.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 08:22:53.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429833,ok=429833,error=0, records=41
[INFO ] 2026-06-02 08:22:54.204 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:22:57.347 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21503/300s
[INFO ] 2026-06-02 08:22:59.148 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21503/300s
[INFO ] 2026-06-02 08:23:06.055 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21503/300s
[WARN ] 2026-06-02 08:23:07.814 [20365] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:23:08.074 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 08:23:08.074 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429834,ok=429834,error=0, records=41
[INFO ] 2026-06-02 08:23:09.205 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:23:09.799 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17905/300s
[INFO ] 2026-06-02 08:23:09.801 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849720},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:23:09.973 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:23:09.974 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 08:23:09.974 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:23:09.974 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:23:09.974 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:23:10.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:23:10.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:23:10.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:23:10.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:23:10.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:23:10.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:23:22.819 [20385] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:23:23.080 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 08:23:23.080 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429835,ok=429835,error=0, records=41
[INFO ] 2026-06-02 08:23:24.205 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:23:37.824 [20370] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:23:38.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 08:23:38.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429836,ok=429836,error=0, records=41
[INFO ] 2026-06-02 08:23:39.206 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 08:23:39.206 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 08:23:52.829 [20385] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:23:53.090 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 08:23:53.090 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429837,ok=429837,error=0, records=41
[INFO ] 2026-06-02 08:23:54.207 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:23:54.207 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-02 08:24:07.835 [20413] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:24:08.096 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11276, records=50
[INFO ] 2026-06-02 08:24:08.096 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429838,ok=429838,error=0, records=50
[INFO ] 2026-06-02 08:24:09.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:24:22.840 [20427] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:24:23.102 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 08:24:23.102 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429839,ok=429839,error=0, records=41
[INFO ] 2026-06-02 08:24:24.209 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:24:37.846 [20450] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:24:38.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 08:24:38.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429840,ok=429840,error=0, records=41
[INFO ] 2026-06-02 08:24:39.210 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:24:52.851 [20385] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:24:53.113 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 08:24:53.113 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429841,ok=429841,error=0, records=41
[INFO ] 2026-06-02 08:24:54.210 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:25:01.885 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21507/300s
[WARN ] 2026-06-02 08:25:07.856 [20370] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:25:08.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-02 08:25:08.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429842,ok=429842,error=0, records=41
[INFO ] 2026-06-02 08:25:09.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:25:22.861 [20385] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:25:23.183 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 08:25:23.183 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429843,ok=429843,error=0, records=41
[INFO ] 2026-06-02 08:25:24.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:25:28.863 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21498/300s
[WARN ] 2026-06-02 08:25:37.865 [20506] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:25:38.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-02 08:25:38.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429844,ok=429844,error=0, records=41
[INFO ] 2026-06-02 08:25:39.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:25:43.028 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21507/300s
[WARN ] 2026-06-02 08:25:52.870 [20506] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:25:53.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 08:25:53.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429845,ok=429845,error=0, records=41
[INFO ] 2026-06-02 08:25:54.213 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:26:07.875 [20535] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:26:08.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 08:26:08.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429846,ok=429846,error=0, records=41
[INFO ] 2026-06-02 08:26:08.249 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21494/300s
[INFO ] 2026-06-02 08:26:09.213 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:26:09.976 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849656},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:26:10.138 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:26:10.138 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 08:26:10.138 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:26:10.138 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:26:10.138 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:26:10.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:26:10.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:26:10.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:26:10.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:26:10.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:26:10.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:26:22.882 [20535] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:26:23.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 08:26:23.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429847,ok=429847,error=0, records=41
[INFO ] 2026-06-02 08:26:24.214 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:26:37.888 [20492] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:26:38.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 08:26:38.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429848,ok=429848,error=0, records=41
[INFO ] 2026-06-02 08:26:39.215 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:26:50.472 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21503/300s
[WARN ] 2026-06-02 08:26:52.893 [20583] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:26:53.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 08:26:53.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429849,ok=429849,error=0, records=41
[INFO ] 2026-06-02 08:26:54.215 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:26:59.028 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21494/300s
[WARN ] 2026-06-02 08:27:07.900 [20605] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:27:08.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 08:27:08.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429850,ok=429850,error=0, records=41
[INFO ] 2026-06-02 08:27:09.216 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:27:09.216 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21506/300s
[WARN ] 2026-06-02 08:27:22.905 [20566] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:27:23.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 08:27:23.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429851,ok=429851,error=0, records=41
[INFO ] 2026-06-02 08:27:24.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:27:37.911 [20616] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:27:38.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 08:27:38.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429852,ok=429852,error=0, records=41
[INFO ] 2026-06-02 08:27:39.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:27:52.916 [20654] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:27:53.290 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 08:27:53.290 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429853,ok=429853,error=0, records=41
[INFO ] 2026-06-02 08:27:54.218 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:27:57.419 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21504/300s
[INFO ] 2026-06-02 08:27:59.220 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21504/300s
[INFO ] 2026-06-02 08:28:06.126 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21504/300s
[WARN ] 2026-06-02 08:28:07.921 [20643] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:28:08.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 08:28:08.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429854,ok=429854,error=0, records=41
[INFO ] 2026-06-02 08:28:09.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:28:22.926 [20680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:28:23.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 08:28:23.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429855,ok=429855,error=0, records=41
[INFO ] 2026-06-02 08:28:24.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:28:37.932 [20703] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:28:38.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 08:28:38.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429856,ok=429856,error=0, records=41
[INFO ] 2026-06-02 08:28:39.220 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:28:52.939 [20714] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:28:53.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 08:28:53.377 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429857,ok=429857,error=0, records=41
[INFO ] 2026-06-02 08:28:54.221 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:29:07.946 [20734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:29:08.418 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10381, records=41
[INFO ] 2026-06-02 08:29:08.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429858,ok=429858,error=0, records=41
[INFO ] 2026-06-02 08:29:09.221 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:29:10.138 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17906/300s
[INFO ] 2026-06-02 08:29:10.140 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849596},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:29:10.300 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:29:10.301 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 08:29:10.301 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:29:10.301 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:29:10.301 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:29:10.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:29:10.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:29:10.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:29:10.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:29:10.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:29:10.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:29:22.952 [20734] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:29:23.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 08:29:23.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429859,ok=429859,error=0, records=41
[INFO ] 2026-06-02 08:29:24.222 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:29:37.957 [20680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:29:38.429 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-02 08:29:38.429 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429860,ok=429860,error=0, records=41
[INFO ] 2026-06-02 08:29:39.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:29:52.962 [20681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:29:53.434 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-02 08:29:53.434 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429861,ok=429861,error=0, records=41
[INFO ] 2026-06-02 08:29:54.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:30:01.888 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21508/300s
[WARN ] 2026-06-02 08:30:07.965 [20680] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:30:08.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 08:30:08.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429862,ok=429862,error=0, records=41
[INFO ] 2026-06-02 08:30:09.224 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:30:22.970 [20741] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:30:23.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 08:30:23.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429863,ok=429863,error=0, records=41
[INFO ] 2026-06-02 08:30:24.224 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:30:28.971 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21499/300s
[WARN ] 2026-06-02 08:30:37.974 [20681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:30:38.453 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 08:30:38.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429864,ok=429864,error=0, records=41
[INFO ] 2026-06-02 08:30:39.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:30:43.035 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21508/300s
[WARN ] 2026-06-02 08:30:52.978 [20746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:30:53.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 08:30:53.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429865,ok=429865,error=0, records=41
[INFO ] 2026-06-02 08:30:54.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:31:07.983 [20847] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:31:08.464 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 08:31:08.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429866,ok=429866,error=0, records=41
[INFO ] 2026-06-02 08:31:08.464 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21495/300s
[INFO ] 2026-06-02 08:31:09.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:31:22.988 [20741] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:31:23.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 08:31:23.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429867,ok=429867,error=0, records=41
[INFO ] 2026-06-02 08:31:24.227 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:31:37.995 [20746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:31:38.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 08:31:38.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429868,ok=429868,error=0, records=41
[INFO ] 2026-06-02 08:31:39.227 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:31:50.526 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21504/300s
[WARN ] 2026-06-02 08:31:53.001 [20874] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:31:53.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 08:31:53.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429869,ok=429869,error=0, records=41
[INFO ] 2026-06-02 08:31:54.228 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:31:59.212 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21495/300s
[WARN ] 2026-06-02 08:32:08.006 [20746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:32:08.487 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 08:32:08.487 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429870,ok=429870,error=0, records=41
[INFO ] 2026-06-02 08:32:09.229 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:32:09.229 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21507/300s
[INFO ] 2026-06-02 08:32:10.302 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849524},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:32:10.468 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:32:10.468 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 08:32:10.468 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:32:10.468 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:32:10.468 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:32:10.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:32:10.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:32:10.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:32:10.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:32:10.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:32:10.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:32:23.012 [20901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:32:23.493 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 08:32:23.493 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429871,ok=429871,error=0, records=41
[INFO ] 2026-06-02 08:32:24.229 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:32:38.017 [20746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:32:38.499 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 08:32:38.499 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429872,ok=429872,error=0, records=41
[INFO ] 2026-06-02 08:32:39.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:32:53.021 [20833] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:32:53.504 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 08:32:53.504 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429873,ok=429873,error=0, records=41
[INFO ] 2026-06-02 08:32:54.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:32:57.469 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21505/300s
[INFO ] 2026-06-02 08:32:59.271 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21505/300s
[INFO ] 2026-06-02 08:33:06.174 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21505/300s
[WARN ] 2026-06-02 08:33:08.027 [20948] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:33:08.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-02 08:33:08.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429874,ok=429874,error=0, records=41
[INFO ] 2026-06-02 08:33:09.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:33:23.032 [20948] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:33:23.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-02 08:33:23.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429875,ok=429875,error=0, records=41
[INFO ] 2026-06-02 08:33:24.232 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:33:38.037 [20992] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:33:38.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-02 08:33:38.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429876,ok=429876,error=0, records=41
[INFO ] 2026-06-02 08:33:39.232 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 08:33:39.232 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 08:33:53.044 [20986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:33:53.525 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-02 08:33:53.525 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429877,ok=429877,error=0, records=41
[INFO ] 2026-06-02 08:33:54.233 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:34:08.049 [21007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:34:08.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-02 08:34:08.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429878,ok=429878,error=0, records=41
[INFO ] 2026-06-02 08:34:09.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:34:22.554 [21030] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:34:23.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 08:34:23.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429879,ok=429879,error=0, records=41
[INFO ] 2026-06-02 08:34:24.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:34:37.559 [21057] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:34:38.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10938, records=42
[INFO ] 2026-06-02 08:34:38.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429880,ok=429880,error=0, records=42
[INFO ] 2026-06-02 08:34:39.235 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:34:52.564 [21077] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:34:53.576 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 08:34:53.576 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429881,ok=429881,error=0, records=41
[INFO ] 2026-06-02 08:34:54.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:35:01.892 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21509/300s
[WARN ] 2026-06-02 08:35:07.568 [21076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:35:08.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 08:35:08.584 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429882,ok=429882,error=0, records=41
[INFO ] 2026-06-02 08:35:09.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:35:10.469 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17907/300s
[INFO ] 2026-06-02 08:35:10.470 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849452},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:35:10.629 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:35:10.629 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 08:35:10.629 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:35:10.629 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:35:10.629 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:35:10.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:35:10.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:35:10.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:35:10.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:35:10.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:35:10.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:35:22.574 [21093] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:35:23.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 08:35:23.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429883,ok=429883,error=0, records=41
[INFO ] 2026-06-02 08:35:24.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:35:29.075 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21500/300s
[WARN ] 2026-06-02 08:35:37.579 [21135] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:35:38.596 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 08:35:38.596 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429884,ok=429884,error=0, records=41
[INFO ] 2026-06-02 08:35:39.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:35:43.041 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21509/300s
[WARN ] 2026-06-02 08:35:52.583 [21119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:35:53.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 08:35:53.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429885,ok=429885,error=0, records=41
[INFO ] 2026-06-02 08:35:54.238 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:36:07.588 [21167] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:36:08.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-06-02 08:36:08.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429886,ok=429886,error=0, records=41
[INFO ] 2026-06-02 08:36:08.613 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21496/300s
[INFO ] 2026-06-02 08:36:09.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:36:22.593 [21183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:36:23.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-02 08:36:23.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429887,ok=429887,error=0, records=41
[INFO ] 2026-06-02 08:36:24.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:36:37.598 [21183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:36:38.624 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-02 08:36:38.624 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429888,ok=429888,error=0, records=41
[INFO ] 2026-06-02 08:36:39.240 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:36:50.579 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21505/300s
[WARN ] 2026-06-02 08:36:52.603 [21183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:36:53.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-02 08:36:53.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429889,ok=429889,error=0, records=41
[INFO ] 2026-06-02 08:36:54.241 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:36:59.395 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21496/300s
[WARN ] 2026-06-02 08:37:07.608 [21203] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:37:08.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 08:37:08.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429890,ok=429890,error=0, records=41
[INFO ] 2026-06-02 08:37:09.241 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:37:09.241 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21508/300s
[WARN ] 2026-06-02 08:37:22.614 [21198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:37:23.641 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 08:37:23.641 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429891,ok=429891,error=0, records=41
[INFO ] 2026-06-02 08:37:24.242 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:37:37.620 [21183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:37:38.646 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 08:37:38.646 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429892,ok=429892,error=0, records=41
[INFO ] 2026-06-02 08:37:39.242 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:37:52.625 [21183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:37:53.652 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 08:37:53.652 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429893,ok=429893,error=0, records=41
[INFO ] 2026-06-02 08:37:54.243 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:37:57.530 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21506/300s
[INFO ] 2026-06-02 08:37:59.331 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21506/300s
[INFO ] 2026-06-02 08:38:06.237 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21506/300s
[WARN ] 2026-06-02 08:38:07.630 [21198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:38:08.657 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 08:38:08.657 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429894,ok=429894,error=0, records=41
[INFO ] 2026-06-02 08:38:09.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:38:10.630 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849384},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:38:10.801 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:38:10.801 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 08:38:10.802 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:38:10.802 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:38:10.802 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:38:10.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:38:10.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:38:10.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:38:10.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:38:10.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:38:10.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:38:22.635 [21203] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:38:23.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 08:38:23.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429895,ok=429895,error=0, records=41
[INFO ] 2026-06-02 08:38:24.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:38:37.640 [21203] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:38:38.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-02 08:38:38.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429896,ok=429896,error=0, records=41
[INFO ] 2026-06-02 08:38:39.245 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:38:52.645 [21188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:38:53.673 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 08:38:53.673 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429897,ok=429897,error=0, records=41
[INFO ] 2026-06-02 08:38:54.245 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:38:54.246 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-02 08:39:07.651 [21198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:39:08.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 08:39:08.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429898,ok=429898,error=0, records=41
[INFO ] 2026-06-02 08:39:09.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:39:22.657 [21188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:39:23.684 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 08:39:23.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429899,ok=429899,error=0, records=41
[INFO ] 2026-06-02 08:39:24.248 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:39:37.661 [21188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:39:38.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 08:39:38.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429900,ok=429900,error=0, records=41
[INFO ] 2026-06-02 08:39:39.248 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:39:52.666 [21198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:39:53.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 08:39:53.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429901,ok=429901,error=0, records=41
[INFO ] 2026-06-02 08:39:54.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:40:01.895 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21510/300s
[WARN ] 2026-06-02 08:40:07.671 [21170] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:40:08.702 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-02 08:40:08.702 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429902,ok=429902,error=0, records=41
[INFO ] 2026-06-02 08:40:09.250 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:40:22.676 [21203] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:40:23.707 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 08:40:23.707 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429903,ok=429903,error=0, records=41
[INFO ] 2026-06-02 08:40:24.250 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:40:29.178 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21501/300s
[WARN ] 2026-06-02 08:40:37.682 [21198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:40:38.712 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 08:40:38.712 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429904,ok=429904,error=0, records=41
[INFO ] 2026-06-02 08:40:39.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:40:43.048 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21510/300s
[WARN ] 2026-06-02 08:40:52.688 [21188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:40:53.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 08:40:53.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429905,ok=429905,error=0, records=41
[INFO ] 2026-06-02 08:40:54.252 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:41:07.694 [21198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:41:08.795 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 08:41:08.795 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429906,ok=429906,error=0, records=41
[INFO ] 2026-06-02 08:41:08.796 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21497/300s
[INFO ] 2026-06-02 08:41:09.252 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:41:10.802 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17908/300s
[INFO ] 2026-06-02 08:41:10.803 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849320},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:41:10.978 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:41:10.978 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 08:41:10.979 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:41:10.979 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:41:10.979 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:41:11.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:41:11.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:41:11.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:41:11.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:41:11.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:41:11.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:41:22.699 [21170] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:41:23.801 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 08:41:23.801 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429907,ok=429907,error=0, records=41
[INFO ] 2026-06-02 08:41:24.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:41:37.705 [21198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:41:38.807 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 08:41:38.807 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429908,ok=429908,error=0, records=41
[INFO ] 2026-06-02 08:41:39.253 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:41:50.633 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21506/300s
[WARN ] 2026-06-02 08:41:52.710 [21203] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:41:53.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 08:41:53.814 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429909,ok=429909,error=0, records=41
[INFO ] 2026-06-02 08:41:54.254 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:41:59.569 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21497/300s
[WARN ] 2026-06-02 08:42:07.716 [21203] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:42:08.820 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-02 08:42:08.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429910,ok=429910,error=0, records=41
[INFO ] 2026-06-02 08:42:09.254 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:42:09.254 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21509/300s
[WARN ] 2026-06-02 08:42:22.721 [21183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:42:23.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-02 08:42:23.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429911,ok=429911,error=0, records=41
[INFO ] 2026-06-02 08:42:24.255 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:42:37.725 [21188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:42:38.831 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-02 08:42:38.831 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429912,ok=429912,error=0, records=41
[INFO ] 2026-06-02 08:42:39.255 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:42:52.730 [21183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:42:53.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-02 08:42:53.839 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429913,ok=429913,error=0, records=41
[INFO ] 2026-06-02 08:42:54.256 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:42:57.566 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21507/300s
[INFO ] 2026-06-02 08:42:59.367 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21507/300s
[INFO ] 2026-06-02 08:43:06.273 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21507/300s
[WARN ] 2026-06-02 08:43:07.735 [21170] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:43:08.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 08:43:08.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429914,ok=429914,error=0, records=41
[INFO ] 2026-06-02 08:43:09.257 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:43:22.740 [21198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:43:23.859 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 08:43:23.859 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429915,ok=429915,error=0, records=41
[INFO ] 2026-06-02 08:43:24.257 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:43:37.745 [21198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:43:38.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 08:43:38.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429916,ok=429916,error=0, records=41
[INFO ] 2026-06-02 08:43:39.258 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 08:43:39.258 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 08:43:52.752 [21170] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:43:53.870 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 08:43:53.870 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429917,ok=429917,error=0, records=41
[INFO ] 2026-06-02 08:43:54.258 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:44:07.757 [21183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:44:08.878 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 08:44:08.878 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429918,ok=429918,error=0, records=41
[INFO ] 2026-06-02 08:44:09.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:44:10.980 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849256},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:44:11.159 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:44:11.159 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 08:44:11.160 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:44:11.160 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:44:11.160 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:44:11.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:44:11.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:44:11.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:44:11.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:44:11.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:44:11.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:44:22.761 [21183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:44:23.883 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 08:44:23.883 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429919,ok=429919,error=0, records=41
[INFO ] 2026-06-02 08:44:24.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:44:37.766 [21170] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:44:38.888 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 08:44:38.888 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429920,ok=429920,error=0, records=41
[INFO ] 2026-06-02 08:44:39.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:44:52.772 [21170] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:44:53.893 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 08:44:53.893 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429921,ok=429921,error=0, records=41
[INFO ] 2026-06-02 08:44:54.261 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:45:01.899 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21511/300s
[WARN ] 2026-06-02 08:45:07.777 [21203] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:45:08.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-02 08:45:08.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429922,ok=429922,error=0, records=41
[INFO ] 2026-06-02 08:45:09.261 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:45:22.783 [21183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:45:23.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 08:45:23.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429923,ok=429923,error=0, records=41
[INFO ] 2026-06-02 08:45:24.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:45:29.285 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21502/300s
[WARN ] 2026-06-02 08:45:37.789 [21198] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:45:38.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 08:45:38.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429924,ok=429924,error=0, records=41
[INFO ] 2026-06-02 08:45:39.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:45:43.054 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21511/300s
[WARN ] 2026-06-02 08:45:52.793 [21203] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:45:53.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 08:45:53.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429925,ok=429925,error=0, records=41
[INFO ] 2026-06-02 08:45:54.263 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:46:07.798 [21170] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:46:08.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 08:46:08.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429926,ok=429926,error=0, records=41
[INFO ] 2026-06-02 08:46:08.925 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21498/300s
[INFO ] 2026-06-02 08:46:09.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:46:22.803 [21203] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:46:23.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 08:46:23.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429927,ok=429927,error=0, records=41
[INFO ] 2026-06-02 08:46:24.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:46:37.808 [21183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:46:38.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 08:46:38.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429928,ok=429928,error=0, records=41
[INFO ] 2026-06-02 08:46:39.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:46:50.686 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21507/300s
[WARN ] 2026-06-02 08:46:52.813 [21170] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:46:53.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 08:46:53.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429929,ok=429929,error=0, records=41
[INFO ] 2026-06-02 08:46:54.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:46:59.750 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21498/300s
[WARN ] 2026-06-02 08:47:07.818 [21750] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:47:08.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 08:47:08.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429930,ok=429930,error=0, records=41
[INFO ] 2026-06-02 08:47:09.266 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:47:09.266 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21510/300s
[INFO ] 2026-06-02 08:47:11.160 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17909/300s
[INFO ] 2026-06-02 08:47:11.161 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849188},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:47:11.320 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:47:11.321 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 08:47:11.321 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:47:11.321 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:47:11.321 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:47:11.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:47:11.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:47:11.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:47:11.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:47:11.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:47:11.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:47:22.823 [21780] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:47:23.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 08:47:23.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429931,ok=429931,error=0, records=41
[INFO ] 2026-06-02 08:47:24.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:47:37.827 [21188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:47:38.959 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 08:47:38.959 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429932,ok=429932,error=0, records=41
[INFO ] 2026-06-02 08:47:39.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:47:52.833 [21808] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:47:53.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 08:47:53.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429933,ok=429933,error=0, records=41
[INFO ] 2026-06-02 08:47:54.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:47:57.616 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21508/300s
[INFO ] 2026-06-02 08:47:59.418 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21508/300s
[INFO ] 2026-06-02 08:48:06.325 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21508/300s
[WARN ] 2026-06-02 08:48:07.838 [21822] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:48:08.971 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 08:48:08.971 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429934,ok=429934,error=0, records=41
[INFO ] 2026-06-02 08:48:09.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:48:22.843 [21831] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:48:23.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-02 08:48:23.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429935,ok=429935,error=0, records=41
[INFO ] 2026-06-02 08:48:24.269 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:48:37.849 [21808] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:48:38.986 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 08:48:38.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429936,ok=429936,error=0, records=41
[INFO ] 2026-06-02 08:48:39.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:48:52.853 [21808] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:48:54.020 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 08:48:54.020 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429937,ok=429937,error=0, records=41
[INFO ] 2026-06-02 08:48:54.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:49:07.859 [21808] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:49:09.026 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 08:49:09.026 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429938,ok=429938,error=0, records=41
[INFO ] 2026-06-02 08:49:09.271 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:49:22.864 [21845] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:49:24.031 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 08:49:24.031 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429939,ok=429939,error=0, records=41
[INFO ] 2026-06-02 08:49:24.272 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:49:37.869 [21859] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:49:39.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 08:49:39.037 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429940,ok=429940,error=0, records=41
[INFO ] 2026-06-02 08:49:39.272 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:49:52.873 [21873] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:49:54.042 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 08:49:54.042 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429941,ok=429941,error=0, records=41
[INFO ] 2026-06-02 08:49:54.273 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:50:01.902 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21512/300s
[WARN ] 2026-06-02 08:50:07.877 [21937] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:50:09.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10387, records=41
[INFO ] 2026-06-02 08:50:09.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429942,ok=429942,error=0, records=41
[INFO ] 2026-06-02 08:50:09.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:50:11.323 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849108},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:50:11.505 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:50:11.506 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 08:50:11.506 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:50:11.506 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:50:11.506 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:50:11.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:50:11.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:50:11.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:50:11.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:50:11.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:50:11.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:50:22.882 [21949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:50:24.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-02 08:50:24.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429943,ok=429943,error=0, records=41
[INFO ] 2026-06-02 08:50:24.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:50:29.384 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21503/300s
[WARN ] 2026-06-02 08:50:37.888 [21976] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:50:39.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-02 08:50:39.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429944,ok=429944,error=0, records=41
[INFO ] 2026-06-02 08:50:39.275 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:50:43.060 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21512/300s
[WARN ] 2026-06-02 08:50:52.893 [21982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:50:54.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-02 08:50:54.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429945,ok=429945,error=0, records=41
[INFO ] 2026-06-02 08:50:54.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:51:07.897 [22004] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:51:09.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 08:51:09.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429946,ok=429946,error=0, records=41
[INFO ] 2026-06-02 08:51:09.073 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21499/300s
[INFO ] 2026-06-02 08:51:09.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:51:22.903 [22004] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:51:24.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 08:51:24.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429947,ok=429947,error=0, records=41
[INFO ] 2026-06-02 08:51:24.277 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:51:37.909 [22037] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:51:39.084 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 08:51:39.084 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429948,ok=429948,error=0, records=41
[INFO ] 2026-06-02 08:51:39.277 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:51:50.742 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21508/300s
[WARN ] 2026-06-02 08:51:52.914 [22025] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:51:54.090 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 08:51:54.090 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429949,ok=429949,error=0, records=41
[INFO ] 2026-06-02 08:51:54.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:51:59.931 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21499/300s
[WARN ] 2026-06-02 08:52:07.919 [22065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:52:09.095 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-02 08:52:09.095 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429950,ok=429950,error=0, records=41
[INFO ] 2026-06-02 08:52:09.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:52:09.278 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21511/300s
[WARN ] 2026-06-02 08:52:22.925 [22071] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:52:24.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-02 08:52:24.106 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429951,ok=429951,error=0, records=41
[INFO ] 2026-06-02 08:52:24.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:52:37.930 [22107] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:52:39.111 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 08:52:39.111 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429952,ok=429952,error=0, records=41
[INFO ] 2026-06-02 08:52:39.280 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:52:52.935 [22096] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:52:54.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-02 08:52:54.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429953,ok=429953,error=0, records=41
[INFO ] 2026-06-02 08:52:54.280 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:52:57.676 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21509/300s
[INFO ] 2026-06-02 08:52:59.477 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21509/300s
[INFO ] 2026-06-02 08:53:06.384 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21509/300s
[WARN ] 2026-06-02 08:53:07.941 [22096] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:53:09.123 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 08:53:09.123 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429954,ok=429954,error=0, records=41
[INFO ] 2026-06-02 08:53:09.281 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:53:11.506 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17910/300s
[INFO ] 2026-06-02 08:53:11.508 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20849044},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:53:11.683 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:53:11.683 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 08:53:11.683 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:53:11.684 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:53:11.684 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:53:11.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:53:11.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:53:11.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:53:11.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:53:11.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:53:11.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:53:22.946 [22124] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:53:24.131 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 08:53:24.131 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429955,ok=429955,error=0, records=41
[INFO ] 2026-06-02 08:53:24.282 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:53:37.952 [22135] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:53:39.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 08:53:39.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429956,ok=429956,error=0, records=41
[INFO ] 2026-06-02 08:53:39.282 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 08:53:39.282 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 08:53:52.957 [22135] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:53:54.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 08:53:54.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429957,ok=429957,error=0, records=41
[INFO ] 2026-06-02 08:53:54.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:53:54.283 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-02 08:54:07.962 [22182] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:54:09.149 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 08:54:09.149 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429958,ok=429958,error=0, records=41
[INFO ] 2026-06-02 08:54:09.285 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:54:22.967 [22152] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:54:24.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 08:54:24.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429959,ok=429959,error=0, records=41
[INFO ] 2026-06-02 08:54:24.285 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:54:37.971 [22224] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:54:39.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 08:54:39.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429960,ok=429960,error=0, records=41
[INFO ] 2026-06-02 08:54:39.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:54:52.975 [22196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:54:54.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 08:54:54.172 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429961,ok=429961,error=0, records=41
[INFO ] 2026-06-02 08:54:54.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:55:01.906 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21513/300s
[WARN ] 2026-06-02 08:55:07.979 [22252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:55:09.178 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-02 08:55:09.178 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429962,ok=429962,error=0, records=41
[INFO ] 2026-06-02 08:55:09.287 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:55:22.985 [22224] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:55:24.184 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 08:55:24.185 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429963,ok=429963,error=0, records=41
[INFO ] 2026-06-02 08:55:24.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:55:29.487 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21504/300s
[WARN ] 2026-06-02 08:55:37.990 [22224] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:55:39.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-02 08:55:39.189 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429964,ok=429964,error=0, records=41
[INFO ] 2026-06-02 08:55:39.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:55:43.066 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21513/300s
[WARN ] 2026-06-02 08:55:52.996 [22252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:55:54.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-02 08:55:54.195 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429965,ok=429965,error=0, records=41
[INFO ] 2026-06-02 08:55:54.289 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:56:08.000 [22306] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:56:09.220 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-02 08:56:09.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429966,ok=429966,error=0, records=41
[INFO ] 2026-06-02 08:56:09.220 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21500/300s
[INFO ] 2026-06-02 08:56:09.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:56:11.685 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848984},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:56:11.850 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:56:11.850 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 08:56:11.850 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:56:11.850 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:56:11.850 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:56:11.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:56:11.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:56:11.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:56:11.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:56:11.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:56:11.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:56:23.006 [22279] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:56:24.226 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 08:56:24.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429967,ok=429967,error=0, records=41
[INFO ] 2026-06-02 08:56:24.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:56:38.010 [22335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:56:39.232 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 08:56:39.232 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429968,ok=429968,error=0, records=41
[INFO ] 2026-06-02 08:56:39.291 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:56:50.797 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21509/300s
[WARN ] 2026-06-02 08:56:53.015 [22306] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:56:54.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 08:56:54.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429969,ok=429969,error=0, records=41
[INFO ] 2026-06-02 08:56:54.291 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:57:00.116 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21500/300s
[WARN ] 2026-06-02 08:57:08.020 [22335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:57:09.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 08:57:09.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429970,ok=429970,error=0, records=41
[INFO ] 2026-06-02 08:57:09.292 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:57:09.292 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21512/300s
[WARN ] 2026-06-02 08:57:23.024 [22377] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:57:24.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 08:57:24.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429971,ok=429971,error=0, records=41
[INFO ] 2026-06-02 08:57:24.293 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:57:38.029 [22391] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:57:39.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 08:57:39.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429972,ok=429972,error=0, records=41
[INFO ] 2026-06-02 08:57:39.293 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:57:53.033 [22405] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:57:54.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 08:57:54.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429973,ok=429973,error=0, records=41
[INFO ] 2026-06-02 08:57:54.294 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:57:57.742 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21510/300s
[INFO ] 2026-06-02 08:57:59.543 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21510/300s
[INFO ] 2026-06-02 08:58:06.450 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21510/300s
[WARN ] 2026-06-02 08:58:08.041 [22363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:58:09.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 08:58:09.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429974,ok=429974,error=0, records=41
[INFO ] 2026-06-02 08:58:09.294 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:58:23.047 [22363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:58:24.271 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 08:58:24.271 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429975,ok=429975,error=0, records=41
[INFO ] 2026-06-02 08:58:24.295 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:58:38.052 [22442] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:58:39.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 08:58:39.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429976,ok=429976,error=0, records=41
[INFO ] 2026-06-02 08:58:39.296 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:58:52.557 [22469] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:58:54.282 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 08:58:54.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429977,ok=429977,error=0, records=41
[INFO ] 2026-06-02 08:58:54.296 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:59:07.562 [22493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:59:09.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 08:59:09.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429978,ok=429978,error=0, records=41
[INFO ] 2026-06-02 08:59:09.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:59:11.850 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17911/300s
[INFO ] 2026-06-02 08:59:11.852 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848916},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 08:59:12.040 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 08:59:12.040 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 08:59:12.040 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 08:59:12.040 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 08:59:12.040 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 08:59:12.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:59:12.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 08:59:12.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:59:12.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 08:59:12.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 08:59:12.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 08:59:22.568 [22507] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:59:24.296 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 08:59:24.296 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429979,ok=429979,error=0, records=41
[INFO ] 2026-06-02 08:59:24.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 08:59:37.574 [22493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:59:39.298 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=15[>=300 0/4]
[INFO ] 2026-06-02 08:59:39.301 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 08:59:39.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429980,ok=429980,error=0, records=41
[WARN ] 2026-06-02 08:59:52.578 [22501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 08:59:54.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 08:59:54.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 08:59:54.306 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429981,ok=429981,error=0, records=41
[INFO ] 2026-06-02 09:00:01.910 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21514/300s
[WARN ] 2026-06-02 09:00:07.584 [22561] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:00:09.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:00:09.312 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 09:00:09.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429982,ok=429982,error=0, records=41
[WARN ] 2026-06-02 09:00:22.589 [22568] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:00:24.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:00:24.317 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 09:00:24.317 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429983,ok=429983,error=0, records=41
[INFO ] 2026-06-02 09:00:29.591 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21505/300s
[WARN ] 2026-06-02 09:00:37.594 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:00:39.301 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:00:39.323 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 09:00:39.323 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429984,ok=429984,error=0, records=41
[INFO ] 2026-06-02 09:00:43.072 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21514/300s
[WARN ] 2026-06-02 09:00:52.599 [22587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:00:54.301 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:00:54.329 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 09:00:54.329 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429985,ok=429985,error=0, records=41
[WARN ] 2026-06-02 09:01:07.603 [22597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:01:09.302 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:01:09.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 09:01:09.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429986,ok=429986,error=0, records=41
[INFO ] 2026-06-02 09:01:09.335 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21501/300s
[WARN ] 2026-06-02 09:01:22.609 [22587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:01:24.303 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:01:24.341 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 09:01:24.341 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429987,ok=429987,error=0, records=41
[WARN ] 2026-06-02 09:01:37.614 [22602] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:01:39.303 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:01:39.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 09:01:39.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429988,ok=429988,error=0, records=41
[INFO ] 2026-06-02 09:01:50.856 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21510/300s
[WARN ] 2026-06-02 09:01:52.620 [22612] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:01:54.304 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:01:54.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 09:01:54.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429989,ok=429989,error=0, records=41
[INFO ] 2026-06-02 09:02:00.299 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21501/300s
[WARN ] 2026-06-02 09:02:07.625 [22612] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:02:09.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:02:09.305 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21513/300s
[INFO ] 2026-06-02 09:02:09.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 09:02:09.357 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429990,ok=429990,error=0, records=41
[INFO ] 2026-06-02 09:02:12.041 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848848},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:02:12.215 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:02:12.215 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 09:02:12.216 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:02:12.216 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:02:12.216 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:02:12.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:02:12.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:02:12.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:02:12.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:02:12.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:02:12.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:02:22.631 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:02:24.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:02:24.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 09:02:24.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429991,ok=429991,error=0, records=41
[WARN ] 2026-06-02 09:02:37.637 [22587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:02:39.306 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:02:39.369 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 09:02:39.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429992,ok=429992,error=0, records=41
[WARN ] 2026-06-02 09:02:52.642 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:02:54.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:02:54.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 09:02:54.377 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429993,ok=429993,error=0, records=41
[INFO ] 2026-06-02 09:02:57.803 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21511/300s
[INFO ] 2026-06-02 09:02:59.604 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21511/300s
[INFO ] 2026-06-02 09:03:06.512 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21511/300s
[WARN ] 2026-06-02 09:03:07.647 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:03:09.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:03:09.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 09:03:09.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429994,ok=429994,error=0, records=41
[WARN ] 2026-06-02 09:03:22.652 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:03:24.308 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:03:24.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-02 09:03:24.390 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429995,ok=429995,error=0, records=41
[WARN ] 2026-06-02 09:03:37.657 [22597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:03:39.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 09:03:39.309 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 09:03:39.395 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-02 09:03:39.395 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429996,ok=429996,error=0, records=41
[WARN ] 2026-06-02 09:03:52.663 [22612] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:03:54.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:03:54.401 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10144, records=41
[INFO ] 2026-06-02 09:03:54.401 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429997,ok=429997,error=0, records=41
[WARN ] 2026-06-02 09:04:07.668 [22597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:04:09.310 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:04:09.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 09:04:09.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429998,ok=429998,error=0, records=41
[WARN ] 2026-06-02 09:04:22.672 [22602] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:04:24.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:04:24.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 09:04:24.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=429999,ok=429999,error=0, records=41
[WARN ] 2026-06-02 09:04:37.678 [22602] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:04:39.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:04:39.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 09:04:39.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430000,ok=430000,error=0, records=41
[WARN ] 2026-06-02 09:04:52.683 [22597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:04:54.312 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:04:54.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 09:04:54.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430001,ok=430001,error=0, records=41
[INFO ] 2026-06-02 09:05:01.914 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21515/300s
[WARN ] 2026-06-02 09:05:07.687 [22612] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:05:09.312 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:05:09.431 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 09:05:09.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430002,ok=430002,error=0, records=41
[INFO ] 2026-06-02 09:05:12.216 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17912/300s
[INFO ] 2026-06-02 09:05:12.217 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848780},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:05:12.392 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:05:12.392 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 09:05:12.392 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:05:12.392 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:05:12.392 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:05:12.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:05:12.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:05:12.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:05:12.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:05:12.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:05:12.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:05:22.691 [22602] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:05:24.313 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:05:24.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 09:05:24.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430003,ok=430003,error=0, records=41
[INFO ] 2026-06-02 09:05:29.694 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21506/300s
[WARN ] 2026-06-02 09:05:37.696 [22587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:05:39.314 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:05:39.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 09:05:39.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430004,ok=430004,error=0, records=41
[INFO ] 2026-06-02 09:05:43.079 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21515/300s
[WARN ] 2026-06-02 09:05:52.701 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:05:54.314 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:05:54.449 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 09:05:54.449 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430005,ok=430005,error=0, records=41
[WARN ] 2026-06-02 09:06:07.706 [22612] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:06:09.315 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:06:09.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-02 09:06:09.456 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430006,ok=430006,error=0, records=41
[INFO ] 2026-06-02 09:06:09.456 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21502/300s
[WARN ] 2026-06-02 09:06:22.711 [22602] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:06:24.316 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:06:24.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10310, records=41
[INFO ] 2026-06-02 09:06:24.462 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430007,ok=430007,error=0, records=41
[WARN ] 2026-06-02 09:06:37.716 [22597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:06:39.316 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:06:39.467 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-02 09:06:39.467 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430008,ok=430008,error=0, records=41
[INFO ] 2026-06-02 09:06:50.912 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21511/300s
[WARN ] 2026-06-02 09:06:52.722 [22587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:06:54.317 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:06:54.473 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-02 09:06:54.473 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430009,ok=430009,error=0, records=41
[INFO ] 2026-06-02 09:07:00.483 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21502/300s
[WARN ] 2026-06-02 09:07:07.726 [22597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:07:09.317 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:07:09.317 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21514/300s
[INFO ] 2026-06-02 09:07:09.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 09:07:09.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430010,ok=430010,error=0, records=41
[WARN ] 2026-06-02 09:07:22.731 [22597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:07:24.318 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:07:24.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 09:07:24.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430011,ok=430011,error=0, records=41
[WARN ] 2026-06-02 09:07:37.736 [22587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:07:39.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:07:39.489 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 09:07:39.489 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430012,ok=430012,error=0, records=41
[WARN ] 2026-06-02 09:07:52.741 [22602] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:07:54.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:07:54.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 09:07:54.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430013,ok=430013,error=0, records=41
[INFO ] 2026-06-02 09:07:57.868 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21512/300s
[INFO ] 2026-06-02 09:07:59.670 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21512/300s
[INFO ] 2026-06-02 09:08:06.576 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21512/300s
[WARN ] 2026-06-02 09:08:07.746 [22612] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:08:09.320 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:08:09.501 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 09:08:09.501 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430014,ok=430014,error=0, records=41
[INFO ] 2026-06-02 09:08:12.394 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848720},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:08:12.577 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:08:12.577 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 09:08:12.577 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:08:12.577 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:08:12.577 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:08:12.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:08:12.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:08:12.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:08:12.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:08:12.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:08:12.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:08:22.750 [22587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:08:24.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:08:24.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 09:08:24.511 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430015,ok=430015,error=0, records=41
[WARN ] 2026-06-02 09:08:37.755 [22587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:08:39.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:08:39.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 09:08:39.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430016,ok=430016,error=0, records=41
[WARN ] 2026-06-02 09:08:52.760 [22612] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:08:54.322 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:08:54.322 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 09:08:54.521 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 09:08:54.521 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430017,ok=430017,error=0, records=41
[WARN ] 2026-06-02 09:09:07.765 [22597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:09:09.323 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:09:09.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 09:09:09.526 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430018,ok=430018,error=0, records=41
[WARN ] 2026-06-02 09:09:22.770 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:09:24.324 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=24.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:09:24.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-02 09:09:24.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430019,ok=430019,error=0, records=41
[WARN ] 2026-06-02 09:09:37.776 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:09:39.324 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:09:39.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-02 09:09:39.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430020,ok=430020,error=0, records=41
[WARN ] 2026-06-02 09:09:52.781 [22597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:09:54.325 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=24.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:09:54.560 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-02 09:09:54.560 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430021,ok=430021,error=0, records=41
[INFO ] 2026-06-02 09:10:01.917 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21516/300s
[WARN ] 2026-06-02 09:10:07.785 [22545] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:10:09.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:10:09.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 09:10:09.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430022,ok=430022,error=0, records=41
[WARN ] 2026-06-02 09:10:22.790 [22587] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:10:24.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:10:24.582 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 09:10:24.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430023,ok=430023,error=0, records=41
[INFO ] 2026-06-02 09:10:29.792 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21507/300s
[WARN ] 2026-06-02 09:10:37.796 [22597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:10:39.327 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:10:39.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 09:10:39.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430024,ok=430024,error=0, records=41
[INFO ] 2026-06-02 09:10:43.085 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21516/300s
[WARN ] 2026-06-02 09:10:52.800 [22597] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:10:54.327 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:10:54.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 09:10:54.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430025,ok=430025,error=0, records=41
[WARN ] 2026-06-02 09:11:07.805 [23171] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:11:09.328 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:11:09.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-02 09:11:09.602 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430026,ok=430026,error=0, records=41
[INFO ] 2026-06-02 09:11:09.602 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21503/300s
[INFO ] 2026-06-02 09:11:12.578 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17913/300s
[INFO ] 2026-06-02 09:11:12.579 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848612},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:11:12.748 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:11:12.748 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 09:11:12.749 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:11:12.749 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:11:12.749 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:11:12.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:11:12.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:11:12.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:11:12.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:11:12.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:11:12.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:11:22.810 [23182] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:11:24.329 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:11:24.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-02 09:11:24.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430027,ok=430027,error=0, records=41
[WARN ] 2026-06-02 09:11:37.816 [23182] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:11:39.329 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:11:39.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10142, records=41
[INFO ] 2026-06-02 09:11:39.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430028,ok=430028,error=0, records=41
[INFO ] 2026-06-02 09:11:50.969 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21512/300s
[WARN ] 2026-06-02 09:11:52.821 [23182] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:11:54.330 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:11:54.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10151, records=41
[INFO ] 2026-06-02 09:11:54.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430029,ok=430029,error=0, records=41
[INFO ] 2026-06-02 09:12:00.666 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21503/300s
[WARN ] 2026-06-02 09:12:07.826 [23197] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:12:09.330 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.01MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:12:09.330 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21515/300s
[INFO ] 2026-06-02 09:12:09.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 09:12:09.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430030,ok=430030,error=0, records=41
[WARN ] 2026-06-02 09:12:22.832 [23230] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:12:24.331 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:12:24.637 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 09:12:24.637 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430031,ok=430031,error=0, records=41
[WARN ] 2026-06-02 09:12:37.836 [23258] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:12:39.331 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:12:39.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 09:12:39.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430032,ok=430032,error=0, records=41
[WARN ] 2026-06-02 09:12:52.841 [23197] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:12:54.332 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:12:54.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 09:12:54.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430033,ok=430033,error=0, records=41
[INFO ] 2026-06-02 09:12:57.940 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21513/300s
[INFO ] 2026-06-02 09:12:59.742 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21513/300s
[INFO ] 2026-06-02 09:13:06.648 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21513/300s
[WARN ] 2026-06-02 09:13:07.847 [23282] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:13:09.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:13:09.654 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 09:13:09.654 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430034,ok=430034,error=0, records=41
[WARN ] 2026-06-02 09:13:22.852 [23296] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:13:24.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:13:24.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 09:13:24.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430035,ok=430035,error=0, records=41
[WARN ] 2026-06-02 09:13:37.858 [23312] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:13:39.334 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 09:13:39.334 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 09:13:39.819 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 09:13:39.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430036,ok=430036,error=0, records=41
[WARN ] 2026-06-02 09:13:52.862 [23268] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:13:54.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:13:54.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 09:13:54.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430037,ok=430037,error=0, records=41
[WARN ] 2026-06-02 09:14:07.866 [23340] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:14:09.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:14:09.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 09:14:09.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430038,ok=430038,error=0, records=41
[INFO ] 2026-06-02 09:14:12.750 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848544},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:14:12.929 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:14:12.929 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 09:14:12.929 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:14:12.929 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:14:12.929 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:14:12.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:14:12.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:14:12.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:14:12.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:14:12.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:14:12.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:14:22.871 [23282] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:14:24.336 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:14:24.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 09:14:24.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430039,ok=430039,error=0, records=41
[WARN ] 2026-06-02 09:14:37.877 [23374] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:14:39.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:14:39.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 09:14:39.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430040,ok=430040,error=0, records=41
[WARN ] 2026-06-02 09:14:52.882 [23391] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:14:54.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:14:54.846 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 09:14:54.846 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430041,ok=430041,error=0, records=41
[INFO ] 2026-06-02 09:15:01.921 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21517/300s
[WARN ] 2026-06-02 09:15:07.888 [23391] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:15:09.338 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:15:09.852 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 09:15:09.852 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430042,ok=430042,error=0, records=41
[WARN ] 2026-06-02 09:15:22.894 [23409] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:15:24.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:15:24.875 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 09:15:24.875 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430043,ok=430043,error=0, records=41
[INFO ] 2026-06-02 09:15:29.896 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21508/300s
[WARN ] 2026-06-02 09:15:37.899 [23438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:15:39.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:15:39.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 09:15:39.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430044,ok=430044,error=0, records=41
[INFO ] 2026-06-02 09:15:43.092 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21517/300s
[WARN ] 2026-06-02 09:15:52.904 [23459] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:15:54.340 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:15:54.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 09:15:54.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430045,ok=430045,error=0, records=41
[WARN ] 2026-06-02 09:16:07.908 [23476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:16:09.341 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:16:09.891 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 09:16:09.891 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430046,ok=430046,error=0, records=41
[INFO ] 2026-06-02 09:16:09.891 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21504/300s
[WARN ] 2026-06-02 09:16:22.913 [23493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:16:24.341 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:16:24.901 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 09:16:24.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430047,ok=430047,error=0, records=41
[WARN ] 2026-06-02 09:16:37.920 [23499] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:16:39.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:16:39.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 09:16:39.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430048,ok=430048,error=0, records=41
[INFO ] 2026-06-02 09:16:51.026 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21513/300s
[WARN ] 2026-06-02 09:16:52.926 [23516] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:16:54.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:16:54.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 09:16:54.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430049,ok=430049,error=0, records=41
[INFO ] 2026-06-02 09:17:00.851 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21504/300s
[WARN ] 2026-06-02 09:17:07.931 [23510] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:17:09.343 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:17:09.343 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21516/300s
[INFO ] 2026-06-02 09:17:09.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 09:17:09.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430050,ok=430050,error=0, records=41
[INFO ] 2026-06-02 09:17:12.929 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17914/300s
[INFO ] 2026-06-02 09:17:12.931 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848476},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:17:13.093 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:17:13.093 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 09:17:13.093 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:17:13.093 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:17:13.093 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:17:13.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:17:13.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:17:13.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:17:13.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:17:13.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:17:13.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:17:22.935 [23525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:17:24.344 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:17:24.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 09:17:24.991 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430051,ok=430051,error=0, records=41
[WARN ] 2026-06-02 09:17:37.941 [23535] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:17:39.344 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:17:39.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 09:17:39.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430052,ok=430052,error=0, records=41
[WARN ] 2026-06-02 09:17:52.947 [23510] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:17:54.345 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:17:55.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 09:17:55.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430053,ok=430053,error=0, records=41
[INFO ] 2026-06-02 09:17:58.007 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21514/300s
[INFO ] 2026-06-02 09:17:59.809 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21514/300s
[INFO ] 2026-06-02 09:18:06.715 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21514/300s
[WARN ] 2026-06-02 09:18:07.953 [23599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:18:09.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:18:10.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-02 09:18:10.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430054,ok=430054,error=0, records=41
[WARN ] 2026-06-02 09:18:22.958 [23613] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:18:24.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:18:25.020 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-02 09:18:25.020 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430055,ok=430055,error=0, records=41
[WARN ] 2026-06-02 09:18:37.963 [23627] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:18:39.347 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:18:40.027 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-02 09:18:40.027 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430056,ok=430056,error=0, records=41
[WARN ] 2026-06-02 09:18:52.967 [23599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:18:54.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:18:55.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10378, records=41
[INFO ] 2026-06-02 09:18:55.032 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430057,ok=430057,error=0, records=41
[WARN ] 2026-06-02 09:19:07.972 [23613] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:19:09.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:19:10.042 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 09:19:10.042 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430058,ok=430058,error=0, records=41
[WARN ] 2026-06-02 09:19:22.985 [23599] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:19:24.349 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:19:25.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 09:19:25.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430059,ok=430059,error=0, records=41
[WARN ] 2026-06-02 09:19:37.990 [23685] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:19:39.349 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:19:40.052 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 09:19:40.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430060,ok=430060,error=0, records=41
[WARN ] 2026-06-02 09:19:52.995 [23579] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:19:54.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:19:55.057 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 09:19:55.057 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430061,ok=430061,error=0, records=41
[INFO ] 2026-06-02 09:20:01.925 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21518/300s
[WARN ] 2026-06-02 09:20:08.000 [23685] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:20:09.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:20:10.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 09:20:10.063 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430062,ok=430062,error=0, records=41
[INFO ] 2026-06-02 09:20:13.095 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848408},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:20:13.253 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:20:13.254 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 09:20:13.254 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:20:13.254 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:20:13.254 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:20:13.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:20:13.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:20:13.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:20:13.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:20:13.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:20:13.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:20:23.005 [23732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:20:24.351 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:20:25.070 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-02 09:20:25.070 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430063,ok=430063,error=0, records=41
[INFO ] 2026-06-02 09:20:30.007 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21509/300s
[WARN ] 2026-06-02 09:20:38.010 [23671] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:20:39.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:20:40.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-02 09:20:40.077 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430064,ok=430064,error=0, records=41
[INFO ] 2026-06-02 09:20:43.098 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21518/300s
[WARN ] 2026-06-02 09:20:53.016 [23760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:20:54.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:20:55.082 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 09:20:55.082 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430065,ok=430065,error=0, records=41
[WARN ] 2026-06-02 09:21:08.021 [23760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:21:09.353 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:21:10.087 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 09:21:10.087 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430066,ok=430066,error=0, records=41
[INFO ] 2026-06-02 09:21:10.087 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21505/300s
[WARN ] 2026-06-02 09:21:23.027 [23774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:21:24.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:21:25.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 09:21:25.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430067,ok=430067,error=0, records=41
[WARN ] 2026-06-02 09:21:38.032 [23732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:21:39.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:21:40.098 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 09:21:40.098 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430068,ok=430068,error=0, records=41
[INFO ] 2026-06-02 09:21:51.081 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21514/300s
[WARN ] 2026-06-02 09:21:53.037 [23774] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:21:54.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:21:55.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-02 09:21:55.105 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430069,ok=430069,error=0, records=41
[INFO ] 2026-06-02 09:22:01.034 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21505/300s
[WARN ] 2026-06-02 09:22:08.042 [23830] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:22:09.356 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:22:09.356 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21517/300s
[INFO ] 2026-06-02 09:22:10.110 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 09:22:10.110 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430070,ok=430070,error=0, records=41
[WARN ] 2026-06-02 09:22:23.047 [23836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:22:24.356 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:22:25.116 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 09:22:25.116 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430071,ok=430071,error=0, records=41
[WARN ] 2026-06-02 09:22:38.053 [23831] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:22:39.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:22:40.122 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 09:22:40.122 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430072,ok=430072,error=0, records=41
[WARN ] 2026-06-02 09:22:52.558 [23881] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:22:54.358 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:22:55.127 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 09:22:55.127 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430073,ok=430073,error=0, records=41
[INFO ] 2026-06-02 09:22:58.100 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21515/300s
[INFO ] 2026-06-02 09:22:59.902 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21515/300s
[INFO ] 2026-06-02 09:23:06.790 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21515/300s
[WARN ] 2026-06-02 09:23:07.565 [23895] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:23:09.359 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:23:10.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-02 09:23:10.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430074,ok=430074,error=0, records=41
[INFO ] 2026-06-02 09:23:13.254 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17915/300s
[INFO ] 2026-06-02 09:23:13.256 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848340},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:23:13.437 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:23:13.437 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 09:23:13.437 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:23:13.437 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:23:13.437 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:23:13.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:23:13.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:23:13.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:23:13.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:23:13.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:23:13.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:23:22.571 [23896] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:23:24.359 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:23:25.140 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-02 09:23:25.140 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430075,ok=430075,error=0, records=41
[WARN ] 2026-06-02 09:23:37.577 [23915] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:23:39.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 09:23:39.360 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 09:23:40.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10306, records=41
[INFO ] 2026-06-02 09:23:40.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430076,ok=430076,error=0, records=41
[WARN ] 2026-06-02 09:23:52.583 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:23:54.361 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:23:54.361 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 09:23:55.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-02 09:23:55.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430077,ok=430077,error=0, records=41
[WARN ] 2026-06-02 09:24:07.589 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:24:09.362 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:24:10.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 09:24:10.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430078,ok=430078,error=0, records=41
[WARN ] 2026-06-02 09:24:22.593 [23915] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:24:24.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:24:25.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 09:24:25.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430079,ok=430079,error=0, records=41
[WARN ] 2026-06-02 09:24:37.598 [23959] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:24:39.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:24:40.200 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 09:24:40.200 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430080,ok=430080,error=0, records=41
[WARN ] 2026-06-02 09:24:52.602 [23998] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:24:54.364 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:24:55.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 09:24:55.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430081,ok=430081,error=0, records=41
[INFO ] 2026-06-02 09:25:01.928 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21519/300s
[WARN ] 2026-06-02 09:25:07.606 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:25:09.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:25:10.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 09:25:10.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430082,ok=430082,error=0, records=41
[WARN ] 2026-06-02 09:25:22.612 [23959] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:25:24.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:25:25.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 09:25:25.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430083,ok=430083,error=0, records=41
[INFO ] 2026-06-02 09:25:30.114 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21510/300s
[WARN ] 2026-06-02 09:25:37.617 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:25:39.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:25:40.225 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 09:25:40.225 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430084,ok=430084,error=0, records=41
[INFO ] 2026-06-02 09:25:43.104 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21519/300s
[WARN ] 2026-06-02 09:25:52.622 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:25:54.367 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:25:55.231 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 09:25:55.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430085,ok=430085,error=0, records=41
[WARN ] 2026-06-02 09:26:07.628 [23998] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:26:09.367 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:26:10.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-02 09:26:10.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430086,ok=430086,error=0, records=41
[INFO ] 2026-06-02 09:26:10.238 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21506/300s
[INFO ] 2026-06-02 09:26:13.439 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848280},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:26:13.611 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:26:13.611 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 09:26:13.612 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:26:13.612 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:26:13.612 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:26:13.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:26:13.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:26:13.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:26:13.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:26:13.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:26:13.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:26:22.634 [23965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:26:24.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:26:25.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10317, records=41
[INFO ] 2026-06-02 09:26:25.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430087,ok=430087,error=0, records=41
[WARN ] 2026-06-02 09:26:37.640 [23959] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:26:39.369 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:26:40.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10302, records=41
[INFO ] 2026-06-02 09:26:40.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430088,ok=430088,error=0, records=41
[INFO ] 2026-06-02 09:26:51.139 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21515/300s
[WARN ] 2026-06-02 09:26:52.644 [23965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:26:54.369 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:26:55.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-02 09:26:55.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430089,ok=430089,error=0, records=41
[INFO ] 2026-06-02 09:27:01.211 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21506/300s
[WARN ] 2026-06-02 09:27:07.649 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:27:09.370 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:27:09.370 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21518/300s
[INFO ] 2026-06-02 09:27:10.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 09:27:10.357 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430090,ok=430090,error=0, records=41
[WARN ] 2026-06-02 09:27:22.653 [23965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:27:24.371 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:27:25.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10151, records=41
[INFO ] 2026-06-02 09:27:25.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430091,ok=430091,error=0, records=41
[WARN ] 2026-06-02 09:27:37.658 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:27:39.371 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:27:40.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-02 09:27:40.368 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430092,ok=430092,error=0, records=41
[WARN ] 2026-06-02 09:27:52.662 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:27:54.372 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:27:55.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-02 09:27:55.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430093,ok=430093,error=0, records=41
[INFO ] 2026-06-02 09:27:58.168 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21516/300s
[INFO ] 2026-06-02 09:27:59.970 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21516/300s
[INFO ] 2026-06-02 09:28:06.848 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21516/300s
[WARN ] 2026-06-02 09:28:07.668 [23998] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:28:09.372 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:28:10.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 09:28:10.390 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430094,ok=430094,error=0, records=41
[WARN ] 2026-06-02 09:28:22.673 [23998] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:28:24.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:28:25.396 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 09:28:25.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430095,ok=430095,error=0, records=41
[WARN ] 2026-06-02 09:28:37.677 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:28:39.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:28:40.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 09:28:40.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430096,ok=430096,error=0, records=41
[WARN ] 2026-06-02 09:28:52.682 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:28:54.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:28:55.410 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 09:28:55.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430097,ok=430097,error=0, records=41
[WARN ] 2026-06-02 09:29:07.687 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:29:09.375 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:29:10.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 09:29:10.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430098,ok=430098,error=0, records=41
[INFO ] 2026-06-02 09:29:13.612 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17916/300s
[INFO ] 2026-06-02 09:29:13.613 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848212},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:29:13.787 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:29:13.787 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 09:29:13.787 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:29:13.787 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:29:13.787 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:29:13.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:29:13.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:29:13.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:29:13.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:29:13.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:29:13.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:29:22.694 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:29:24.375 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:29:25.421 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 09:29:25.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430099,ok=430099,error=0, records=41
[WARN ] 2026-06-02 09:29:37.699 [23998] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:29:39.376 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:29:40.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 09:29:40.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430100,ok=430100,error=0, records=41
[WARN ] 2026-06-02 09:29:52.703 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:29:54.377 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:29:55.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 09:29:55.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430101,ok=430101,error=0, records=41
[INFO ] 2026-06-02 09:30:01.932 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21520/300s
[WARN ] 2026-06-02 09:30:07.709 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:30:09.377 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:30:10.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 09:30:10.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430102,ok=430102,error=0, records=41
[WARN ] 2026-06-02 09:30:22.715 [23965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:30:24.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:30:25.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 09:30:25.448 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430103,ok=430103,error=0, records=41
[INFO ] 2026-06-02 09:30:30.217 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21511/300s
[WARN ] 2026-06-02 09:30:37.719 [23965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:30:39.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:30:40.453 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 09:30:40.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430104,ok=430104,error=0, records=41
[INFO ] 2026-06-02 09:30:43.111 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21520/300s
[WARN ] 2026-06-02 09:30:52.724 [23998] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:30:54.379 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:30:55.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 09:30:55.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430105,ok=430105,error=0, records=41
[WARN ] 2026-06-02 09:31:07.729 [23998] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:31:09.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:31:10.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 09:31:10.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430106,ok=430106,error=0, records=41
[INFO ] 2026-06-02 09:31:10.492 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21507/300s
[WARN ] 2026-06-02 09:31:22.734 [23965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:31:24.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:31:25.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 09:31:25.499 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430107,ok=430107,error=0, records=41
[WARN ] 2026-06-02 09:31:37.739 [23965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:31:39.381 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:31:40.504 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 09:31:40.504 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430108,ok=430108,error=0, records=41
[INFO ] 2026-06-02 09:31:51.194 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21516/300s
[WARN ] 2026-06-02 09:31:52.745 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:31:54.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:31:55.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 09:31:55.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430109,ok=430109,error=0, records=41
[INFO ] 2026-06-02 09:32:01.393 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21507/300s
[WARN ] 2026-06-02 09:32:07.750 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:32:09.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:32:09.382 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21519/300s
[INFO ] 2026-06-02 09:32:10.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 09:32:10.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430110,ok=430110,error=0, records=41
[INFO ] 2026-06-02 09:32:13.788 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848136},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:32:13.957 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:32:13.957 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 09:32:13.957 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:32:13.957 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:32:13.957 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:32:13.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:32:13.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:32:13.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:32:13.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:32:13.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:32:13.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:32:22.755 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:32:24.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:32:25.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 09:32:25.586 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430111,ok=430111,error=0, records=41
[WARN ] 2026-06-02 09:32:37.761 [23965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:32:39.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:32:40.591 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 09:32:40.591 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430112,ok=430112,error=0, records=41
[WARN ] 2026-06-02 09:32:52.766 [23965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:32:54.384 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:32:55.598 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 09:32:55.598 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430113,ok=430113,error=0, records=41
[INFO ] 2026-06-02 09:32:58.236 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21517/300s
[INFO ] 2026-06-02 09:33:00.038 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21517/300s
[INFO ] 2026-06-02 09:33:06.901 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21517/300s
[WARN ] 2026-06-02 09:33:07.771 [23965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:33:09.385 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:33:10.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 09:33:10.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430114,ok=430114,error=0, records=41
[WARN ] 2026-06-02 09:33:22.777 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:33:24.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:33:25.607 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 09:33:25.607 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430115,ok=430115,error=0, records=41
[WARN ] 2026-06-02 09:33:37.782 [23998] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:33:39.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 09:33:39.386 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 09:33:40.612 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 09:33:40.612 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430116,ok=430116,error=0, records=41
[WARN ] 2026-06-02 09:33:52.787 [23965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:33:54.387 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:33:55.617 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 09:33:55.617 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430117,ok=430117,error=0, records=41
[WARN ] 2026-06-02 09:34:07.793 [23998] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:34:09.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:34:10.622 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-02 09:34:10.622 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430118,ok=430118,error=0, records=41
[WARN ] 2026-06-02 09:34:22.798 [23947] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:34:24.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:34:25.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-02 09:34:25.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430119,ok=430119,error=0, records=41
[WARN ] 2026-06-02 09:34:37.802 [23959] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:34:39.389 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:34:40.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10141, records=41
[INFO ] 2026-06-02 09:34:40.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430120,ok=430120,error=0, records=41
[WARN ] 2026-06-02 09:34:52.808 [24531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:34:54.390 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:34:55.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10140, records=41
[INFO ] 2026-06-02 09:34:55.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430121,ok=430121,error=0, records=41
[INFO ] 2026-06-02 09:35:01.935 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21521/300s
[WARN ] 2026-06-02 09:35:07.813 [24541] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:35:09.390 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:35:10.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 09:35:10.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430122,ok=430122,error=0, records=41
[INFO ] 2026-06-02 09:35:13.957 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17917/300s
[INFO ] 2026-06-02 09:35:13.959 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848076},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:35:14.116 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:35:14.116 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 09:35:14.116 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:35:14.116 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:35:14.116 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:35:14.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:35:14.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:35:14.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:35:14.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:35:14.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:35:14.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:35:22.820 [24547] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:35:24.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:35:25.658 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 09:35:25.658 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430123,ok=430123,error=0, records=41
[INFO ] 2026-06-02 09:35:30.323 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21512/300s
[WARN ] 2026-06-02 09:35:37.826 [24531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:35:39.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:35:40.664 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 09:35:40.664 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430124,ok=430124,error=0, records=41
[INFO ] 2026-06-02 09:35:43.117 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21521/300s
[WARN ] 2026-06-02 09:35:52.831 [24591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:35:54.392 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:35:55.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 09:35:55.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430125,ok=430125,error=0, records=41
[WARN ] 2026-06-02 09:36:07.836 [24577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:36:09.393 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:36:10.684 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 09:36:10.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430126,ok=430126,error=0, records=41
[INFO ] 2026-06-02 09:36:10.684 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21508/300s
[WARN ] 2026-06-02 09:36:22.842 [24591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:36:24.393 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:36:25.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 09:36:25.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430127,ok=430127,error=0, records=41
[WARN ] 2026-06-02 09:36:37.847 [24547] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:36:39.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:36:40.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 09:36:40.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430128,ok=430128,error=0, records=41
[INFO ] 2026-06-02 09:36:51.249 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21517/300s
[WARN ] 2026-06-02 09:36:52.851 [24627] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:36:54.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:36:55.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 09:36:55.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430129,ok=430129,error=0, records=41
[INFO ] 2026-06-02 09:37:01.576 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21508/300s
[WARN ] 2026-06-02 09:37:07.856 [24547] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:37:09.395 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:37:09.395 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21520/300s
[INFO ] 2026-06-02 09:37:10.705 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 09:37:10.705 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430130,ok=430130,error=0, records=41
[WARN ] 2026-06-02 09:37:22.862 [24557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:37:24.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:37:25.724 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 09:37:25.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430131,ok=430131,error=0, records=41
[WARN ] 2026-06-02 09:37:37.866 [24697] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:37:39.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:37:40.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 09:37:40.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430132,ok=430132,error=0, records=41
[WARN ] 2026-06-02 09:37:52.871 [24655] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:37:54.397 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:37:55.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 09:37:55.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430133,ok=430133,error=0, records=41
[INFO ] 2026-06-02 09:37:58.310 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21518/300s
[INFO ] 2026-06-02 09:38:00.112 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21518/300s
[INFO ] 2026-06-02 09:38:06.958 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21518/300s
[WARN ] 2026-06-02 09:38:07.877 [24726] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:38:09.398 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:38:10.750 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 09:38:10.750 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430134,ok=430134,error=0, records=41
[INFO ] 2026-06-02 09:38:14.118 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20848008},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:38:14.286 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:38:14.286 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 09:38:14.287 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:38:14.287 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:38:14.287 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:38:14.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:38:14.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:38:14.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:38:14.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:38:14.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:38:14.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:38:22.884 [24737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:38:24.398 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:38:25.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 09:38:25.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430135,ok=430135,error=0, records=41
[WARN ] 2026-06-02 09:38:37.888 [24765] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:38:39.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:38:40.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 09:38:40.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430136,ok=430136,error=0, records=41
[WARN ] 2026-06-02 09:38:52.894 [24744] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:38:54.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:38:54.400 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 09:38:55.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 09:38:55.767 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430137,ok=430137,error=0, records=41
[WARN ] 2026-06-02 09:39:07.901 [24765] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:39:09.401 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:39:10.775 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 09:39:10.775 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430138,ok=430138,error=0, records=41
[WARN ] 2026-06-02 09:39:22.906 [24811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:39:24.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:39:25.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 09:39:25.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430139,ok=430139,error=0, records=41
[WARN ] 2026-06-02 09:39:37.911 [24817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:39:39.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:39:40.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 09:39:40.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430140,ok=430140,error=0, records=41
[WARN ] 2026-06-02 09:39:52.916 [24817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:39:54.403 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:39:55.792 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 09:39:55.792 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430141,ok=430141,error=0, records=41
[INFO ] 2026-06-02 09:40:01.939 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21522/300s
[WARN ] 2026-06-02 09:40:07.922 [24863] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:40:09.404 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:40:10.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 09:40:10.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430142,ok=430142,error=0, records=41
[WARN ] 2026-06-02 09:40:22.927 [24869] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:40:24.404 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:40:25.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 09:40:25.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430143,ok=430143,error=0, records=41
[INFO ] 2026-06-02 09:40:30.429 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21513/300s
[WARN ] 2026-06-02 09:40:37.932 [24869] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:40:39.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:40:40.809 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 09:40:40.809 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430144,ok=430144,error=0, records=41
[INFO ] 2026-06-02 09:40:43.124 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21522/300s
[WARN ] 2026-06-02 09:40:52.938 [24806] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:40:54.406 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:40:55.816 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 09:40:55.816 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430145,ok=430145,error=0, records=41
[WARN ] 2026-06-02 09:41:07.944 [24874] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:41:09.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:41:10.820 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 09:41:10.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430146,ok=430146,error=0, records=41
[INFO ] 2026-06-02 09:41:10.820 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21509/300s
[INFO ] 2026-06-02 09:41:14.287 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17918/300s
[INFO ] 2026-06-02 09:41:14.288 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847944},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:41:14.455 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:41:14.455 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 09:41:14.455 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:41:14.455 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:41:14.455 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:41:14.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:41:14.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:41:14.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:41:14.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:41:14.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:41:14.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:41:22.949 [24892] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:41:24.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:41:25.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 09:41:25.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430147,ok=430147,error=0, records=41
[WARN ] 2026-06-02 09:41:37.954 [24954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:41:39.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:41:40.843 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 09:41:40.843 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430148,ok=430148,error=0, records=41
[INFO ] 2026-06-02 09:41:51.307 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21518/300s
[WARN ] 2026-06-02 09:41:52.958 [24954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:41:54.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:41:55.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 09:41:55.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430149,ok=430149,error=0, records=41
[INFO ] 2026-06-02 09:42:01.761 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21509/300s
[WARN ] 2026-06-02 09:42:07.962 [24968] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:42:09.409 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:42:09.409 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21521/300s
[INFO ] 2026-06-02 09:42:10.860 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 09:42:10.860 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430150,ok=430150,error=0, records=41
[WARN ] 2026-06-02 09:42:22.967 [24996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:42:24.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:42:25.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 09:42:25.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430151,ok=430151,error=0, records=41
[WARN ] 2026-06-02 09:42:37.972 [24982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:42:39.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:42:40.871 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 09:42:40.871 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430152,ok=430152,error=0, records=41
[WARN ] 2026-06-02 09:42:52.978 [24933] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:42:54.411 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:42:55.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 09:42:55.878 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430153,ok=430153,error=0, records=41
[INFO ] 2026-06-02 09:42:58.390 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21519/300s
[INFO ] 2026-06-02 09:43:00.192 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21519/300s
[INFO ] 2026-06-02 09:43:07.017 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21519/300s
[WARN ] 2026-06-02 09:43:07.982 [25036] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:43:09.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:43:10.883 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 09:43:10.883 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430154,ok=430154,error=0, records=41
[WARN ] 2026-06-02 09:43:22.987 [24968] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:43:24.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:43:25.891 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 09:43:25.891 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430155,ok=430155,error=0, records=41
[WARN ] 2026-06-02 09:43:37.993 [25036] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:43:39.413 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 09:43:39.413 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 09:43:40.896 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 09:43:40.896 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430156,ok=430156,error=0, records=41
[WARN ] 2026-06-02 09:43:52.999 [24968] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:43:54.413 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:43:55.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 09:43:55.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430157,ok=430157,error=0, records=41
[WARN ] 2026-06-02 09:44:08.005 [25091] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:44:09.414 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:44:10.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 09:44:10.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430158,ok=430158,error=0, records=41
[INFO ] 2026-06-02 09:44:14.457 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847876},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:44:14.625 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:44:14.625 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 09:44:14.625 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:44:14.626 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:44:14.626 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:44:14.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:44:14.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:44:14.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:44:14.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:44:14.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:44:14.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:44:23.011 [24968] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:44:24.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:44:25.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 09:44:25.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430159,ok=430159,error=0, records=41
[WARN ] 2026-06-02 09:44:38.015 [25091] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:44:39.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:44:40.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 09:44:40.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430160,ok=430160,error=0, records=41
[WARN ] 2026-06-02 09:44:53.020 [25050] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:44:54.416 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:44:55.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 09:44:55.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430161,ok=430161,error=0, records=41
[INFO ] 2026-06-02 09:45:01.943 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21523/300s
[WARN ] 2026-06-02 09:45:08.025 [25064] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:45:09.417 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:45:10.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 09:45:10.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430162,ok=430162,error=0, records=41
[WARN ] 2026-06-02 09:45:23.030 [25064] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:45:24.417 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:45:25.942 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 09:45:25.942 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430163,ok=430163,error=0, records=41
[INFO ] 2026-06-02 09:45:30.532 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21514/300s
[WARN ] 2026-06-02 09:45:38.034 [24968] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:45:39.418 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:45:40.947 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 09:45:40.947 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430164,ok=430164,error=0, records=41
[INFO ] 2026-06-02 09:45:43.131 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21523/300s
[WARN ] 2026-06-02 09:45:53.040 [25147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:45:54.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:45:55.952 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 09:45:55.952 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430165,ok=430165,error=0, records=41
[WARN ] 2026-06-02 09:46:08.047 [25212] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:46:09.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:46:10.958 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 09:46:10.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430166,ok=430166,error=0, records=41
[INFO ] 2026-06-02 09:46:10.958 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21510/300s
[WARN ] 2026-06-02 09:46:23.052 [25209] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:46:24.420 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:46:25.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 09:46:25.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430167,ok=430167,error=0, records=41
[WARN ] 2026-06-02 09:46:37.558 [25246] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:46:39.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:46:40.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 09:46:40.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430168,ok=430168,error=0, records=41
[INFO ] 2026-06-02 09:46:51.363 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21519/300s
[WARN ] 2026-06-02 09:46:52.563 [25261] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:46:54.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:46:55.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 09:46:55.973 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430169,ok=430169,error=0, records=41
[INFO ] 2026-06-02 09:47:01.946 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21510/300s
[WARN ] 2026-06-02 09:47:07.568 [25252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:47:09.422 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:47:09.422 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21522/300s
[INFO ] 2026-06-02 09:47:10.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 09:47:10.979 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430170,ok=430170,error=0, records=41
[INFO ] 2026-06-02 09:47:14.626 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17919/300s
[INFO ] 2026-06-02 09:47:14.627 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847812},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:47:14.781 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:47:14.781 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-02 09:47:14.781 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:47:14.781 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:47:14.781 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:47:14.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:47:14.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:47:14.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:47:14.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:47:14.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:47:14.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:47:22.573 [25282] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:47:24.422 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:47:25.984 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 09:47:25.984 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430171,ok=430171,error=0, records=41
[WARN ] 2026-06-02 09:47:37.577 [25293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:47:39.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:47:40.990 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 09:47:40.990 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430172,ok=430172,error=0, records=41
[WARN ] 2026-06-02 09:47:52.583 [25328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:47:54.424 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:47:55.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 09:47:55.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430173,ok=430173,error=0, records=41
[INFO ] 2026-06-02 09:47:58.463 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21520/300s
[INFO ] 2026-06-02 09:48:00.265 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21520/300s
[INFO ] 2026-06-02 09:48:07.070 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21520/300s
[WARN ] 2026-06-02 09:48:07.589 [25333] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:48:09.424 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:48:11.000 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-06-02 09:48:11.000 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430174,ok=430174,error=0, records=41
[WARN ] 2026-06-02 09:48:22.593 [25334] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:48:24.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:48:26.009 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 09:48:26.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430175,ok=430175,error=0, records=41
[WARN ] 2026-06-02 09:48:37.598 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:48:39.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:48:41.013 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-02 09:48:41.013 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430176,ok=430176,error=0, records=41
[WARN ] 2026-06-02 09:48:52.603 [25378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:48:54.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:48:56.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-02 09:48:56.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430177,ok=430177,error=0, records=41
[WARN ] 2026-06-02 09:49:07.610 [25378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:49:09.427 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:49:11.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 09:49:11.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430178,ok=430178,error=0, records=41
[WARN ] 2026-06-02 09:49:22.615 [25383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:49:24.427 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:49:26.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 09:49:26.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430179,ok=430179,error=0, records=41
[WARN ] 2026-06-02 09:49:37.620 [25378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:49:39.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:49:41.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 09:49:41.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430180,ok=430180,error=0, records=41
[WARN ] 2026-06-02 09:49:52.626 [25383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:49:54.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:49:56.043 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 09:49:56.043 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430181,ok=430181,error=0, records=41
[INFO ] 2026-06-02 09:50:01.947 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21524/300s
[WARN ] 2026-06-02 09:50:07.633 [25378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:50:09.429 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:50:11.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 09:50:11.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430182,ok=430182,error=0, records=41
[INFO ] 2026-06-02 09:50:14.783 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847744},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:50:14.928 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:50:14.928 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 09:50:14.928 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:50:14.928 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:50:14.928 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:50:14.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:50:14.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:50:14.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:50:14.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:50:14.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:50:14.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:50:22.637 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:50:24.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:50:26.054 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-02 09:50:26.054 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430183,ok=430183,error=0, records=41
[INFO ] 2026-06-02 09:50:30.639 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21515/300s
[WARN ] 2026-06-02 09:50:37.642 [25368] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:50:39.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:50:41.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-02 09:50:41.156 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430184,ok=430184,error=0, records=41
[INFO ] 2026-06-02 09:50:43.136 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21524/300s
[WARN ] 2026-06-02 09:50:52.646 [25378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:50:54.431 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:50:56.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 09:50:56.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430185,ok=430185,error=0, records=41
[WARN ] 2026-06-02 09:51:07.652 [25368] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:51:09.432 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:51:11.166 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 09:51:11.167 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430186,ok=430186,error=0, records=41
[INFO ] 2026-06-02 09:51:11.167 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21511/300s
[WARN ] 2026-06-02 09:51:22.660 [25383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:51:24.432 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:51:26.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 09:51:26.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430187,ok=430187,error=0, records=41
[WARN ] 2026-06-02 09:51:37.664 [25327] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:51:39.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:51:41.177 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 09:51:41.177 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430188,ok=430188,error=0, records=41
[INFO ] 2026-06-02 09:51:51.417 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21520/300s
[WARN ] 2026-06-02 09:51:52.668 [25368] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:51:54.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:51:56.185 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 09:51:56.185 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430189,ok=430189,error=0, records=41
[INFO ] 2026-06-02 09:52:02.126 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21511/300s
[WARN ] 2026-06-02 09:52:07.673 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:52:09.434 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:52:09.434 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21523/300s
[INFO ] 2026-06-02 09:52:11.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 09:52:11.189 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430190,ok=430190,error=0, records=41
[WARN ] 2026-06-02 09:52:22.677 [25327] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:52:24.435 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:52:26.195 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 09:52:26.195 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430191,ok=430191,error=0, records=41
[WARN ] 2026-06-02 09:52:37.681 [25383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:52:39.435 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:52:41.200 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 09:52:41.200 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430192,ok=430192,error=0, records=41
[WARN ] 2026-06-02 09:52:52.686 [25383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:52:54.436 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:52:56.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 09:52:56.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430193,ok=430193,error=0, records=41
[INFO ] 2026-06-02 09:52:58.506 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21521/300s
[INFO ] 2026-06-02 09:53:00.308 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21521/300s
[INFO ] 2026-06-02 09:53:07.114 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21521/300s
[WARN ] 2026-06-02 09:53:07.691 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:53:09.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:53:11.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 09:53:11.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430194,ok=430194,error=0, records=41
[INFO ] 2026-06-02 09:53:14.928 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17920/300s
[INFO ] 2026-06-02 09:53:14.930 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847684},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:53:15.097 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:53:15.097 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 09:53:15.097 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:53:15.097 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:53:15.097 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:53:15.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:53:15.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:53:15.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:53:15.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:53:15.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:53:15.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:53:22.696 [25327] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:53:24.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:53:26.216 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 09:53:26.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430195,ok=430195,error=0, records=41
[WARN ] 2026-06-02 09:53:37.701 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:53:39.438 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 09:53:39.438 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 09:53:41.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 09:53:41.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430196,ok=430196,error=0, records=41
[WARN ] 2026-06-02 09:53:52.706 [25368] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:53:54.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:53:54.439 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 09:53:56.228 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 09:53:56.228 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430197,ok=430197,error=0, records=41
[WARN ] 2026-06-02 09:54:07.713 [25383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:54:09.440 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:54:11.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 09:54:11.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430198,ok=430198,error=0, records=41
[WARN ] 2026-06-02 09:54:22.718 [25327] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:54:24.441 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:54:26.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 09:54:26.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430199,ok=430199,error=0, records=41
[WARN ] 2026-06-02 09:54:37.723 [25378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:54:39.442 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:54:41.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 09:54:41.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430200,ok=430200,error=0, records=41
[WARN ] 2026-06-02 09:54:52.728 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:54:54.442 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:54:56.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 09:54:56.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430201,ok=430201,error=0, records=41
[INFO ] 2026-06-02 09:55:01.950 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21525/300s
[WARN ] 2026-06-02 09:55:07.732 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:55:09.443 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:55:11.255 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 09:55:11.255 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430202,ok=430202,error=0, records=41
[WARN ] 2026-06-02 09:55:22.737 [25378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:55:24.443 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:55:26.261 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 09:55:26.261 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430203,ok=430203,error=0, records=41
[INFO ] 2026-06-02 09:55:30.739 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21516/300s
[WARN ] 2026-06-02 09:55:37.742 [25378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:55:39.444 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:55:41.267 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 09:55:41.267 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430204,ok=430204,error=0, records=41
[INFO ] 2026-06-02 09:55:43.143 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21525/300s
[WARN ] 2026-06-02 09:55:52.746 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:55:54.445 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:55:56.272 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 09:55:56.272 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430205,ok=430205,error=0, records=41
[WARN ] 2026-06-02 09:56:07.751 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:56:09.445 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:56:11.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 09:56:11.277 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430206,ok=430206,error=0, records=41
[INFO ] 2026-06-02 09:56:11.277 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21512/300s
[INFO ] 2026-06-02 09:56:15.099 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847608},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:56:15.264 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:56:15.264 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 09:56:15.264 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:56:15.264 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:56:15.264 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:56:15.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:56:15.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:56:15.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:56:15.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:56:15.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:56:15.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:56:22.757 [25368] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:56:24.446 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:56:26.284 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 09:56:26.284 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430207,ok=430207,error=0, records=41
[WARN ] 2026-06-02 09:56:37.762 [25383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:56:39.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:56:41.288 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 09:56:41.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430208,ok=430208,error=0, records=41
[INFO ] 2026-06-02 09:56:51.472 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21521/300s
[WARN ] 2026-06-02 09:56:52.766 [25327] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:56:54.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:56:56.296 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-02 09:56:56.296 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430209,ok=430209,error=0, records=41
[INFO ] 2026-06-02 09:57:02.304 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21512/300s
[WARN ] 2026-06-02 09:57:07.771 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:57:09.448 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:57:09.448 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21524/300s
[INFO ] 2026-06-02 09:57:11.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 09:57:11.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430210,ok=430210,error=0, records=41
[WARN ] 2026-06-02 09:57:22.775 [25327] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:57:24.449 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:57:26.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 09:57:26.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430211,ok=430211,error=0, records=41
[WARN ] 2026-06-02 09:57:37.780 [25383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:57:39.449 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:57:41.313 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 09:57:41.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430212,ok=430212,error=0, records=41
[WARN ] 2026-06-02 09:57:52.784 [25383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:57:54.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:57:56.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 09:57:56.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430213,ok=430213,error=0, records=41
[INFO ] 2026-06-02 09:57:58.572 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21522/300s
[INFO ] 2026-06-02 09:58:00.374 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21522/300s
[INFO ] 2026-06-02 09:58:07.179 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21522/300s
[WARN ] 2026-06-02 09:58:07.789 [25383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:58:09.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:58:11.324 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 09:58:11.324 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430214,ok=430214,error=0, records=41
[WARN ] 2026-06-02 09:58:22.795 [25327] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:58:24.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:58:26.332 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 09:58:26.332 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430215,ok=430215,error=0, records=41
[WARN ] 2026-06-02 09:58:37.799 [25383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:58:39.452 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:58:41.344 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 09:58:41.344 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430216,ok=430216,error=0, records=41
[WARN ] 2026-06-02 09:58:52.804 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:58:54.452 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:58:56.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 09:58:56.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430217,ok=430217,error=0, records=41
[WARN ] 2026-06-02 09:59:07.810 [25345] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:59:09.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:59:11.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 09:59:11.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430218,ok=430218,error=0, records=41
[INFO ] 2026-06-02 09:59:15.265 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17921/300s
[INFO ] 2026-06-02 09:59:15.266 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847548},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 09:59:15.435 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 09:59:15.436 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 09:59:15.436 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 09:59:15.436 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 09:59:15.436 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 09:59:15.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:59:15.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 09:59:15.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:59:15.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 09:59:15.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 09:59:15.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 09:59:22.815 [25378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:59:24.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:59:26.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 09:59:26.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430219,ok=430219,error=0, records=41
[WARN ] 2026-06-02 09:59:37.819 [25327] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:59:39.454 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:59:41.379 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 09:59:41.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430220,ok=430220,error=0, records=41
[WARN ] 2026-06-02 09:59:52.824 [25957] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 09:59:54.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 09:59:56.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 09:59:56.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430221,ok=430221,error=0, records=41
[INFO ] 2026-06-02 10:00:01.954 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21526/300s
[WARN ] 2026-06-02 10:00:07.829 [25971] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:00:09.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:00:11.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-02 10:00:11.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430222,ok=430222,error=0, records=41
[WARN ] 2026-06-02 10:00:22.836 [26018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:00:24.456 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:00:26.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-02 10:00:26.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430223,ok=430223,error=0, records=41
[INFO ] 2026-06-02 10:00:30.839 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21517/300s
[WARN ] 2026-06-02 10:00:37.842 [26018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:00:39.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:00:41.405 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-02 10:00:41.405 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430224,ok=430224,error=0, records=41
[INFO ] 2026-06-02 10:00:43.150 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21526/300s
[WARN ] 2026-06-02 10:00:52.847 [25971] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:00:54.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:00:56.410 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-02 10:00:56.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430225,ok=430225,error=0, records=41
[WARN ] 2026-06-02 10:01:07.851 [26068] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:01:09.458 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:01:11.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-02 10:01:11.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430226,ok=430226,error=0, records=41
[INFO ] 2026-06-02 10:01:11.415 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21513/300s
[WARN ] 2026-06-02 10:01:22.857 [26004] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:01:24.458 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:01:26.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-02 10:01:26.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430227,ok=430227,error=0, records=41
[WARN ] 2026-06-02 10:01:37.862 [26096] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:01:39.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:01:41.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-02 10:01:41.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430228,ok=430228,error=0, records=41
[INFO ] 2026-06-02 10:01:51.530 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21522/300s
[WARN ] 2026-06-02 10:01:52.866 [26004] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:01:54.460 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:01:56.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 10:01:56.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430229,ok=430229,error=0, records=41
[INFO ] 2026-06-02 10:02:02.489 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21513/300s
[WARN ] 2026-06-02 10:02:07.870 [26028] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:02:09.460 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:02:09.460 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21525/300s
[INFO ] 2026-06-02 10:02:11.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 10:02:11.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430230,ok=430230,error=0, records=41
[INFO ] 2026-06-02 10:02:15.437 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847476},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:02:15.597 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:02:15.597 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[WARN ] 2026-06-02 10:02:22.876 [26140] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:02:24.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:02:26.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 10:02:26.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430231,ok=430231,error=0, records=41
[WARN ] 2026-06-02 10:02:37.882 [26162] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:02:39.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:02:41.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 10:02:41.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430232,ok=430232,error=0, records=41
[WARN ] 2026-06-02 10:02:52.889 [26179] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:02:54.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:02:56.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 10:02:56.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430233,ok=430233,error=0, records=41
[INFO ] 2026-06-02 10:02:58.637 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21523/300s
[INFO ] 2026-06-02 10:03:00.438 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21523/300s
[INFO ] 2026-06-02 10:03:07.244 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21523/300s
[WARN ] 2026-06-02 10:03:07.894 [26195] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:03:09.463 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:03:11.466 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 10:03:11.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430234,ok=430234,error=0, records=41
[WARN ] 2026-06-02 10:03:22.900 [26206] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:03:24.463 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:03:26.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 10:03:26.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430235,ok=430235,error=0, records=41
[WARN ] 2026-06-02 10:03:37.907 [26229] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:03:39.464 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 10:03:39.464 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 10:03:41.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 10:03:41.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430236,ok=430236,error=0, records=41
[WARN ] 2026-06-02 10:03:52.913 [26246] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:03:54.465 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:03:56.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 10:03:56.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430237,ok=430237,error=0, records=41
[WARN ] 2026-06-02 10:04:07.918 [26260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:04:09.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:04:11.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 10:04:11.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430238,ok=430238,error=0, records=41
[WARN ] 2026-06-02 10:04:22.923 [26271] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:04:24.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:04:26.492 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 10:04:26.492 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430239,ok=430239,error=0, records=41
[WARN ] 2026-06-02 10:04:37.927 [26235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:04:39.467 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:04:41.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 10:04:41.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430240,ok=430240,error=0, records=41
[WARN ] 2026-06-02 10:04:52.932 [26260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:04:54.467 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:04:56.502 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 10:04:56.502 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430241,ok=430241,error=0, records=41
[INFO ] 2026-06-02 10:05:01.957 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21527/300s
[WARN ] 2026-06-02 10:05:07.938 [26301] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:05:09.468 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:05:11.508 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 10:05:11.508 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430242,ok=430242,error=0, records=41
[INFO ] 2026-06-02 10:05:15.597 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17922/300s
[INFO ] 2026-06-02 10:05:15.599 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847412},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:05:15.752 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:05:15.752 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 10:05:15.752 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:05:15.752 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:05:15.752 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:05:15.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:05:15.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:05:15.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:05:15.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:05:15.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:05:15.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:05:22.943 [26312] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:05:24.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:05:26.513 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 10:05:26.513 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430243,ok=430243,error=0, records=41
[INFO ] 2026-06-02 10:05:30.946 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21518/300s
[WARN ] 2026-06-02 10:05:37.949 [26352] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:05:39.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:05:41.518 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 10:05:41.518 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430244,ok=430244,error=0, records=41
[INFO ] 2026-06-02 10:05:43.156 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21527/300s
[WARN ] 2026-06-02 10:05:52.954 [26335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:05:54.470 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:05:56.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 10:05:56.526 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430245,ok=430245,error=0, records=41
[WARN ] 2026-06-02 10:06:07.960 [26335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:06:09.470 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:06:11.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 10:06:11.532 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430246,ok=430246,error=0, records=41
[INFO ] 2026-06-02 10:06:11.532 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21514/300s
[WARN ] 2026-06-02 10:06:22.964 [26394] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:06:24.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:06:26.537 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 10:06:26.537 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430247,ok=430247,error=0, records=41
[WARN ] 2026-06-02 10:06:37.969 [26335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:06:39.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:06:41.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 10:06:41.545 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430248,ok=430248,error=0, records=41
[INFO ] 2026-06-02 10:06:51.586 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21523/300s
[WARN ] 2026-06-02 10:06:52.973 [26335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:06:54.472 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:06:56.551 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 10:06:56.551 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430249,ok=430249,error=0, records=41
[INFO ] 2026-06-02 10:07:02.674 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21514/300s
[WARN ] 2026-06-02 10:07:07.978 [26334] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:07:09.473 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:07:09.473 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21526/300s
[INFO ] 2026-06-02 10:07:11.556 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 10:07:11.556 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430250,ok=430250,error=0, records=41
[WARN ] 2026-06-02 10:07:22.983 [26380] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:07:24.473 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:07:26.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 10:07:26.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430251,ok=430251,error=0, records=41
[WARN ] 2026-06-02 10:07:37.988 [26380] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:07:39.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:07:41.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 10:07:41.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430252,ok=430252,error=0, records=41
[WARN ] 2026-06-02 10:07:52.993 [26380] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:07:54.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:07:56.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 10:07:56.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430253,ok=430253,error=0, records=41
[INFO ] 2026-06-02 10:07:58.699 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21524/300s
[INFO ] 2026-06-02 10:08:00.500 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21524/300s
[INFO ] 2026-06-02 10:08:07.306 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21524/300s
[WARN ] 2026-06-02 10:08:07.998 [26493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:08:09.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:08:11.583 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 10:08:11.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430254,ok=430254,error=0, records=41
[INFO ] 2026-06-02 10:08:15.754 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847348},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:08:15.910 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:08:15.910 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 10:08:15.910 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:08:15.910 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:08:15.910 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:08:15.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:08:15.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:08:15.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:08:15.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:08:15.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:08:15.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:08:23.003 [26508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:08:24.476 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:08:26.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 10:08:26.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430255,ok=430255,error=0, records=41
[WARN ] 2026-06-02 10:08:38.008 [26380] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:08:39.476 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:08:41.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 10:08:41.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430256,ok=430256,error=0, records=41
[WARN ] 2026-06-02 10:08:53.012 [26522] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:08:54.477 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:08:54.477 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 10:08:56.602 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 10:08:56.602 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430257,ok=430257,error=0, records=41
[WARN ] 2026-06-02 10:09:08.017 [26479] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:09:09.478 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:09:11.612 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 10:09:11.612 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430258,ok=430258,error=0, records=41
[WARN ] 2026-06-02 10:09:23.023 [26508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:09:24.479 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:09:26.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 10:09:26.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430259,ok=430259,error=0, records=41
[WARN ] 2026-06-02 10:09:38.028 [26550] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:09:39.480 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:09:41.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 10:09:41.626 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430260,ok=430260,error=0, records=41
[WARN ] 2026-06-02 10:09:53.032 [26522] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:09:54.480 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:09:56.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 10:09:56.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430261,ok=430261,error=0, records=41
[INFO ] 2026-06-02 10:10:01.961 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21528/300s
[WARN ] 2026-06-02 10:10:08.037 [26614] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:10:09.481 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:10:11.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 10:10:11.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430262,ok=430262,error=0, records=41
[WARN ] 2026-06-02 10:10:23.042 [26564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:10:24.482 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:10:26.648 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 10:10:26.648 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430263,ok=430263,error=0, records=41
[INFO ] 2026-06-02 10:10:31.044 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21519/300s
[WARN ] 2026-06-02 10:10:38.047 [26641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:10:39.482 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:10:41.654 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 10:10:41.654 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430264,ok=430264,error=0, records=41
[INFO ] 2026-06-02 10:10:43.163 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21528/300s
[WARN ] 2026-06-02 10:10:53.052 [26564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:10:54.483 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:10:56.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 10:10:56.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430265,ok=430265,error=0, records=41
[WARN ] 2026-06-02 10:11:07.557 [26681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:11:09.484 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:11:11.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 10:11:11.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430266,ok=430266,error=0, records=41
[INFO ] 2026-06-02 10:11:11.665 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21515/300s
[INFO ] 2026-06-02 10:11:15.910 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17923/300s
[INFO ] 2026-06-02 10:11:15.912 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847284},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:11:16.093 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:11:16.093 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 10:11:16.093 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:11:16.093 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:11:16.093 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:11:16.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:11:16.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:11:16.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:11:16.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:11:16.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:11:16.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:11:22.562 [26704] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:11:24.484 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:11:26.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 10:11:26.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430267,ok=430267,error=0, records=41
[WARN ] 2026-06-02 10:11:37.567 [26673] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:11:39.485 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:11:41.677 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 10:11:41.677 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430268,ok=430268,error=0, records=41
[INFO ] 2026-06-02 10:11:51.644 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21524/300s
[WARN ] 2026-06-02 10:11:52.573 [26737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:11:54.485 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:11:56.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 10:11:56.691 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430269,ok=430269,error=0, records=41
[INFO ] 2026-06-02 10:12:02.858 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21515/300s
[WARN ] 2026-06-02 10:12:07.577 [26755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:12:09.486 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:12:09.486 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21527/300s
[INFO ] 2026-06-02 10:12:11.697 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 10:12:11.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430270,ok=430270,error=0, records=41
[WARN ] 2026-06-02 10:12:22.582 [26761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:12:24.487 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:12:26.703 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 10:12:26.703 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430271,ok=430271,error=0, records=41
[WARN ] 2026-06-02 10:12:37.587 [26783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:12:39.487 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:12:41.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 10:12:41.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430272,ok=430272,error=0, records=41
[WARN ] 2026-06-02 10:12:52.594 [26761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:12:54.488 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:12:56.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 10:12:56.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430273,ok=430273,error=0, records=41
[INFO ] 2026-06-02 10:12:58.763 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21525/300s
[INFO ] 2026-06-02 10:13:00.565 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21525/300s
[INFO ] 2026-06-02 10:13:07.371 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21525/300s
[WARN ] 2026-06-02 10:13:07.598 [26737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:13:09.489 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:13:11.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 10:13:11.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430274,ok=430274,error=0, records=41
[WARN ] 2026-06-02 10:13:22.603 [26798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:13:24.489 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:13:26.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 10:13:26.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430275,ok=430275,error=0, records=41
[WARN ] 2026-06-02 10:13:37.608 [26798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:13:39.490 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 10:13:39.490 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 10:13:41.731 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 10:13:41.731 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430276,ok=430276,error=0, records=41
[WARN ] 2026-06-02 10:13:52.613 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:13:54.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:13:56.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 10:13:56.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430277,ok=430277,error=0, records=41
[WARN ] 2026-06-02 10:14:07.618 [26737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:14:09.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:14:11.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 10:14:11.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430278,ok=430278,error=0, records=41
[INFO ] 2026-06-02 10:14:16.094 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847216},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:14:16.262 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:14:16.262 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 10:14:16.262 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:14:16.263 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:14:16.263 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:14:16.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:14:16.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:14:16.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:14:16.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:14:16.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:14:16.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:14:22.623 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:14:24.492 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:14:26.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 10:14:26.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430279,ok=430279,error=0, records=41
[WARN ] 2026-06-02 10:14:37.629 [26783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:14:39.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:14:41.766 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 10:14:41.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430280,ok=430280,error=0, records=41
[WARN ] 2026-06-02 10:14:52.636 [26798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:14:54.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:14:56.773 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 10:14:56.773 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430281,ok=430281,error=0, records=41
[INFO ] 2026-06-02 10:15:01.964 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21529/300s
[WARN ] 2026-06-02 10:15:07.640 [26737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:15:09.494 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:15:11.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 10:15:11.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430282,ok=430282,error=0, records=41
[WARN ] 2026-06-02 10:15:22.645 [26783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:15:24.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:15:26.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 10:15:26.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430283,ok=430283,error=0, records=41
[INFO ] 2026-06-02 10:15:31.147 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21520/300s
[WARN ] 2026-06-02 10:15:37.650 [26761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:15:39.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:15:41.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 10:15:41.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430284,ok=430284,error=0, records=41
[INFO ] 2026-06-02 10:15:43.169 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21529/300s
[WARN ] 2026-06-02 10:15:52.656 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:15:54.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:15:56.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 10:15:56.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430285,ok=430285,error=0, records=41
[WARN ] 2026-06-02 10:16:07.662 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:16:09.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:16:11.804 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 10:16:11.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430286,ok=430286,error=0, records=41
[INFO ] 2026-06-02 10:16:11.804 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21516/300s
[WARN ] 2026-06-02 10:16:22.666 [26761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:16:24.497 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:16:26.809 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 10:16:26.809 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430287,ok=430287,error=0, records=41
[WARN ] 2026-06-02 10:16:37.671 [26761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:16:39.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:16:41.818 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 10:16:41.818 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430288,ok=430288,error=0, records=41
[INFO ] 2026-06-02 10:16:51.702 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21525/300s
[WARN ] 2026-06-02 10:16:52.676 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:16:54.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:16:56.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 10:16:56.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430289,ok=430289,error=0, records=41
[INFO ] 2026-06-02 10:17:03.040 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21516/300s
[WARN ] 2026-06-02 10:17:07.680 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:17:09.499 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:17:09.499 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21528/300s
[INFO ] 2026-06-02 10:17:11.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 10:17:11.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430290,ok=430290,error=0, records=41
[INFO ] 2026-06-02 10:17:16.263 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17924/300s
[INFO ] 2026-06-02 10:17:16.264 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847148},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:17:16.432 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:17:16.432 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 10:17:16.432 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:17:16.432 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:17:16.432 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:17:16.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:17:16.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:17:16.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:17:16.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:17:16.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:17:16.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:17:22.686 [26737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:17:24.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:17:26.833 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 10:17:26.833 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430291,ok=430291,error=0, records=41
[WARN ] 2026-06-02 10:17:37.691 [26798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:17:39.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:17:41.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 10:17:41.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430292,ok=430292,error=0, records=41
[WARN ] 2026-06-02 10:17:52.695 [26783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:17:54.501 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:17:56.846 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 10:17:56.846 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430293,ok=430293,error=0, records=41
[INFO ] 2026-06-02 10:17:58.833 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21526/300s
[INFO ] 2026-06-02 10:18:00.635 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21526/300s
[INFO ] 2026-06-02 10:18:07.442 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21526/300s
[WARN ] 2026-06-02 10:18:07.702 [26798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:18:09.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:18:11.851 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 10:18:11.851 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430294,ok=430294,error=0, records=41
[WARN ] 2026-06-02 10:18:22.708 [26761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:18:24.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:18:26.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 10:18:26.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430295,ok=430295,error=0, records=41
[WARN ] 2026-06-02 10:18:37.714 [26737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:18:39.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:18:41.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 10:18:41.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430296,ok=430296,error=0, records=41
[WARN ] 2026-06-02 10:18:52.720 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:18:54.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:18:56.894 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 10:18:56.894 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430297,ok=430297,error=0, records=41
[WARN ] 2026-06-02 10:19:07.725 [26783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:19:09.504 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:19:11.899 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-02 10:19:11.899 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430298,ok=430298,error=0, records=41
[WARN ] 2026-06-02 10:19:22.731 [26798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:19:24.505 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:19:26.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 10:19:26.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430299,ok=430299,error=0, records=41
[WARN ] 2026-06-02 10:19:37.736 [26761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:19:39.505 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:19:41.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 10:19:41.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430300,ok=430300,error=0, records=41
[WARN ] 2026-06-02 10:19:52.741 [26783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:19:54.506 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:19:56.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 10:19:56.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430301,ok=430301,error=0, records=41
[INFO ] 2026-06-02 10:20:01.968 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21530/300s
[WARN ] 2026-06-02 10:20:07.746 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:20:09.507 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:20:11.927 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 10:20:11.927 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430302,ok=430302,error=0, records=41
[INFO ] 2026-06-02 10:20:16.433 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20847068},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:20:16.601 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:20:16.601 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 10:20:16.602 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:20:16.602 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:20:16.602 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:20:16.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:20:16.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:20:16.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:20:16.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:20:16.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:20:16.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:20:22.751 [26761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:20:24.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:20:26.933 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 10:20:26.933 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430303,ok=430303,error=0, records=41
[INFO ] 2026-06-02 10:20:31.254 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21521/300s
[WARN ] 2026-06-02 10:20:37.757 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:20:39.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:20:41.939 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-02 10:20:41.939 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430304,ok=430304,error=0, records=41
[INFO ] 2026-06-02 10:20:43.175 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21530/300s
[WARN ] 2026-06-02 10:20:52.762 [26783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:20:54.509 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:20:56.944 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-02 10:20:56.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430305,ok=430305,error=0, records=41
[WARN ] 2026-06-02 10:21:07.767 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:21:09.509 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:21:11.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 10:21:11.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430306,ok=430306,error=0, records=41
[INFO ] 2026-06-02 10:21:11.949 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21517/300s
[WARN ] 2026-06-02 10:21:22.773 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:21:24.510 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:21:26.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-02 10:21:26.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430307,ok=430307,error=0, records=41
[WARN ] 2026-06-02 10:21:37.778 [26737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:21:39.510 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:21:41.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 10:21:41.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430308,ok=430308,error=0, records=41
[INFO ] 2026-06-02 10:21:51.766 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21526/300s
[WARN ] 2026-06-02 10:21:52.784 [26798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:21:54.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:21:56.965 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-02 10:21:56.965 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430309,ok=430309,error=0, records=41
[INFO ] 2026-06-02 10:22:03.223 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21517/300s
[WARN ] 2026-06-02 10:22:07.790 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:22:09.512 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:22:09.512 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21529/300s
[INFO ] 2026-06-02 10:22:11.970 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 10:22:11.970 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430310,ok=430310,error=0, records=41
[WARN ] 2026-06-02 10:22:22.796 [26783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:22:24.512 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:22:27.012 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 10:22:27.012 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430311,ok=430311,error=0, records=41
[WARN ] 2026-06-02 10:22:37.802 [26798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:22:39.513 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:22:42.018 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 10:22:42.018 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430312,ok=430312,error=0, records=41
[WARN ] 2026-06-02 10:22:52.807 [27331] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:22:54.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:22:57.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 10:22:57.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430313,ok=430313,error=0, records=41
[INFO ] 2026-06-02 10:22:58.900 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21527/300s
[INFO ] 2026-06-02 10:23:00.702 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21527/300s
[INFO ] 2026-06-02 10:23:07.506 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21527/300s
[WARN ] 2026-06-02 10:23:07.812 [26737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:23:09.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:23:12.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 10:23:12.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430314,ok=430314,error=0, records=41
[INFO ] 2026-06-02 10:23:16.602 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17925/300s
[INFO ] 2026-06-02 10:23:16.603 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846992},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:23:16.744 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:23:16.744 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 10:23:16.744 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:23:16.744 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:23:16.744 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:23:16.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:23:16.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:23:16.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:23:16.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:23:16.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:23:16.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:23:22.818 [26737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:23:24.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:23:27.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 10:23:27.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430315,ok=430315,error=0, records=41
[WARN ] 2026-06-02 10:23:37.823 [26737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:23:39.516 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 10:23:39.516 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 10:23:42.040 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 10:23:42.040 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430316,ok=430316,error=0, records=41
[WARN ] 2026-06-02 10:23:52.829 [27351] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:23:54.516 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:23:54.516 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 10:23:57.046 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 10:23:57.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430317,ok=430317,error=0, records=41
[WARN ] 2026-06-02 10:24:07.834 [27366] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:24:09.518 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:24:12.051 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 10:24:12.051 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430318,ok=430318,error=0, records=41
[WARN ] 2026-06-02 10:24:22.840 [26817] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:24:24.518 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:24:27.058 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 10:24:27.058 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430319,ok=430319,error=0, records=41
[WARN ] 2026-06-02 10:24:37.846 [27346] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:24:39.519 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:24:42.066 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 10:24:42.066 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430320,ok=430320,error=0, records=41
[WARN ] 2026-06-02 10:24:52.850 [27346] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:24:54.520 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:24:57.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 10:24:57.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430321,ok=430321,error=0, records=41
[INFO ] 2026-06-02 10:25:01.971 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21531/300s
[WARN ] 2026-06-02 10:25:07.856 [27351] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:25:09.521 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:25:12.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 10:25:12.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430322,ok=430322,error=0, records=41
[WARN ] 2026-06-02 10:25:22.861 [27472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:25:24.521 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:25:27.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 10:25:27.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430323,ok=430323,error=0, records=41
[INFO ] 2026-06-02 10:25:31.363 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21522/300s
[WARN ] 2026-06-02 10:25:37.866 [27351] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:25:39.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:25:42.096 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 10:25:42.096 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430324,ok=430324,error=0, records=41
[INFO ] 2026-06-02 10:25:43.181 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21531/300s
[WARN ] 2026-06-02 10:25:52.871 [27351] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:25:54.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:25:57.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 10:25:57.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430325,ok=430325,error=0, records=41
[WARN ] 2026-06-02 10:26:07.877 [27513] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:26:09.523 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:26:12.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-02 10:26:12.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430326,ok=430326,error=0, records=41
[INFO ] 2026-06-02 10:26:12.125 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21518/300s
[INFO ] 2026-06-02 10:26:16.745 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846932},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:26:16.897 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:26:16.897 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 10:26:16.898 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:26:16.898 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:26:16.898 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:26:16.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:26:16.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:26:16.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:26:16.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:26:16.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:26:16.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:26:22.882 [27458] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:26:24.523 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:26:27.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 10:26:27.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430327,ok=430327,error=0, records=41
[WARN ] 2026-06-02 10:26:37.887 [27346] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:26:39.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:26:42.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-02 10:26:42.136 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430328,ok=430328,error=0, records=41
[INFO ] 2026-06-02 10:26:51.822 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21527/300s
[WARN ] 2026-06-02 10:26:52.892 [27519] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:26:54.525 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:26:57.142 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 10:26:57.142 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430329,ok=430329,error=0, records=41
[INFO ] 2026-06-02 10:27:03.402 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21518/300s
[WARN ] 2026-06-02 10:27:07.899 [27584] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:27:09.525 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:27:09.525 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21530/300s
[INFO ] 2026-06-02 10:27:12.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 10:27:12.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430330,ok=430330,error=0, records=41
[WARN ] 2026-06-02 10:27:22.905 [27531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:27:24.526 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:27:27.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 10:27:27.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430331,ok=430331,error=0, records=41
[WARN ] 2026-06-02 10:27:37.910 [27615] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:27:39.527 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:27:42.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 10:27:42.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430332,ok=430332,error=0, records=41
[WARN ] 2026-06-02 10:27:52.916 [27564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:27:54.527 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:27:57.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 10:27:57.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430333,ok=430333,error=0, records=41
[INFO ] 2026-06-02 10:27:58.971 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21528/300s
[INFO ] 2026-06-02 10:28:00.772 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21528/300s
[INFO ] 2026-06-02 10:28:07.578 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21528/300s
[WARN ] 2026-06-02 10:28:07.922 [27564] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:28:09.528 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:28:12.182 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10388, records=41
[INFO ] 2026-06-02 10:28:12.183 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430334,ok=430334,error=0, records=41
[WARN ] 2026-06-02 10:28:22.927 [27659] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:28:24.529 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:28:27.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-02 10:28:27.189 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430335,ok=430335,error=0, records=41
[WARN ] 2026-06-02 10:28:37.933 [27681] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:28:39.529 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:28:42.208 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-02 10:28:42.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430336,ok=430336,error=0, records=41
[WARN ] 2026-06-02 10:28:52.938 [27626] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:28:54.530 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:28:57.212 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10372, records=41
[INFO ] 2026-06-02 10:28:57.212 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430337,ok=430337,error=0, records=41
[WARN ] 2026-06-02 10:29:07.945 [27687] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:29:09.531 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:29:12.223 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-02 10:29:12.223 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430338,ok=430338,error=0, records=41
[INFO ] 2026-06-02 10:29:16.898 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17926/300s
[INFO ] 2026-06-02 10:29:16.899 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846868},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:29:17.072 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:29:17.072 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 10:29:17.072 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:29:17.072 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:29:17.072 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:29:17.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:29:17.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:29:17.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:29:17.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:29:17.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:29:17.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:29:22.949 [27713] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:29:24.531 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:29:27.228 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-02 10:29:27.228 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430339,ok=430339,error=0, records=41
[WARN ] 2026-06-02 10:29:37.954 [27740] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:29:39.532 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:29:42.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 10:29:42.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430340,ok=430340,error=0, records=41
[WARN ] 2026-06-02 10:29:52.960 [27754] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:29:54.532 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:29:57.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 10:29:57.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430341,ok=430341,error=0, records=41
[INFO ] 2026-06-02 10:30:01.975 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21532/300s
[WARN ] 2026-06-02 10:30:07.965 [27719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:30:09.533 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:30:12.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 10:30:12.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430342,ok=430342,error=0, records=41
[WARN ] 2026-06-02 10:30:22.970 [27713] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:30:24.534 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:30:27.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-02 10:30:27.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430343,ok=430343,error=0, records=41
[INFO ] 2026-06-02 10:30:31.472 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21523/300s
[WARN ] 2026-06-02 10:30:37.976 [27719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:30:39.534 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:30:42.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 10:30:42.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430344,ok=430344,error=0, records=41
[INFO ] 2026-06-02 10:30:43.188 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21532/300s
[WARN ] 2026-06-02 10:30:52.982 [27687] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:30:54.535 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=33.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:30:57.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-02 10:30:57.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430345,ok=430345,error=0, records=41
[WARN ] 2026-06-02 10:31:07.986 [27814] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:31:09.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:31:12.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 10:31:12.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430346,ok=430346,error=0, records=41
[INFO ] 2026-06-02 10:31:12.269 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21519/300s
[WARN ] 2026-06-02 10:31:22.991 [27842] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:31:24.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:31:27.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 10:31:27.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430347,ok=430347,error=0, records=41
[WARN ] 2026-06-02 10:31:37.996 [27713] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:31:39.537 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:31:42.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 10:31:42.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430348,ok=430348,error=0, records=41
[INFO ] 2026-06-02 10:31:51.879 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21528/300s
[WARN ] 2026-06-02 10:31:53.002 [27870] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:31:54.537 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=33.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:31:57.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 10:31:57.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430349,ok=430349,error=0, records=41
[INFO ] 2026-06-02 10:32:03.588 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21519/300s
[WARN ] 2026-06-02 10:32:08.007 [27713] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:32:09.538 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:32:09.538 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21531/300s
[INFO ] 2026-06-02 10:32:12.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 10:32:12.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430350,ok=430350,error=0, records=41
[INFO ] 2026-06-02 10:32:17.074 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846792},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:32:17.240 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:32:17.240 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 10:32:17.240 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:32:17.240 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:32:17.240 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:32:17.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:32:17.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:32:17.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:32:17.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:32:17.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:32:17.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:32:23.011 [27884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:32:24.539 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:32:27.296 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 10:32:27.296 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430351,ok=430351,error=0, records=41
[WARN ] 2026-06-02 10:32:38.017 [27842] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:32:39.539 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:32:42.301 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 10:32:42.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430352,ok=430352,error=0, records=41
[WARN ] 2026-06-02 10:32:53.023 [27856] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:32:54.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=33.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:32:57.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 10:32:57.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430353,ok=430353,error=0, records=41
[INFO ] 2026-06-02 10:32:59.046 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21529/300s
[INFO ] 2026-06-02 10:33:00.848 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21529/300s
[INFO ] 2026-06-02 10:33:07.653 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21529/300s
[WARN ] 2026-06-02 10:33:08.028 [27884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:33:09.541 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:33:12.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 10:33:12.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430354,ok=430354,error=0, records=41
[WARN ] 2026-06-02 10:33:23.033 [27884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:33:24.541 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:33:27.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 10:33:27.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430355,ok=430355,error=0, records=41
[WARN ] 2026-06-02 10:33:38.039 [27842] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:33:39.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 10:33:39.542 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 10:33:42.430 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 10:33:42.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430356,ok=430356,error=0, records=41
[WARN ] 2026-06-02 10:33:53.044 [27977] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:33:54.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:33:57.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 10:33:57.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430357,ok=430357,error=0, records=41
[WARN ] 2026-06-02 10:34:08.048 [28000] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:34:09.543 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:34:12.459 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 10:34:12.459 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430358,ok=430358,error=0, records=41
[WARN ] 2026-06-02 10:34:23.054 [28005] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:34:24.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:34:27.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 10:34:27.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430359,ok=430359,error=0, records=41
[WARN ] 2026-06-02 10:34:37.559 [28039] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:34:39.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:34:42.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 10:34:42.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430360,ok=430360,error=0, records=41
[WARN ] 2026-06-02 10:34:52.565 [28046] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:34:54.545 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:34:57.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 10:34:57.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430361,ok=430361,error=0, records=41
[INFO ] 2026-06-02 10:35:01.978 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21533/300s
[WARN ] 2026-06-02 10:35:07.572 [28076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:35:09.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:35:12.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-02 10:35:12.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430362,ok=430362,error=0, records=41
[INFO ] 2026-06-02 10:35:17.240 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17927/300s
[INFO ] 2026-06-02 10:35:17.242 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846720},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:35:17.398 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:35:17.398 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 10:35:17.398 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:35:17.398 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:35:17.398 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:35:17.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:35:17.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:35:17.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:35:17.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:35:17.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:35:17.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:35:22.577 [28081] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:35:24.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=33.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:35:27.493 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 10:35:27.493 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430363,ok=430363,error=0, records=41
[INFO ] 2026-06-02 10:35:31.581 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21524/300s
[WARN ] 2026-06-02 10:35:37.583 [28095] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:35:39.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:35:42.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 10:35:42.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430364,ok=430364,error=0, records=41
[INFO ] 2026-06-02 10:35:43.195 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21533/300s
[WARN ] 2026-06-02 10:35:52.589 [28119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:35:54.548 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:35:57.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-02 10:35:57.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430365,ok=430365,error=0, records=41
[WARN ] 2026-06-02 10:36:07.594 [28130] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:36:09.548 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:36:12.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 10:36:12.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430366,ok=430366,error=0, records=41
[INFO ] 2026-06-02 10:36:12.516 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21520/300s
[WARN ] 2026-06-02 10:36:22.601 [28161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:36:24.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:36:27.525 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 10:36:27.525 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430367,ok=430367,error=0, records=41
[WARN ] 2026-06-02 10:36:37.606 [28142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:36:39.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:36:42.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 10:36:42.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430368,ok=430368,error=0, records=41
[INFO ] 2026-06-02 10:36:51.941 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21529/300s
[WARN ] 2026-06-02 10:36:52.611 [28129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:36:54.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=33.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:36:57.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 10:36:57.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430369,ok=430369,error=0, records=41
[INFO ] 2026-06-02 10:37:03.775 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21520/300s
[WARN ] 2026-06-02 10:37:07.617 [28136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:37:09.551 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:37:09.551 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21532/300s
[INFO ] 2026-06-02 10:37:12.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 10:37:12.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430370,ok=430370,error=0, records=41
[WARN ] 2026-06-02 10:37:22.623 [28142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:37:24.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:37:27.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 10:37:27.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430371,ok=430371,error=0, records=41
[WARN ] 2026-06-02 10:37:37.628 [28129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:37:39.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:37:42.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 10:37:42.552 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430372,ok=430372,error=0, records=41
[WARN ] 2026-06-02 10:37:52.632 [28147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:37:54.553 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=33.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:37:57.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 10:37:57.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430373,ok=430373,error=0, records=41
[INFO ] 2026-06-02 10:37:59.123 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21530/300s
[INFO ] 2026-06-02 10:38:00.925 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21530/300s
[WARN ] 2026-06-02 10:38:07.638 [28147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:38:07.732 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21530/300s
[INFO ] 2026-06-02 10:38:09.554 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:38:12.567 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 10:38:12.567 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430374,ok=430374,error=0, records=41
[INFO ] 2026-06-02 10:38:17.400 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846652},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:38:17.546 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:38:17.546 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 10:38:17.547 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:38:17.547 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:38:17.547 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:38:17.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:38:17.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:38:17.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:38:17.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:38:17.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:38:17.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:38:22.643 [28136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:38:24.554 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=33.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:38:27.572 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 10:38:27.572 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430375,ok=430375,error=0, records=41
[WARN ] 2026-06-02 10:38:37.648 [28161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:38:39.555 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=33.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:38:42.576 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 10:38:42.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430376,ok=430376,error=0, records=41
[WARN ] 2026-06-02 10:38:52.654 [28129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:38:54.556 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=33.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:38:54.556 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 10:38:57.582 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 10:38:57.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430377,ok=430377,error=0, records=41
[WARN ] 2026-06-02 10:39:07.661 [28129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:39:09.557 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:39:12.588 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 10:39:12.588 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430378,ok=430378,error=0, records=41
[WARN ] 2026-06-02 10:39:22.666 [28147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:39:24.558 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:39:27.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-02 10:39:27.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430379,ok=430379,error=0, records=41
[WARN ] 2026-06-02 10:39:37.671 [28129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:39:39.558 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:39:42.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-02 10:39:42.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430380,ok=430380,error=0, records=41
[WARN ] 2026-06-02 10:39:52.677 [28136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:39:54.559 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:39:57.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-02 10:39:57.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430381,ok=430381,error=0, records=41
[INFO ] 2026-06-02 10:40:01.982 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21534/300s
[WARN ] 2026-06-02 10:40:07.682 [28161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:40:09.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:40:12.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 10:40:12.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430382,ok=430382,error=0, records=41
[WARN ] 2026-06-02 10:40:22.687 [28142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:40:24.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:40:27.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 10:40:27.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430383,ok=430383,error=0, records=41
[INFO ] 2026-06-02 10:40:31.690 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21525/300s
[WARN ] 2026-06-02 10:40:37.693 [28142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:40:39.561 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:40:42.664 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 10:40:42.664 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430384,ok=430384,error=0, records=41
[INFO ] 2026-06-02 10:40:43.201 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21534/300s
[WARN ] 2026-06-02 10:40:52.699 [28147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:40:54.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:40:57.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 10:40:57.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430385,ok=430385,error=0, records=41
[WARN ] 2026-06-02 10:41:07.704 [28147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:41:09.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:41:12.674 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-06-02 10:41:12.674 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430386,ok=430386,error=0, records=41
[INFO ] 2026-06-02 10:41:12.674 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21521/300s
[INFO ] 2026-06-02 10:41:17.547 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17928/300s
[INFO ] 2026-06-02 10:41:17.548 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846588},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:41:17.716 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:41:17.716 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 10:41:17.716 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:41:17.716 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:41:17.716 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:41:17.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:41:17.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:41:17.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:41:17.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:41:17.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:41:17.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:41:22.710 [28161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:41:24.563 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:41:27.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-02 10:41:27.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430387,ok=430387,error=0, records=41
[WARN ] 2026-06-02 10:41:37.715 [28129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:41:39.563 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:41:42.684 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 10:41:42.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430388,ok=430388,error=0, records=41
[INFO ] 2026-06-02 10:41:52.002 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21530/300s
[WARN ] 2026-06-02 10:41:52.721 [28129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:41:54.564 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:41:57.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 10:41:57.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430389,ok=430389,error=0, records=41
[INFO ] 2026-06-02 10:42:03.962 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21521/300s
[WARN ] 2026-06-02 10:42:07.727 [28147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:42:09.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:42:09.565 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21533/300s
[INFO ] 2026-06-02 10:42:12.694 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 10:42:12.694 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430390,ok=430390,error=0, records=41
[WARN ] 2026-06-02 10:42:22.735 [28161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:42:24.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:42:27.702 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 10:42:27.702 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430391,ok=430391,error=0, records=41
[WARN ] 2026-06-02 10:42:37.739 [28161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:42:39.566 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:42:42.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 10:42:42.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430392,ok=430392,error=0, records=41
[WARN ] 2026-06-02 10:42:52.745 [28147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:42:54.566 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:42:57.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 10:42:57.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430393,ok=430393,error=0, records=41
[INFO ] 2026-06-02 10:42:59.200 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21531/300s
[INFO ] 2026-06-02 10:43:01.001 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21531/300s
[WARN ] 2026-06-02 10:43:07.751 [28129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:43:07.808 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21531/300s
[INFO ] 2026-06-02 10:43:09.567 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:43:12.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 10:43:12.789 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430394,ok=430394,error=0, records=41
[WARN ] 2026-06-02 10:43:22.757 [28129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:43:24.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:43:27.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 10:43:27.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430395,ok=430395,error=0, records=41
[WARN ] 2026-06-02 10:43:37.762 [28136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:43:39.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 10:43:39.568 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 10:43:42.801 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 10:43:42.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430396,ok=430396,error=0, records=41
[WARN ] 2026-06-02 10:43:52.765 [28147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:43:54.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:43:57.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 10:43:57.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430397,ok=430397,error=0, records=41
[WARN ] 2026-06-02 10:44:07.771 [28129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:44:09.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:44:12.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 10:44:12.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430398,ok=430398,error=0, records=41
[INFO ] 2026-06-02 10:44:17.718 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846528},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:44:17.884 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:44:17.884 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 10:44:17.884 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:44:17.884 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:44:17.884 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:44:17.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:44:17.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:44:17.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:44:17.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:44:17.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:44:17.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:44:22.775 [28161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:44:24.570 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:44:27.820 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 10:44:27.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430399,ok=430399,error=0, records=41
[WARN ] 2026-06-02 10:44:37.780 [28161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:44:39.571 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:44:42.825 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 10:44:42.825 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430400,ok=430400,error=0, records=41
[WARN ] 2026-06-02 10:44:52.786 [28136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:44:54.571 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:44:57.893 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 10:44:57.893 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430401,ok=430401,error=0, records=41
[INFO ] 2026-06-02 10:45:01.986 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21535/300s
[WARN ] 2026-06-02 10:45:07.791 [28161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:45:09.572 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:45:12.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 10:45:12.899 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430402,ok=430402,error=0, records=41
[WARN ] 2026-06-02 10:45:22.796 [28129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:45:24.573 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:45:27.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 10:45:27.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430403,ok=430403,error=0, records=41
[INFO ] 2026-06-02 10:45:31.799 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21526/300s
[WARN ] 2026-06-02 10:45:37.802 [28142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:45:39.573 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:45:42.910 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 10:45:42.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430404,ok=430404,error=0, records=41
[INFO ] 2026-06-02 10:45:43.208 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21535/300s
[WARN ] 2026-06-02 10:45:52.807 [28689] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:45:54.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:45:57.918 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-02 10:45:57.918 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430405,ok=430405,error=0, records=41
[WARN ] 2026-06-02 10:46:07.812 [28142] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:46:09.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:46:12.929 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 10:46:12.929 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430406,ok=430406,error=0, records=41
[INFO ] 2026-06-02 10:46:12.929 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21522/300s
[WARN ] 2026-06-02 10:46:22.818 [28726] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:46:24.575 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:46:27.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 10:46:27.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430407,ok=430407,error=0, records=41
[WARN ] 2026-06-02 10:46:37.824 [28136] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:46:39.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:46:42.979 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 10:46:42.979 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430408,ok=430408,error=0, records=41
[INFO ] 2026-06-02 10:46:52.062 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21531/300s
[WARN ] 2026-06-02 10:46:52.830 [28726] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:46:54.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:46:58.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 10:46:58.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430409,ok=430409,error=0, records=41
[INFO ] 2026-06-02 10:47:04.146 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21522/300s
[WARN ] 2026-06-02 10:47:07.835 [28775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:47:09.577 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:47:09.577 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21534/300s
[INFO ] 2026-06-02 10:47:13.022 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 10:47:13.022 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430410,ok=430410,error=0, records=41
[INFO ] 2026-06-02 10:47:17.884 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17929/300s
[INFO ] 2026-06-02 10:47:17.886 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846452},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:47:18.057 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:47:18.057 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 10:47:18.057 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:47:18.057 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:47:18.057 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:47:18.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:47:18.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:47:18.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:47:18.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:47:18.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:47:18.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:47:22.840 [28720] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:47:24.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:47:28.027 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 10:47:28.027 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430411,ok=430411,error=0, records=41
[WARN ] 2026-06-02 10:47:37.845 [28801] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:47:39.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:47:43.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 10:47:43.032 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430412,ok=430412,error=0, records=41
[WARN ] 2026-06-02 10:47:52.850 [28819] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:47:54.579 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:47:58.043 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 10:47:58.043 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430413,ok=430413,error=0, records=41
[INFO ] 2026-06-02 10:47:59.269 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21532/300s
[INFO ] 2026-06-02 10:48:01.071 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21532/300s
[WARN ] 2026-06-02 10:48:07.855 [28836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:48:07.876 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21532/300s
[INFO ] 2026-06-02 10:48:09.579 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:48:13.048 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 10:48:13.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430414,ok=430414,error=0, records=41
[WARN ] 2026-06-02 10:48:22.861 [28726] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:48:24.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:48:28.058 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 10:48:28.058 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430415,ok=430415,error=0, records=41
[WARN ] 2026-06-02 10:48:37.866 [28854] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:48:39.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:48:43.064 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 10:48:43.064 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430416,ok=430416,error=0, records=41
[WARN ] 2026-06-02 10:48:52.871 [28836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:48:54.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:48:58.069 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 10:48:58.069 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430417,ok=430417,error=0, records=41
[WARN ] 2026-06-02 10:49:07.876 [28854] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:49:09.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:49:13.074 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 10:49:13.074 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430418,ok=430418,error=0, records=41
[WARN ] 2026-06-02 10:49:22.881 [28836] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:49:24.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:49:28.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 10:49:28.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430419,ok=430419,error=0, records=41
[WARN ] 2026-06-02 10:49:37.887 [28941] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:49:39.583 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:49:43.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 10:49:43.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430420,ok=430420,error=0, records=41
[WARN ] 2026-06-02 10:49:52.893 [28819] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:49:54.583 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:49:58.091 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 10:49:58.091 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430421,ok=430421,error=0, records=41
[INFO ] 2026-06-02 10:50:01.990 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21536/300s
[WARN ] 2026-06-02 10:50:07.898 [28974] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:50:09.584 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:50:13.096 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 10:50:13.096 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430422,ok=430422,error=0, records=41
[INFO ] 2026-06-02 10:50:18.059 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846376},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:50:18.226 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:50:18.226 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 10:50:18.226 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:50:18.226 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:50:18.226 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:50:18.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:50:18.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:50:18.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:50:18.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:50:18.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:50:18.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:50:22.903 [28908] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:50:24.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:50:28.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 10:50:28.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430423,ok=430423,error=0, records=41
[INFO ] 2026-06-02 10:50:31.906 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21527/300s
[WARN ] 2026-06-02 10:50:37.908 [29040] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:50:39.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:50:43.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 10:50:43.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430424,ok=430424,error=0, records=41
[INFO ] 2026-06-02 10:50:43.215 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21536/300s
[WARN ] 2026-06-02 10:50:52.913 [29056] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:50:54.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:50:58.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 10:50:58.114 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430425,ok=430425,error=0, records=41
[WARN ] 2026-06-02 10:51:07.919 [29075] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:51:09.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:51:13.124 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 10:51:13.124 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430426,ok=430426,error=0, records=41
[INFO ] 2026-06-02 10:51:13.124 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21523/300s
[WARN ] 2026-06-02 10:51:22.925 [29076] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:51:24.587 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:51:28.131 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 10:51:28.131 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430427,ok=430427,error=0, records=41
[WARN ] 2026-06-02 10:51:37.930 [29125] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:51:39.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:51:43.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 10:51:43.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430428,ok=430428,error=0, records=41
[INFO ] 2026-06-02 10:51:52.113 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21532/300s
[WARN ] 2026-06-02 10:51:52.937 [29089] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:51:54.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:51:58.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 10:51:58.144 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430429,ok=430429,error=0, records=41
[INFO ] 2026-06-02 10:52:04.330 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21523/300s
[WARN ] 2026-06-02 10:52:07.942 [29119] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:52:09.589 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:52:09.589 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21535/300s
[INFO ] 2026-06-02 10:52:13.149 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 10:52:13.149 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430430,ok=430430,error=0, records=41
[WARN ] 2026-06-02 10:52:22.947 [29153] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:52:24.589 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:52:28.155 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-02 10:52:28.155 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430431,ok=430431,error=0, records=41
[WARN ] 2026-06-02 10:52:37.953 [29189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:52:39.590 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:52:43.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-02 10:52:43.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430432,ok=430432,error=0, records=41
[WARN ] 2026-06-02 10:52:52.959 [29153] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:52:54.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:52:58.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 10:52:58.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430433,ok=430433,error=0, records=41
[INFO ] 2026-06-02 10:52:59.305 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21533/300s
[INFO ] 2026-06-02 10:53:01.107 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21533/300s
[INFO ] 2026-06-02 10:53:07.911 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21533/300s
[WARN ] 2026-06-02 10:53:07.963 [29153] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:53:09.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:53:13.172 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 10:53:13.172 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430434,ok=430434,error=0, records=41
[INFO ] 2026-06-02 10:53:18.226 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17930/300s
[INFO ] 2026-06-02 10:53:18.228 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846304},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:53:18.383 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:53:18.383 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 10:53:18.384 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:53:18.384 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:53:18.384 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:53:18.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:53:18.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:53:18.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:53:18.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:53:18.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:53:18.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:53:22.968 [29237] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:53:24.592 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:53:28.178 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 10:53:28.178 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430435,ok=430435,error=0, records=41
[WARN ] 2026-06-02 10:53:37.974 [29256] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:53:39.592 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 10:53:39.592 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 10:53:43.183 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 10:53:43.183 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430436,ok=430436,error=0, records=41
[WARN ] 2026-06-02 10:53:52.979 [29153] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:53:54.593 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:53:54.593 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 10:53:58.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 10:53:58.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430437,ok=430437,error=0, records=41
[WARN ] 2026-06-02 10:54:07.984 [29256] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:54:09.595 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:54:13.334 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 10:54:13.334 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430438,ok=430438,error=0, records=41
[WARN ] 2026-06-02 10:54:22.989 [29183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:54:24.596 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:54:28.341 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 10:54:28.341 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430439,ok=430439,error=0, records=41
[WARN ] 2026-06-02 10:54:37.993 [29153] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:54:39.596 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:54:43.346 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 10:54:43.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430440,ok=430440,error=0, records=41
[WARN ] 2026-06-02 10:54:52.998 [29189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:54:54.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:54:58.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 10:54:58.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430441,ok=430441,error=0, records=41
[INFO ] 2026-06-02 10:55:01.993 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21537/300s
[WARN ] 2026-06-02 10:55:08.003 [29153] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:55:09.598 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:55:13.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 10:55:13.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430442,ok=430442,error=0, records=41
[WARN ] 2026-06-02 10:55:23.008 [29189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:55:24.598 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:55:28.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 10:55:28.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430443,ok=430443,error=0, records=41
[INFO ] 2026-06-02 10:55:32.010 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21528/300s
[WARN ] 2026-06-02 10:55:38.013 [29420] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:55:39.599 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:55:43.221 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21537/300s
[INFO ] 2026-06-02 10:55:43.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-02 10:55:43.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430444,ok=430444,error=0, records=41
[WARN ] 2026-06-02 10:55:53.018 [29406] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:55:54.599 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:55:58.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 10:55:58.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430445,ok=430445,error=0, records=41
[WARN ] 2026-06-02 10:56:08.024 [29448] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:56:09.600 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:56:13.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 10:56:13.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430446,ok=430446,error=0, records=41
[INFO ] 2026-06-02 10:56:13.509 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21524/300s
[INFO ] 2026-06-02 10:56:18.385 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846236},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:56:18.547 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:56:18.547 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[WARN ] 2026-06-02 10:56:23.029 [29434] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:56:24.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:56:28.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-02 10:56:28.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430447,ok=430447,error=0, records=41
[WARN ] 2026-06-02 10:56:38.035 [29462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:56:39.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:56:43.521 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 10:56:43.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430448,ok=430448,error=0, records=41
[INFO ] 2026-06-02 10:56:52.169 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21533/300s
[WARN ] 2026-06-02 10:56:53.041 [29434] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:56:54.602 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:56:58.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 10:56:58.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430449,ok=430449,error=0, records=41
[INFO ] 2026-06-02 10:57:04.510 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21524/300s
[WARN ] 2026-06-02 10:57:08.046 [29420] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:57:09.603 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:57:09.603 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21536/300s
[INFO ] 2026-06-02 10:57:13.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 10:57:13.532 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430450,ok=430450,error=0, records=41
[WARN ] 2026-06-02 10:57:23.051 [29519] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:57:24.603 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:57:28.542 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 10:57:28.542 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430451,ok=430451,error=0, records=41
[WARN ] 2026-06-02 10:57:37.557 [29535] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:57:39.604 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:57:43.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 10:57:43.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430452,ok=430452,error=0, records=41
[WARN ] 2026-06-02 10:57:52.562 [29535] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:57:54.605 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:57:58.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 10:57:58.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430453,ok=430453,error=0, records=41
[INFO ] 2026-06-02 10:57:59.376 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21534/300s
[INFO ] 2026-06-02 10:58:01.178 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21534/300s
[WARN ] 2026-06-02 10:58:07.567 [29559] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:58:07.985 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21534/300s
[INFO ] 2026-06-02 10:58:09.605 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:58:13.560 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 10:58:13.560 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430454,ok=430454,error=0, records=41
[WARN ] 2026-06-02 10:58:22.573 [29577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:58:24.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:58:28.565 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 10:58:28.565 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430455,ok=430455,error=0, records=41
[WARN ] 2026-06-02 10:58:37.579 [29577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:58:39.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:58:43.571 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-02 10:58:43.571 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430456,ok=430456,error=0, records=41
[WARN ] 2026-06-02 10:58:52.583 [29607] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:58:54.607 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:58:58.647 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 10:58:58.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430457,ok=430457,error=0, records=41
[WARN ] 2026-06-02 10:59:07.591 [29577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:59:09.608 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:59:13.652 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 10:59:13.652 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430458,ok=430458,error=0, records=41
[INFO ] 2026-06-02 10:59:18.547 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17931/300s
[INFO ] 2026-06-02 10:59:18.549 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846164},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 10:59:18.707 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 10:59:18.707 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 10:59:18.707 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 10:59:18.707 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 10:59:18.707 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 10:59:18.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:59:18.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 10:59:18.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:59:18.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 10:59:18.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 10:59:18.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 10:59:22.596 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:59:24.608 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:59:28.659 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 10:59:28.659 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430459,ok=430459,error=0, records=41
[WARN ] 2026-06-02 10:59:37.602 [29577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:59:39.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:59:43.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 10:59:43.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430460,ok=430460,error=0, records=41
[WARN ] 2026-06-02 10:59:52.608 [29657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 10:59:54.610 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 10:59:58.673 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 10:59:58.673 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430461,ok=430461,error=0, records=41
[INFO ] 2026-06-02 11:00:01.996 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21538/300s
[WARN ] 2026-06-02 11:00:07.615 [29641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:00:09.610 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:00:13.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 11:00:13.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430462,ok=430462,error=0, records=41
[WARN ] 2026-06-02 11:00:22.622 [29577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:00:24.611 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:00:28.687 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 11:00:28.687 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430463,ok=430463,error=0, records=41
[INFO ] 2026-06-02 11:00:32.124 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21529/300s
[WARN ] 2026-06-02 11:00:37.627 [29663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:00:39.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:00:43.228 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21538/300s
[INFO ] 2026-06-02 11:00:43.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 11:00:43.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430464,ok=430464,error=0, records=41
[WARN ] 2026-06-02 11:00:52.632 [29641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:00:54.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:00:58.698 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 11:00:58.698 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430465,ok=430465,error=0, records=41
[WARN ] 2026-06-02 11:01:07.638 [29663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:01:09.613 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:01:13.703 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-02 11:01:13.703 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430466,ok=430466,error=0, records=41
[INFO ] 2026-06-02 11:01:13.703 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21525/300s
[WARN ] 2026-06-02 11:01:22.644 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:01:24.613 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:01:28.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-02 11:01:28.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430467,ok=430467,error=0, records=41
[WARN ] 2026-06-02 11:01:37.650 [29657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:01:39.614 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:01:43.718 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-02 11:01:43.718 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430468,ok=430468,error=0, records=41
[INFO ] 2026-06-02 11:01:52.227 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21534/300s
[WARN ] 2026-06-02 11:01:52.656 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:01:54.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:01:58.724 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-02 11:01:58.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430469,ok=430469,error=0, records=41
[INFO ] 2026-06-02 11:02:04.691 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21525/300s
[WARN ] 2026-06-02 11:02:07.661 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:02:09.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:02:09.615 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21537/300s
[INFO ] 2026-06-02 11:02:13.730 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 11:02:13.730 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430470,ok=430470,error=0, records=41
[INFO ] 2026-06-02 11:02:18.709 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846100},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:02:18.887 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:02:18.887 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 11:02:18.887 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:02:18.887 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:02:18.887 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:02:18.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:02:18.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:02:18.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:02:18.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:02:18.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:02:18.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:02:22.665 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:02:24.616 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:02:28.735 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 11:02:28.735 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430471,ok=430471,error=0, records=41
[WARN ] 2026-06-02 11:02:37.671 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:02:39.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:02:43.742 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 11:02:43.742 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430472,ok=430472,error=0, records=41
[WARN ] 2026-06-02 11:02:52.675 [29641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:02:54.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:02:58.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 11:02:58.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430473,ok=430473,error=0, records=41
[INFO ] 2026-06-02 11:02:59.448 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21535/300s
[INFO ] 2026-06-02 11:03:01.249 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21535/300s
[WARN ] 2026-06-02 11:03:07.680 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:03:08.056 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21535/300s
[INFO ] 2026-06-02 11:03:09.618 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:03:13.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 11:03:13.754 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430474,ok=430474,error=0, records=41
[WARN ] 2026-06-02 11:03:22.685 [29657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:03:24.619 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:03:28.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 11:03:28.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430475,ok=430475,error=0, records=41
[WARN ] 2026-06-02 11:03:37.690 [29577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:03:39.619 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 11:03:39.619 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 11:03:43.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 11:03:43.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430476,ok=430476,error=0, records=41
[WARN ] 2026-06-02 11:03:52.695 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:03:54.620 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:03:58.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 11:03:58.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430477,ok=430477,error=0, records=41
[WARN ] 2026-06-02 11:04:07.700 [29657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:04:09.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:04:13.781 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 11:04:13.781 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430478,ok=430478,error=0, records=41
[WARN ] 2026-06-02 11:04:22.705 [29641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:04:24.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:04:28.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 11:04:28.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430479,ok=430479,error=0, records=41
[WARN ] 2026-06-02 11:04:37.710 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:04:39.622 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:04:43.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 11:04:43.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430480,ok=430480,error=0, records=41
[WARN ] 2026-06-02 11:04:52.715 [29641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:04:54.623 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:04:58.804 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 11:04:58.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430481,ok=430481,error=0, records=41
[INFO ] 2026-06-02 11:05:02.000 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21539/300s
[WARN ] 2026-06-02 11:05:07.721 [29657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:05:09.623 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:05:13.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 11:05:13.811 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430482,ok=430482,error=0, records=41
[INFO ] 2026-06-02 11:05:18.887 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17932/300s
[INFO ] 2026-06-02 11:05:18.889 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20846028},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:05:19.039 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:05:19.039 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 11:05:19.039 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:05:19.039 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:05:19.039 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:05:19.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:05:19.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:05:19.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:05:19.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:05:19.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:05:19.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:05:22.726 [29641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:05:24.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:05:28.826 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 11:05:28.826 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430483,ok=430483,error=0, records=41
[INFO ] 2026-06-02 11:05:32.229 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21530/300s
[WARN ] 2026-06-02 11:05:37.731 [29577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:05:39.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:05:43.234 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21539/300s
[INFO ] 2026-06-02 11:05:43.833 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 11:05:43.833 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430484,ok=430484,error=0, records=41
[WARN ] 2026-06-02 11:05:52.738 [29663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:05:54.625 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:05:58.838 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 11:05:58.838 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430485,ok=430485,error=0, records=41
[WARN ] 2026-06-02 11:06:07.744 [29657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:06:09.626 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:06:13.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 11:06:13.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430486,ok=430486,error=0, records=41
[INFO ] 2026-06-02 11:06:13.847 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21526/300s
[WARN ] 2026-06-02 11:06:22.749 [29663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:06:24.626 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:06:28.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 11:06:28.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430487,ok=430487,error=0, records=41
[WARN ] 2026-06-02 11:06:37.756 [29641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:06:39.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:06:43.858 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 11:06:43.858 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430488,ok=430488,error=0, records=41
[INFO ] 2026-06-02 11:06:52.282 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21535/300s
[WARN ] 2026-06-02 11:06:52.760 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:06:54.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:06:58.864 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 11:06:58.864 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430489,ok=430489,error=0, records=41
[INFO ] 2026-06-02 11:07:04.875 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21526/300s
[WARN ] 2026-06-02 11:07:07.766 [29663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:07:09.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:07:09.628 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21538/300s
[INFO ] 2026-06-02 11:07:13.869 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 11:07:13.869 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430490,ok=430490,error=0, records=41
[WARN ] 2026-06-02 11:07:22.770 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:07:24.629 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:07:28.874 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 11:07:28.874 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430491,ok=430491,error=0, records=41
[WARN ] 2026-06-02 11:07:37.775 [29657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:07:39.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:07:43.883 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 11:07:43.883 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430492,ok=430492,error=0, records=41
[WARN ] 2026-06-02 11:07:52.780 [29641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:07:54.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:07:58.890 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 11:07:58.890 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430493,ok=430493,error=0, records=41
[INFO ] 2026-06-02 11:07:59.515 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21536/300s
[INFO ] 2026-06-02 11:08:01.316 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21536/300s
[WARN ] 2026-06-02 11:08:07.786 [29663] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:08:08.123 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21536/300s
[INFO ] 2026-06-02 11:08:09.631 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.83MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:08:13.894 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 11:08:13.894 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430494,ok=430494,error=0, records=41
[INFO ] 2026-06-02 11:08:19.041 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845952},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:08:19.183 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:08:19.183 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 11:08:19.183 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:08:19.183 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:08:19.184 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:08:19.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:08:19.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:08:19.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:08:19.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:08:19.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:08:19.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:08:22.791 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:08:24.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:08:28.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 11:08:28.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430495,ok=430495,error=0, records=41
[WARN ] 2026-06-02 11:08:37.796 [29577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:08:39.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:08:43.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 11:08:43.905 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430496,ok=430496,error=0, records=41
[WARN ] 2026-06-02 11:08:52.802 [29641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:08:54.633 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:08:54.633 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 11:08:58.912 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 11:08:58.912 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430497,ok=430497,error=0, records=41
[WARN ] 2026-06-02 11:09:07.808 [29640] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:09:09.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:09:13.918 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 11:09:13.918 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430498,ok=430498,error=0, records=41
[WARN ] 2026-06-02 11:09:22.815 [30188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:09:24.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:09:28.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 11:09:28.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430499,ok=430499,error=0, records=41
[WARN ] 2026-06-02 11:09:37.820 [30208] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:09:39.636 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:09:43.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 11:09:43.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430500,ok=430500,error=0, records=41
[WARN ] 2026-06-02 11:09:52.824 [29577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:09:54.636 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:09:58.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 11:09:58.936 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430501,ok=430501,error=0, records=41
[INFO ] 2026-06-02 11:10:02.004 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21540/300s
[WARN ] 2026-06-02 11:10:07.829 [30236] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:10:09.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:10:13.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 11:10:13.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430502,ok=430502,error=0, records=41
[WARN ] 2026-06-02 11:10:22.835 [30269] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:10:24.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:10:28.987 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 11:10:28.987 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430503,ok=430503,error=0, records=41
[INFO ] 2026-06-02 11:10:32.338 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21531/300s
[WARN ] 2026-06-02 11:10:37.840 [29577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:10:39.638 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:10:43.241 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21540/300s
[INFO ] 2026-06-02 11:10:44.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 11:10:44.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430504,ok=430504,error=0, records=41
[WARN ] 2026-06-02 11:10:52.846 [30291] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:10:54.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:10:59.054 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 11:10:59.054 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430505,ok=430505,error=0, records=41
[WARN ] 2026-06-02 11:11:07.851 [30305] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:11:09.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:11:14.059 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 11:11:14.059 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430506,ok=430506,error=0, records=41
[INFO ] 2026-06-02 11:11:14.059 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21527/300s
[INFO ] 2026-06-02 11:11:19.184 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17933/300s
[INFO ] 2026-06-02 11:11:19.185 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845880},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:11:19.361 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:11:19.361 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 11:11:19.361 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:11:19.361 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:11:19.361 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:11:19.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:11:19.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:11:19.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:11:19.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:11:19.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:11:19.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:11:22.856 [30319] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:11:24.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:11:29.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 11:11:29.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430507,ok=430507,error=0, records=41
[WARN ] 2026-06-02 11:11:37.861 [30319] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:11:39.641 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:11:44.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 11:11:44.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430508,ok=430508,error=0, records=41
[INFO ] 2026-06-02 11:11:52.338 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21536/300s
[WARN ] 2026-06-02 11:11:52.868 [30305] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:11:54.641 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:11:59.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 11:11:59.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430509,ok=430509,error=0, records=41
[INFO ] 2026-06-02 11:12:05.059 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21527/300s
[WARN ] 2026-06-02 11:12:07.872 [29657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:12:09.642 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:12:09.642 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21539/300s
[INFO ] 2026-06-02 11:12:14.084 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-02 11:12:14.084 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430510,ok=430510,error=0, records=41
[WARN ] 2026-06-02 11:12:22.877 [30370] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:12:24.643 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:12:29.092 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-02 11:12:29.092 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430511,ok=430511,error=0, records=41
[WARN ] 2026-06-02 11:12:37.882 [30377] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:12:39.643 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:12:44.097 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-02 11:12:44.097 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430512,ok=430512,error=0, records=41
[WARN ] 2026-06-02 11:12:52.888 [30416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:12:54.644 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:12:59.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-02 11:12:59.103 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430513,ok=430513,error=0, records=41
[INFO ] 2026-06-02 11:12:59.580 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21537/300s
[INFO ] 2026-06-02 11:13:01.382 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21537/300s
[WARN ] 2026-06-02 11:13:07.893 [30421] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:13:08.189 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21537/300s
[INFO ] 2026-06-02 11:13:09.645 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:13:14.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 11:13:14.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430514,ok=430514,error=0, records=41
[WARN ] 2026-06-02 11:13:22.899 [30447] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:13:24.645 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:13:29.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 11:13:29.114 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430515,ok=430515,error=0, records=41
[WARN ] 2026-06-02 11:13:37.903 [30464] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:13:39.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 11:13:39.646 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 11:13:44.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 11:13:44.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430516,ok=430516,error=0, records=41
[WARN ] 2026-06-02 11:13:52.909 [30469] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:13:54.647 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:13:59.126 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 11:13:59.126 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430517,ok=430517,error=0, records=41
[WARN ] 2026-06-02 11:14:07.916 [30479] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:14:09.647 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:14:14.131 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-02 11:14:14.131 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430518,ok=430518,error=0, records=41
[INFO ] 2026-06-02 11:14:19.363 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845808},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:14:19.528 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:14:19.528 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 11:14:19.529 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:14:19.529 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:14:19.529 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:14:19.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:14:19.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:14:19.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:14:19.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:14:19.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:14:19.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:14:22.922 [30505] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:14:24.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:14:29.136 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 11:14:29.136 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430519,ok=430519,error=0, records=41
[WARN ] 2026-06-02 11:14:37.927 [30529] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:14:39.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:14:44.141 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 11:14:44.141 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430520,ok=430520,error=0, records=41
[WARN ] 2026-06-02 11:14:52.933 [30491] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:14:54.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:14:59.146 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 11:14:59.146 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430521,ok=430521,error=0, records=41
[INFO ] 2026-06-02 11:15:02.008 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21541/300s
[WARN ] 2026-06-02 11:15:07.938 [30540] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:15:09.650 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:15:14.152 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 11:15:14.152 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430522,ok=430522,error=0, records=41
[WARN ] 2026-06-02 11:15:22.943 [30579] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:15:24.650 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:15:29.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 11:15:29.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430523,ok=430523,error=0, records=41
[INFO ] 2026-06-02 11:15:32.446 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21532/300s
[WARN ] 2026-06-02 11:15:37.949 [30557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:15:39.651 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:15:43.247 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21541/300s
[INFO ] 2026-06-02 11:15:44.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 11:15:44.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430524,ok=430524,error=0, records=41
[WARN ] 2026-06-02 11:15:52.953 [30589] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:15:54.652 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:15:59.171 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 11:15:59.171 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430525,ok=430525,error=0, records=41
[WARN ] 2026-06-02 11:16:07.958 [30557] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:16:09.652 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:16:14.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 11:16:14.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430526,ok=430526,error=0, records=41
[INFO ] 2026-06-02 11:16:14.176 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21528/300s
[WARN ] 2026-06-02 11:16:22.962 [30595] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:16:24.653 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:16:29.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 11:16:29.180 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430527,ok=430527,error=0, records=41
[WARN ] 2026-06-02 11:16:37.967 [30595] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:16:39.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:16:44.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 11:16:44.186 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430528,ok=430528,error=0, records=41
[INFO ] 2026-06-02 11:16:52.393 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21537/300s
[WARN ] 2026-06-02 11:16:52.972 [30573] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:16:54.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:16:59.192 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 11:16:59.192 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430529,ok=430529,error=0, records=41
[INFO ] 2026-06-02 11:17:05.246 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21528/300s
[WARN ] 2026-06-02 11:17:07.977 [30619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:17:09.655 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:17:09.655 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21540/300s
[INFO ] 2026-06-02 11:17:14.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 11:17:14.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430530,ok=430530,error=0, records=41
[INFO ] 2026-06-02 11:17:19.529 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17934/300s
[INFO ] 2026-06-02 11:17:19.530 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845740},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:17:19.679 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:17:19.679 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 11:17:19.679 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:17:19.679 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:17:19.679 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:17:19.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:17:19.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:17:19.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:17:19.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:17:19.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:17:19.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:17:22.982 [30688] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:17:24.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:17:29.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 11:17:29.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430531,ok=430531,error=0, records=41
[WARN ] 2026-06-02 11:17:37.988 [30660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:17:39.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:17:44.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 11:17:44.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430532,ok=430532,error=0, records=41
[WARN ] 2026-06-02 11:17:52.992 [30660] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:17:54.657 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:17:59.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-02 11:17:59.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430533,ok=430533,error=0, records=41
[INFO ] 2026-06-02 11:17:59.649 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21538/300s
[INFO ] 2026-06-02 11:18:01.451 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21538/300s
[WARN ] 2026-06-02 11:18:07.997 [30731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:18:08.257 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21538/300s
[INFO ] 2026-06-02 11:18:09.658 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:18:14.220 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 11:18:14.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430534,ok=430534,error=0, records=41
[WARN ] 2026-06-02 11:18:23.003 [30703] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:18:24.658 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:18:29.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 11:18:29.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430535,ok=430535,error=0, records=41
[WARN ] 2026-06-02 11:18:38.007 [30731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:18:39.659 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:18:44.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 11:18:44.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430536,ok=430536,error=0, records=41
[WARN ] 2026-06-02 11:18:53.013 [30760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:18:54.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:18:59.242 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 11:18:59.242 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430537,ok=430537,error=0, records=41
[WARN ] 2026-06-02 11:19:08.017 [30760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:19:09.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:19:14.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 11:19:14.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430538,ok=430538,error=0, records=41
[WARN ] 2026-06-02 11:19:23.023 [30674] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:19:24.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:19:29.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 11:19:29.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430539,ok=430539,error=0, records=41
[WARN ] 2026-06-02 11:19:38.028 [30801] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:19:39.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:19:44.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 11:19:44.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430540,ok=430540,error=0, records=41
[WARN ] 2026-06-02 11:19:53.034 [30746] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:19:54.662 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:19:59.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 11:19:59.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430541,ok=430541,error=0, records=41
[INFO ] 2026-06-02 11:20:02.012 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21542/300s
[WARN ] 2026-06-02 11:20:08.039 [30855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:20:09.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:20:14.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 11:20:14.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430542,ok=430542,error=0, records=41
[INFO ] 2026-06-02 11:20:19.681 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845676},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:20:19.851 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:20:19.851 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 11:20:19.852 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:20:19.852 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:20:19.852 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:20:19.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:20:19.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:20:19.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:20:19.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:20:19.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:20:19.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:20:23.046 [30855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:20:24.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:20:29.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 11:20:29.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430543,ok=430543,error=0, records=41
[INFO ] 2026-06-02 11:20:32.550 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21533/300s
[WARN ] 2026-06-02 11:20:38.052 [30890] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:20:39.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:20:43.254 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21542/300s
[INFO ] 2026-06-02 11:20:44.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 11:20:44.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430544,ok=430544,error=0, records=41
[WARN ] 2026-06-02 11:20:52.558 [30860] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:20:54.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:20:59.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 11:20:59.288 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430545,ok=430545,error=0, records=41
[WARN ] 2026-06-02 11:21:07.563 [30885] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:21:09.665 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:21:14.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 11:21:14.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430546,ok=430546,error=0, records=41
[INFO ] 2026-06-02 11:21:14.292 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21529/300s
[WARN ] 2026-06-02 11:21:22.567 [30935] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:21:24.666 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:21:29.329 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 11:21:29.329 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430547,ok=430547,error=0, records=41
[WARN ] 2026-06-02 11:21:37.572 [30959] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:21:39.666 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:21:44.334 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 11:21:44.334 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430548,ok=430548,error=0, records=41
[INFO ] 2026-06-02 11:21:52.452 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21538/300s
[WARN ] 2026-06-02 11:21:52.578 [30952] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:21:54.667 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:21:59.339 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 11:21:59.339 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430549,ok=430549,error=0, records=41
[INFO ] 2026-06-02 11:22:05.429 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21529/300s
[WARN ] 2026-06-02 11:22:07.584 [30991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:22:09.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:22:09.668 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21541/300s
[INFO ] 2026-06-02 11:22:14.345 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 11:22:14.345 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430550,ok=430550,error=0, records=41
[WARN ] 2026-06-02 11:22:22.590 [30923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:22:24.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:22:29.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 11:22:29.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430551,ok=430551,error=0, records=41
[WARN ] 2026-06-02 11:22:37.595 [30991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:22:39.669 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:22:44.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 11:22:44.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430552,ok=430552,error=0, records=41
[WARN ] 2026-06-02 11:22:52.600 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:22:54.669 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:22:59.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 11:22:59.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430553,ok=430553,error=0, records=41
[INFO ] 2026-06-02 11:22:59.709 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21539/300s
[INFO ] 2026-06-02 11:23:01.510 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21539/300s
[WARN ] 2026-06-02 11:23:07.605 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:23:08.317 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21539/300s
[INFO ] 2026-06-02 11:23:09.670 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:23:14.369 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 11:23:14.370 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430554,ok=430554,error=0, records=41
[INFO ] 2026-06-02 11:23:19.852 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17935/300s
[INFO ] 2026-06-02 11:23:19.853 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845604},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:23:20.015 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:23:20.015 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 11:23:20.015 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:23:20.015 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:23:20.015 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:23:20.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:23:20.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:23:20.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:23:20.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:23:20.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:23:20.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:23:22.610 [30923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:23:24.671 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:23:29.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 11:23:29.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430555,ok=430555,error=0, records=41
[WARN ] 2026-06-02 11:23:37.615 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:23:39.671 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 11:23:39.671 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 11:23:44.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 11:23:44.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430556,ok=430556,error=0, records=41
[WARN ] 2026-06-02 11:23:52.620 [30991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:23:54.672 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:23:54.672 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 11:23:59.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 11:23:59.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430557,ok=430557,error=0, records=41
[WARN ] 2026-06-02 11:24:07.625 [30986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:24:09.674 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:24:14.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 11:24:14.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430558,ok=430558,error=0, records=41
[WARN ] 2026-06-02 11:24:22.632 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:24:24.674 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:24:29.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 11:24:29.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430559,ok=430559,error=0, records=41
[WARN ] 2026-06-02 11:24:37.637 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:24:39.675 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:24:44.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 11:24:44.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430560,ok=430560,error=0, records=41
[WARN ] 2026-06-02 11:24:52.641 [30986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:24:54.675 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:24:59.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 11:24:59.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430561,ok=430561,error=0, records=41
[INFO ] 2026-06-02 11:25:02.016 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21543/300s
[WARN ] 2026-06-02 11:25:07.646 [30991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:25:09.676 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:25:14.422 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 11:25:14.422 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430562,ok=430562,error=0, records=41
[WARN ] 2026-06-02 11:25:22.653 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:25:24.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:25:29.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 11:25:29.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430563,ok=430563,error=0, records=41
[INFO ] 2026-06-02 11:25:32.656 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21534/300s
[WARN ] 2026-06-02 11:25:37.659 [30997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:25:39.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:25:43.260 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21543/300s
[INFO ] 2026-06-02 11:25:44.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 11:25:44.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430564,ok=430564,error=0, records=41
[WARN ] 2026-06-02 11:25:52.663 [30997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:25:54.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:25:59.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 11:25:59.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430565,ok=430565,error=0, records=41
[WARN ] 2026-06-02 11:26:07.669 [30923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:26:09.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:26:14.451 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-02 11:26:14.451 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430566,ok=430566,error=0, records=41
[INFO ] 2026-06-02 11:26:14.451 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21530/300s
[INFO ] 2026-06-02 11:26:20.017 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845536},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:26:20.196 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:26:20.196 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 11:26:20.196 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:26:20.196 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:26:20.196 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:26:20.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:26:20.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:26:20.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:26:20.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:26:20.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:26:20.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:26:22.674 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:26:24.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:26:29.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-02 11:26:29.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430567,ok=430567,error=0, records=41
[WARN ] 2026-06-02 11:26:37.679 [30923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:26:39.680 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:26:44.519 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-02 11:26:44.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430568,ok=430568,error=0, records=41
[INFO ] 2026-06-02 11:26:52.506 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21539/300s
[WARN ] 2026-06-02 11:26:52.684 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:26:54.680 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:27:00.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-02 11:27:00.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430569,ok=430569,error=0, records=41
[INFO ] 2026-06-02 11:27:05.612 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21530/300s
[WARN ] 2026-06-02 11:27:07.689 [30923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:27:09.681 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:27:09.681 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21542/300s
[INFO ] 2026-06-02 11:27:15.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 11:27:15.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430570,ok=430570,error=0, records=41
[WARN ] 2026-06-02 11:27:22.695 [30986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:27:24.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:27:30.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 11:27:30.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430571,ok=430571,error=0, records=41
[WARN ] 2026-06-02 11:27:37.700 [30923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:27:39.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:27:45.641 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 11:27:45.641 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430572,ok=430572,error=0, records=41
[WARN ] 2026-06-02 11:27:52.704 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:27:54.683 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:27:59.790 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21540/300s
[INFO ] 2026-06-02 11:28:00.647 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 11:28:00.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430573,ok=430573,error=0, records=41
[INFO ] 2026-06-02 11:28:01.593 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21540/300s
[WARN ] 2026-06-02 11:28:07.710 [30997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:28:08.398 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21540/300s
[INFO ] 2026-06-02 11:28:09.683 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:28:15.652 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 11:28:15.652 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430574,ok=430574,error=0, records=41
[WARN ] 2026-06-02 11:28:22.715 [30991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:28:24.684 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:28:30.659 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 11:28:30.659 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430575,ok=430575,error=0, records=41
[WARN ] 2026-06-02 11:28:37.720 [30997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:28:39.685 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:28:45.664 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 11:28:45.664 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430576,ok=430576,error=0, records=41
[WARN ] 2026-06-02 11:28:52.725 [30991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:28:54.685 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:29:00.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 11:29:00.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430577,ok=430577,error=0, records=41
[WARN ] 2026-06-02 11:29:07.730 [30986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:29:09.686 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:29:15.676 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-02 11:29:15.676 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430578,ok=430578,error=0, records=41
[INFO ] 2026-06-02 11:29:20.196 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17936/300s
[INFO ] 2026-06-02 11:29:20.198 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845468},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:29:20.343 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:29:20.343 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 11:29:20.343 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:29:20.343 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:29:20.343 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:29:20.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:29:20.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:29:20.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:29:20.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:29:20.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:29:20.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:29:22.736 [30986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:29:24.686 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:29:30.681 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-02 11:29:30.681 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430579,ok=430579,error=0, records=41
[WARN ] 2026-06-02 11:29:32.740 [31010] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/18003/stat), No such file or directory
[WARN ] 2026-06-02 11:29:32.740 [31010] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17303/stat), No such file or directory
[WARN ] 2026-06-02 11:29:37.741 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:29:39.687 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:29:45.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 11:29:45.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430580,ok=430580,error=0, records=41
[WARN ] 2026-06-02 11:29:47.745 [30991] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/18003/stat), No such file or directory
[WARN ] 2026-06-02 11:29:47.745 [30991] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17303/stat), No such file or directory
[WARN ] 2026-06-02 11:29:52.745 [30991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:29:54.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:30:00.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 11:30:00.693 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430581,ok=430581,error=0, records=41
[INFO ] 2026-06-02 11:30:02.020 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21544/300s
[WARN ] 2026-06-02 11:30:07.751 [30923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:30:09.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:30:15.698 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-02 11:30:15.699 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430582,ok=430582,error=0, records=41
[WARN ] 2026-06-02 11:30:22.756 [30991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:30:24.689 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:30:30.704 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10141, records=41
[INFO ] 2026-06-02 11:30:30.704 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430583,ok=430583,error=0, records=41
[INFO ] 2026-06-02 11:30:32.759 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21535/300s
[WARN ] 2026-06-02 11:30:37.760 [30997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:30:39.689 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:30:43.266 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21544/300s
[INFO ] 2026-06-02 11:30:45.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-02 11:30:45.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430584,ok=430584,error=0, records=41
[WARN ] 2026-06-02 11:30:52.765 [30997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:30:54.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:31:00.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-02 11:31:00.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430585,ok=430585,error=0, records=41
[WARN ] 2026-06-02 11:31:07.770 [30986] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:31:09.691 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:31:15.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 11:31:15.719 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430586,ok=430586,error=0, records=41
[INFO ] 2026-06-02 11:31:15.719 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21531/300s
[WARN ] 2026-06-02 11:31:22.775 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:31:24.691 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:31:30.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-02 11:31:30.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430587,ok=430587,error=0, records=41
[WARN ] 2026-06-02 11:31:37.780 [30997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:31:39.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:31:45.731 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10399, records=41
[INFO ] 2026-06-02 11:31:45.731 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430588,ok=430588,error=0, records=41
[INFO ] 2026-06-02 11:31:52.562 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21540/300s
[WARN ] 2026-06-02 11:31:52.785 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:31:54.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:32:00.736 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10414, records=41
[INFO ] 2026-06-02 11:32:00.736 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430589,ok=430589,error=0, records=41
[INFO ] 2026-06-02 11:32:05.789 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21531/300s
[WARN ] 2026-06-02 11:32:07.791 [31010] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:32:09.693 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:32:09.693 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21543/300s
[INFO ] 2026-06-02 11:32:15.742 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 11:32:15.742 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430590,ok=430590,error=0, records=41
[WARN ] 2026-06-02 11:32:17.797 [30986] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17758/stat), No such file or directory
[WARN ] 2026-06-02 11:32:17.801 [30986] cloudMonitor/base_collect.cpp:241: SicGetProcessState failed, err: FeadFileContent(/proc/31520/stat), No such file or directory
[INFO ] 2026-06-02 11:32:20.345 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845384},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:32:20.501 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:32:20.501 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 11:32:20.501 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:32:20.501 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:32:20.501 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:32:20.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:32:20.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:32:20.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:32:20.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:32:20.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:32:20.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:32:22.797 [30997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:32:24.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:32:30.749 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10147, records=41
[INFO ] 2026-06-02 11:32:30.749 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430591,ok=430591,error=0, records=41
[WARN ] 2026-06-02 11:32:32.803 [31010] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17758/stat), No such file or directory
[WARN ] 2026-06-02 11:32:37.803 [30923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:32:39.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:32:45.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10131, records=41
[INFO ] 2026-06-02 11:32:45.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430592,ok=430592,error=0, records=41
[WARN ] 2026-06-02 11:32:47.809 [30991] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17758/stat), No such file or directory
[WARN ] 2026-06-02 11:32:52.809 [30997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:32:54.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:32:59.839 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21541/300s
[INFO ] 2026-06-02 11:33:00.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10112, records=41
[INFO ] 2026-06-02 11:33:00.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430593,ok=430593,error=0, records=41
[INFO ] 2026-06-02 11:33:01.640 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21541/300s
[WARN ] 2026-06-02 11:33:07.815 [30997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:33:08.445 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21541/300s
[INFO ] 2026-06-02 11:33:09.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:33:15.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-02 11:33:15.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430594,ok=430594,error=0, records=41
[WARN ] 2026-06-02 11:33:22.820 [31604] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:33:24.696 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:33:30.768 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 11:33:30.768 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430595,ok=430595,error=0, records=41
[WARN ] 2026-06-02 11:33:37.826 [31604] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:33:39.696 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 11:33:39.697 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 11:33:45.774 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 11:33:45.774 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430596,ok=430596,error=0, records=41
[WARN ] 2026-06-02 11:33:52.832 [31666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:33:54.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.51MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:34:00.779 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 11:34:00.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430597,ok=430597,error=0, records=41
[WARN ] 2026-06-02 11:34:07.837 [31666] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:34:09.698 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:34:15.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 11:34:15.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430598,ok=430598,error=0, records=41
[WARN ] 2026-06-02 11:34:22.842 [31638] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:34:24.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:34:30.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 11:34:30.789 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430599,ok=430599,error=0, records=41
[WARN ] 2026-06-02 11:34:37.848 [31638] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:34:39.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:34:45.796 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 11:34:45.796 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430600,ok=430600,error=0, records=41
[WARN ] 2026-06-02 11:34:52.853 [31717] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:34:54.700 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:35:00.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 11:35:00.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430601,ok=430601,error=0, records=41
[INFO ] 2026-06-02 11:35:02.023 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21545/300s
[WARN ] 2026-06-02 11:35:07.859 [31731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:35:09.700 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:35:15.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 11:35:15.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430602,ok=430602,error=0, records=41
[INFO ] 2026-06-02 11:35:20.502 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17937/300s
[INFO ] 2026-06-02 11:35:20.503 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845316},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:35:20.659 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:35:20.659 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 11:35:20.659 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:35:20.659 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:35:20.659 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:35:20.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:35:20.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:35:20.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:35:20.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:35:20.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:35:20.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:35:22.863 [31717] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:35:24.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:35:30.817 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 11:35:30.817 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430603,ok=430603,error=0, records=41
[INFO ] 2026-06-02 11:35:32.866 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21536/300s
[WARN ] 2026-06-02 11:35:37.868 [31717] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:35:39.702 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:35:43.272 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21545/300s
[INFO ] 2026-06-02 11:35:45.823 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 11:35:45.823 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430604,ok=430604,error=0, records=41
[WARN ] 2026-06-02 11:35:52.873 [31731] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:35:54.702 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:36:00.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 11:36:00.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430605,ok=430605,error=0, records=41
[WARN ] 2026-06-02 11:36:07.879 [31790] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:36:09.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:36:15.844 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 11:36:15.844 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430606,ok=430606,error=0, records=41
[INFO ] 2026-06-02 11:36:15.844 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21532/300s
[WARN ] 2026-06-02 11:36:22.884 [31811] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:36:24.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:36:30.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 11:36:30.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430607,ok=430607,error=0, records=41
[WARN ] 2026-06-02 11:36:37.889 [31821] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:36:39.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:36:45.868 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 11:36:45.868 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430608,ok=430608,error=0, records=41
[INFO ] 2026-06-02 11:36:52.617 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21541/300s
[WARN ] 2026-06-02 11:36:52.895 [31806] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:36:54.705 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:37:00.874 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 11:37:00.874 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430609,ok=430609,error=0, records=41
[INFO ] 2026-06-02 11:37:05.972 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21532/300s
[WARN ] 2026-06-02 11:37:07.900 [31860] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:37:09.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:37:09.706 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21544/300s
[INFO ] 2026-06-02 11:37:15.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 11:37:15.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430610,ok=430610,error=0, records=41
[WARN ] 2026-06-02 11:37:22.906 [31872] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:37:24.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:37:30.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 11:37:30.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430611,ok=430611,error=0, records=41
[WARN ] 2026-06-02 11:37:37.912 [31855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:37:39.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:37:45.891 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 11:37:45.891 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430612,ok=430612,error=0, records=41
[WARN ] 2026-06-02 11:37:52.917 [31881] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:37:54.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:37:59.905 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21542/300s
[INFO ] 2026-06-02 11:38:00.896 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 11:38:00.897 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430613,ok=430613,error=0, records=41
[INFO ] 2026-06-02 11:38:01.707 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21542/300s
[WARN ] 2026-06-02 11:38:07.922 [31855] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:38:08.510 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21542/300s
[INFO ] 2026-06-02 11:38:09.708 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:38:15.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-02 11:38:15.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430614,ok=430614,error=0, records=41
[INFO ] 2026-06-02 11:38:20.660 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845248},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:38:20.820 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:38:20.820 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 11:38:20.820 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:38:20.820 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:38:20.820 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:38:20.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:38:20.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:38:20.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:38:20.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:38:20.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:38:20.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:38:22.928 [31904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:38:24.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:38:30.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-02 11:38:30.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430615,ok=430615,error=0, records=41
[WARN ] 2026-06-02 11:38:37.934 [31881] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:38:39.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:38:45.917 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-02 11:38:45.917 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430616,ok=430616,error=0, records=41
[WARN ] 2026-06-02 11:38:52.940 [31968] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:38:54.710 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:38:54.710 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 11:39:00.922 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 11:39:00.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430617,ok=430617,error=0, records=41
[WARN ] 2026-06-02 11:39:07.946 [31930] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:39:09.711 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:39:15.927 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-02 11:39:15.927 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430618,ok=430618,error=0, records=41
[WARN ] 2026-06-02 11:39:22.951 [31930] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:39:24.712 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:39:30.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-02 11:39:30.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430619,ok=430619,error=0, records=41
[WARN ] 2026-06-02 11:39:37.956 [31974] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:39:39.713 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:39:45.947 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 11:39:45.947 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430620,ok=430620,error=0, records=41
[WARN ] 2026-06-02 11:39:52.961 [32100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:39:54.713 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:40:00.965 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10297, records=41
[INFO ] 2026-06-02 11:40:00.965 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430621,ok=430621,error=0, records=41
[INFO ] 2026-06-02 11:40:02.027 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21546/300s
[WARN ] 2026-06-02 11:40:07.972 [32123] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:40:09.714 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:40:15.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 11:40:15.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430622,ok=430622,error=0, records=41
[WARN ] 2026-06-02 11:40:17.477 [31962] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/31518/stat), No such file or directory
[WARN ] 2026-06-02 11:40:17.477 [31962] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/31510/stat), No such file or directory
[WARN ] 2026-06-02 11:40:17.477 [31962] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/31522/stat), No such file or directory
[WARN ] 2026-06-02 11:40:22.978 [32100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:40:24.714 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:40:30.978 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10060, records=41
[INFO ] 2026-06-02 11:40:30.978 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430623,ok=430623,error=0, records=41
[WARN ] 2026-06-02 11:40:32.489 [32123] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/31518/stat), No such file or directory
[WARN ] 2026-06-02 11:40:32.489 [32123] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/31510/stat), No such file or directory
[WARN ] 2026-06-02 11:40:32.489 [32123] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/31522/stat), No such file or directory
[INFO ] 2026-06-02 11:40:32.987 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21537/300s
[WARN ] 2026-06-02 11:40:37.989 [31974] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:40:39.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:40:43.278 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21546/300s
[INFO ] 2026-06-02 11:40:45.983 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10089, records=41
[INFO ] 2026-06-02 11:40:45.983 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430624,ok=430624,error=0, records=41
[WARN ] 2026-06-02 11:40:47.495 [32167] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/31521/stat), No such file or directory
[WARN ] 2026-06-02 11:40:47.496 [32167] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/31518/stat), No such file or directory
[WARN ] 2026-06-02 11:40:47.496 [32167] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/31510/stat), No such file or directory
[WARN ] 2026-06-02 11:40:47.496 [32167] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/31522/stat), No such file or directory
[WARN ] 2026-06-02 11:40:52.996 [32100] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:40:54.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:41:00.987 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10111, records=41
[INFO ] 2026-06-02 11:41:00.987 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430625,ok=430625,error=0, records=41
[WARN ] 2026-06-02 11:41:08.001 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:41:09.716 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:41:15.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 11:41:15.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430626,ok=430626,error=0, records=41
[INFO ] 2026-06-02 11:41:15.993 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21533/300s
[INFO ] 2026-06-02 11:41:20.820 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17938/300s
[INFO ] 2026-06-02 11:41:20.821 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845112},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:41:21.000 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:41:21.000 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 11:41:21.001 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:41:21.001 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:41:21.001 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:41:21.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:41:21.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:41:21.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:41:21.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:41:21.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:41:21.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:41:23.006 [31962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:41:24.717 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:41:30.998 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 11:41:30.998 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430627,ok=430627,error=0, records=41
[WARN ] 2026-06-02 11:41:38.012 [32182] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:41:39.717 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:41:46.004 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 11:41:46.004 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430628,ok=430628,error=0, records=41
[INFO ] 2026-06-02 11:41:52.680 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21542/300s
[WARN ] 2026-06-02 11:41:53.016 [32123] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:41:54.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:42:01.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 11:42:01.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430629,ok=430629,error=0, records=41
[INFO ] 2026-06-02 11:42:06.161 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21533/300s
[WARN ] 2026-06-02 11:42:08.022 [32196] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:42:09.719 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:42:09.719 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21545/300s
[INFO ] 2026-06-02 11:42:16.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-02 11:42:16.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430630,ok=430630,error=0, records=41
[WARN ] 2026-06-02 11:42:23.026 [32123] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:42:24.719 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:42:31.022 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 11:42:31.022 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430631,ok=430631,error=0, records=41
[WARN ] 2026-06-02 11:42:38.031 [32267] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:42:39.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:42:46.027 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-02 11:42:46.027 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430632,ok=430632,error=0, records=41
[WARN ] 2026-06-02 11:42:53.036 [32123] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:42:54.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:42:59.914 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21543/300s
[INFO ] 2026-06-02 11:43:01.033 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-02 11:43:01.033 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430633,ok=430633,error=0, records=41
[INFO ] 2026-06-02 11:43:01.716 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21543/300s
[WARN ] 2026-06-02 11:43:08.040 [32297] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:43:08.514 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21543/300s
[INFO ] 2026-06-02 11:43:09.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:43:16.038 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-02 11:43:16.038 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430634,ok=430634,error=0, records=41
[WARN ] 2026-06-02 11:43:23.045 [32304] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:43:24.722 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:43:31.044 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 11:43:31.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430635,ok=430635,error=0, records=41
[WARN ] 2026-06-02 11:43:38.050 [32337] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:43:39.722 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 11:43:39.723 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 11:43:46.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 11:43:46.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430636,ok=430636,error=0, records=41
[WARN ] 2026-06-02 11:43:52.555 [32354] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:43:54.723 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:44:01.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 11:44:01.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430637,ok=430637,error=0, records=41
[WARN ] 2026-06-02 11:44:07.561 [32365] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:44:09.724 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:44:16.066 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-02 11:44:16.066 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430638,ok=430638,error=0, records=41
[INFO ] 2026-06-02 11:44:21.002 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20845036},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:44:21.154 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:44:21.155 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 11:44:21.155 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:44:21.155 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:44:21.155 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:44:21.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:44:21.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:44:21.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:44:21.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:44:21.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:44:21.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:44:22.565 [32383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:44:24.724 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:44:31.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 11:44:31.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430639,ok=430639,error=0, records=41
[WARN ] 2026-06-02 11:44:37.571 [32304] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:44:39.725 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:44:46.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10320, records=41
[INFO ] 2026-06-02 11:44:46.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430640,ok=430640,error=0, records=41
[WARN ] 2026-06-02 11:44:52.575 [32425] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:44:54.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:45:01.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 11:45:01.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430641,ok=430641,error=0, records=41
[INFO ] 2026-06-02 11:45:02.031 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21547/300s
[WARN ] 2026-06-02 11:45:07.582 [32304] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:45:09.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:45:16.092 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-02 11:45:16.092 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430642,ok=430642,error=0, records=41
[WARN ] 2026-06-02 11:45:22.587 [32448] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:45:24.727 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:45:31.098 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-02 11:45:31.098 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430643,ok=430643,error=0, records=41
[INFO ] 2026-06-02 11:45:33.090 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21538/300s
[WARN ] 2026-06-02 11:45:37.592 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:45:39.727 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:45:43.285 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21547/300s
[INFO ] 2026-06-02 11:45:46.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 11:45:46.103 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430644,ok=430644,error=0, records=41
[WARN ] 2026-06-02 11:45:52.596 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:45:54.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:46:01.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-02 11:46:01.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430645,ok=430645,error=0, records=41
[WARN ] 2026-06-02 11:46:07.602 [32462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:46:09.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:46:16.115 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 11:46:16.115 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430646,ok=430646,error=0, records=41
[INFO ] 2026-06-02 11:46:16.115 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21534/300s
[WARN ] 2026-06-02 11:46:22.607 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:46:24.729 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:46:31.120 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 11:46:31.120 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430647,ok=430647,error=0, records=41
[WARN ] 2026-06-02 11:46:37.612 [32457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:46:39.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:46:46.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 11:46:46.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430648,ok=430648,error=0, records=41
[WARN ] 2026-06-02 11:46:52.617 [32457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:46:52.736 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21543/300s
[INFO ] 2026-06-02 11:46:54.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:47:01.142 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 11:47:01.142 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430649,ok=430649,error=0, records=41
[INFO ] 2026-06-02 11:47:06.345 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21534/300s
[WARN ] 2026-06-02 11:47:07.623 [32457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:47:09.731 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:47:09.731 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21546/300s
[INFO ] 2026-06-02 11:47:16.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-02 11:47:16.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430650,ok=430650,error=0, records=41
[INFO ] 2026-06-02 11:47:21.155 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17939/300s
[INFO ] 2026-06-02 11:47:21.156 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844968},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:47:21.333 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:47:21.333 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 11:47:21.333 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:47:21.333 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:47:21.333 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:47:21.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:47:21.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:47:21.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:47:21.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:47:21.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:47:21.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:47:22.627 [32467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:47:24.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:47:31.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 11:47:31.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430651,ok=430651,error=0, records=41
[WARN ] 2026-06-02 11:47:37.631 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:47:39.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:47:46.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 11:47:46.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430652,ok=430652,error=0, records=41
[WARN ] 2026-06-02 11:47:52.637 [32462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:47:54.733 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:47:59.962 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21544/300s
[INFO ] 2026-06-02 11:48:01.167 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 11:48:01.167 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430653,ok=430653,error=0, records=41
[INFO ] 2026-06-02 11:48:01.764 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21544/300s
[WARN ] 2026-06-02 11:48:07.642 [32438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:48:08.568 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21544/300s
[INFO ] 2026-06-02 11:48:09.734 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:48:16.173 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 11:48:16.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430654,ok=430654,error=0, records=41
[WARN ] 2026-06-02 11:48:22.648 [32462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:48:24.734 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:48:31.184 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 11:48:31.184 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430655,ok=430655,error=0, records=41
[WARN ] 2026-06-02 11:48:37.654 [32438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:48:39.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:48:46.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 11:48:46.189 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430656,ok=430656,error=0, records=41
[WARN ] 2026-06-02 11:48:52.661 [32467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:48:54.736 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:49:01.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 11:49:01.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430657,ok=430657,error=0, records=41
[WARN ] 2026-06-02 11:49:07.665 [32462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:49:09.736 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:49:16.201 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 11:49:16.201 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430658,ok=430658,error=0, records=41
[WARN ] 2026-06-02 11:49:22.670 [32467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:49:24.737 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:49:31.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 11:49:31.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430659,ok=430659,error=0, records=41
[WARN ] 2026-06-02 11:49:37.676 [32467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:49:39.737 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:49:46.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 11:49:46.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430660,ok=430660,error=0, records=41
[WARN ] 2026-06-02 11:49:52.681 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:49:54.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:50:01.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 11:50:01.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430661,ok=430661,error=0, records=41
[INFO ] 2026-06-02 11:50:02.035 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21548/300s
[WARN ] 2026-06-02 11:50:07.687 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:50:09.739 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:50:16.228 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 11:50:16.228 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430662,ok=430662,error=0, records=41
[INFO ] 2026-06-02 11:50:21.335 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844892},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:50:21.517 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:50:21.517 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 11:50:21.517 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:50:21.517 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:50:21.517 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:50:21.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:50:21.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:50:21.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:50:21.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:50:21.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:50:21.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:50:22.692 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:50:24.739 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:50:31.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 11:50:31.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430663,ok=430663,error=0, records=41
[INFO ] 2026-06-02 11:50:33.195 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21539/300s
[WARN ] 2026-06-02 11:50:37.697 [32467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:50:39.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:50:43.291 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21548/300s
[INFO ] 2026-06-02 11:50:46.241 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 11:50:46.241 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430664,ok=430664,error=0, records=41
[WARN ] 2026-06-02 11:50:52.703 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:50:54.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:51:01.247 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 11:51:01.247 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430665,ok=430665,error=0, records=41
[WARN ] 2026-06-02 11:51:07.709 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:51:09.741 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:51:16.252 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 11:51:16.252 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430666,ok=430666,error=0, records=41
[INFO ] 2026-06-02 11:51:16.252 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21535/300s
[WARN ] 2026-06-02 11:51:22.714 [32438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:51:24.741 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:51:31.273 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 11:51:31.273 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430667,ok=430667,error=0, records=41
[WARN ] 2026-06-02 11:51:37.720 [32438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:51:39.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:51:46.301 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-02 11:51:46.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430668,ok=430668,error=0, records=41
[WARN ] 2026-06-02 11:51:52.727 [32462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:51:52.790 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21544/300s
[INFO ] 2026-06-02 11:51:54.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:52:01.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 11:52:01.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430669,ok=430669,error=0, records=41
[INFO ] 2026-06-02 11:52:06.526 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21535/300s
[WARN ] 2026-06-02 11:52:07.731 [32467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:52:09.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:52:09.743 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21547/300s
[INFO ] 2026-06-02 11:52:16.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-02 11:52:16.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430670,ok=430670,error=0, records=41
[WARN ] 2026-06-02 11:52:22.736 [32462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:52:24.744 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:52:31.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-02 11:52:31.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430671,ok=430671,error=0, records=41
[WARN ] 2026-06-02 11:52:37.742 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:52:39.744 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:52:46.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-02 11:52:46.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430672,ok=430672,error=0, records=41
[WARN ] 2026-06-02 11:52:52.748 [32438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:52:54.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:53:00.015 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21545/300s
[INFO ] 2026-06-02 11:53:01.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-02 11:53:01.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430673,ok=430673,error=0, records=41
[INFO ] 2026-06-02 11:53:01.817 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21545/300s
[WARN ] 2026-06-02 11:53:07.754 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:53:08.624 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21545/300s
[INFO ] 2026-06-02 11:53:09.746 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:53:16.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-02 11:53:16.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430674,ok=430674,error=0, records=41
[INFO ] 2026-06-02 11:53:21.517 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17940/300s
[INFO ] 2026-06-02 11:53:21.519 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844828},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:53:21.683 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:53:21.683 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 11:53:21.684 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:53:21.684 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:53:21.684 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:53:21.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:53:21.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:53:21.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:53:21.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:53:21.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:53:21.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:53:22.760 [32457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:53:24.746 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:53:31.345 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-02 11:53:31.345 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430675,ok=430675,error=0, records=41
[WARN ] 2026-06-02 11:53:37.765 [32462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:53:39.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 11:53:39.747 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 11:53:46.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 11:53:46.357 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430676,ok=430676,error=0, records=41
[WARN ] 2026-06-02 11:53:52.770 [32467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:53:54.748 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:53:54.748 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 11:54:01.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 11:54:01.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430677,ok=430677,error=0, records=41
[WARN ] 2026-06-02 11:54:07.774 [32457] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:54:09.749 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:54:16.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 11:54:16.368 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430678,ok=430678,error=0, records=41
[WARN ] 2026-06-02 11:54:22.778 [32467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:54:24.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:54:31.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 11:54:31.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430679,ok=430679,error=0, records=41
[WARN ] 2026-06-02 11:54:37.783 [32478] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:54:39.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:54:46.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 11:54:46.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430680,ok=430680,error=0, records=41
[WARN ] 2026-06-02 11:54:52.788 [32467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:54:54.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:55:01.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 11:55:01.417 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430681,ok=430681,error=0, records=41
[INFO ] 2026-06-02 11:55:02.039 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21549/300s
[WARN ] 2026-06-02 11:55:07.793 [32462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:55:09.752 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:55:16.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 11:55:16.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430682,ok=430682,error=0, records=41
[WARN ] 2026-06-02 11:55:22.798 [32438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:55:24.752 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:55:31.429 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 11:55:31.430 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430683,ok=430683,error=0, records=41
[INFO ] 2026-06-02 11:55:33.301 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21540/300s
[WARN ] 2026-06-02 11:55:37.802 [32462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:55:39.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:55:43.297 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21549/300s
[INFO ] 2026-06-02 11:55:46.434 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 11:55:46.435 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430684,ok=430684,error=0, records=41
[WARN ] 2026-06-02 11:55:52.808 [32467] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:55:54.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:56:01.441 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 11:56:01.441 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430685,ok=430685,error=0, records=41
[WARN ] 2026-06-02 11:56:07.812 [588  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:56:09.754 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:56:16.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 11:56:16.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430686,ok=430686,error=0, records=41
[INFO ] 2026-06-02 11:56:16.447 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21536/300s
[INFO ] 2026-06-02 11:56:21.685 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844756},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:56:21.836 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:56:21.836 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 11:56:21.836 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:56:21.836 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:56:21.836 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:56:21.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:56:21.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:56:21.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:56:21.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:56:21.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:56:21.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:56:22.816 [598  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:56:24.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:56:31.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 11:56:31.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430687,ok=430687,error=0, records=41
[WARN ] 2026-06-02 11:56:37.821 [32462] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:56:39.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:56:46.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 11:56:46.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430688,ok=430688,error=0, records=41
[WARN ] 2026-06-02 11:56:52.826 [574  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:56:52.847 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21545/300s
[INFO ] 2026-06-02 11:56:54.756 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:57:01.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 11:57:01.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430689,ok=430689,error=0, records=41
[INFO ] 2026-06-02 11:57:06.708 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21536/300s
[WARN ] 2026-06-02 11:57:07.830 [617  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:57:09.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:57:09.757 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21548/300s
[INFO ] 2026-06-02 11:57:16.565 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 11:57:16.565 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430690,ok=430690,error=0, records=41
[WARN ] 2026-06-02 11:57:22.835 [574  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:57:24.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:57:31.571 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 11:57:31.571 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430691,ok=430691,error=0, records=41
[WARN ] 2026-06-02 11:57:37.841 [617  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:57:39.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:57:46.576 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 11:57:46.576 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430692,ok=430692,error=0, records=41
[WARN ] 2026-06-02 11:57:52.847 [683  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:57:54.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:58:00.079 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21546/300s
[INFO ] 2026-06-02 11:58:01.582 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 11:58:01.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430693,ok=430693,error=0, records=41
[INFO ] 2026-06-02 11:58:01.881 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21546/300s
[WARN ] 2026-06-02 11:58:07.852 [697  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:58:08.683 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21546/300s
[INFO ] 2026-06-02 11:58:09.759 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:58:16.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 11:58:16.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430694,ok=430694,error=0, records=41
[WARN ] 2026-06-02 11:58:22.858 [697  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:58:24.760 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:58:31.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 11:58:31.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430695,ok=430695,error=0, records=41
[WARN ] 2026-06-02 11:58:37.862 [725  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:58:39.760 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:58:46.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 11:58:46.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430696,ok=430696,error=0, records=41
[WARN ] 2026-06-02 11:58:52.867 [659  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:58:54.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:59:01.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 11:59:01.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430697,ok=430697,error=0, records=41
[WARN ] 2026-06-02 11:59:07.872 [740  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:59:09.762 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:59:16.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 11:59:16.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430698,ok=430698,error=0, records=41
[INFO ] 2026-06-02 11:59:21.836 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17941/300s
[INFO ] 2026-06-02 11:59:21.838 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844696},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 11:59:22.001 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 11:59:22.001 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 11:59:22.001 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 11:59:22.001 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 11:59:22.001 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 11:59:22.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:59:22.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 11:59:22.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:59:22.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 11:59:22.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 11:59:22.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 11:59:22.878 [776  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:59:24.762 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:59:31.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 11:59:31.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430699,ok=430699,error=0, records=41
[WARN ] 2026-06-02 11:59:37.882 [781  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:59:39.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 11:59:46.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 11:59:46.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430700,ok=430700,error=0, records=41
[WARN ] 2026-06-02 11:59:52.887 [800  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 11:59:54.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:00:01.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 12:00:01.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430701,ok=430701,error=0, records=41
[INFO ] 2026-06-02 12:00:02.042 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21550/300s
[WARN ] 2026-06-02 12:00:07.892 [781  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:00:09.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:00:16.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-02 12:00:16.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430702,ok=430702,error=0, records=41
[WARN ] 2026-06-02 12:00:22.898 [844  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:00:24.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:00:31.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 12:00:31.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430703,ok=430703,error=0, records=41
[INFO ] 2026-06-02 12:00:33.402 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21541/300s
[WARN ] 2026-06-02 12:00:37.904 [832  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:00:39.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:00:43.304 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21550/300s
[INFO ] 2026-06-02 12:00:46.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 12:00:46.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430704,ok=430704,error=0, records=41
[WARN ] 2026-06-02 12:00:52.909 [880  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:00:54.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:01:01.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 12:01:01.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430705,ok=430705,error=0, records=41
[WARN ] 2026-06-02 12:01:07.914 [945  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:01:09.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:01:16.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 12:01:16.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430706,ok=430706,error=0, records=41
[INFO ] 2026-06-02 12:01:16.667 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21537/300s
[WARN ] 2026-06-02 12:01:22.920 [956  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:01:24.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:01:31.671 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 12:01:31.671 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430707,ok=430707,error=0, records=41
[WARN ] 2026-06-02 12:01:37.925 [966  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:01:39.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:01:46.677 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 12:01:46.677 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430708,ok=430708,error=0, records=41
[INFO ] 2026-06-02 12:01:52.897 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21546/300s
[WARN ] 2026-06-02 12:01:52.931 [950  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:01:54.768 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:02:01.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 12:02:01.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430709,ok=430709,error=0, records=41
[INFO ] 2026-06-02 12:02:06.890 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21537/300s
[WARN ] 2026-06-02 12:02:07.937 [973  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:02:09.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:02:09.769 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21549/300s
[INFO ] 2026-06-02 12:02:16.754 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 12:02:16.754 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430710,ok=430710,error=0, records=41
[INFO ] 2026-06-02 12:02:22.003 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844624},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:02:22.152 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:02:22.152 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 12:02:22.152 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:02:22.152 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:02:22.152 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:02:22.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:02:22.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:02:22.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:02:22.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:02:22.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:02:22.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 12:02:22.943 [1042 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:02:24.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:02:31.763 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 12:02:31.763 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430711,ok=430711,error=0, records=41
[WARN ] 2026-06-02 12:02:37.949 [973  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:02:39.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:02:46.769 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 12:02:46.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430712,ok=430712,error=0, records=41
[WARN ] 2026-06-02 12:02:52.955 [1047 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:02:54.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:03:00.130 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21547/300s
[INFO ] 2026-06-02 12:03:01.843 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 12:03:01.844 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430713,ok=430713,error=0, records=41
[INFO ] 2026-06-02 12:03:01.932 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21547/300s
[WARN ] 2026-06-02 12:03:07.959 [1081 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:03:08.739 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21547/300s
[INFO ] 2026-06-02 12:03:09.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:03:16.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 12:03:16.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430714,ok=430714,error=0, records=41
[WARN ] 2026-06-02 12:03:22.964 [1005 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:03:24.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:03:31.871 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 12:03:31.871 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430715,ok=430715,error=0, records=41
[WARN ] 2026-06-02 12:03:37.968 [950  ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:03:39.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 12:03:39.773 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 12:03:46.878 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 12:03:46.878 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430716,ok=430716,error=0, records=41
[WARN ] 2026-06-02 12:03:52.973 [1095 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:03:54.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:04:01.885 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 12:04:01.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430717,ok=430717,error=0, records=41
[WARN ] 2026-06-02 12:04:07.977 [1123 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:04:09.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:04:16.890 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 12:04:16.890 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430718,ok=430718,error=0, records=41
[WARN ] 2026-06-02 12:04:22.982 [1109 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:04:24.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:04:31.896 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 12:04:31.896 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430719,ok=430719,error=0, records=41
[WARN ] 2026-06-02 12:04:37.987 [1165 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:04:39.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:04:46.901 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 12:04:46.901 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430720,ok=430720,error=0, records=41
[WARN ] 2026-06-02 12:04:52.992 [1005 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:04:54.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:05:01.906 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 12:05:01.906 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430721,ok=430721,error=0, records=41
[INFO ] 2026-06-02 12:05:02.046 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21551/300s
[WARN ] 2026-06-02 12:05:07.996 [1165 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:05:09.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:05:16.912 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 12:05:16.912 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430722,ok=430722,error=0, records=41
[INFO ] 2026-06-02 12:05:22.152 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17942/300s
[INFO ] 2026-06-02 12:05:22.154 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844556},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:05:22.316 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:05:22.316 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 12:05:22.316 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:05:22.316 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:05:22.316 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:05:22.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:05:22.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:05:22.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:05:22.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:05:22.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:05:22.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 12:05:23.001 [1109 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:05:24.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:05:31.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 12:05:31.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430723,ok=430723,error=0, records=41
[INFO ] 2026-06-02 12:05:33.504 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21542/300s
[WARN ] 2026-06-02 12:05:38.006 [1207 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:05:39.778 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:05:43.311 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21551/300s
[INFO ] 2026-06-02 12:05:46.990 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 12:05:46.990 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430724,ok=430724,error=0, records=41
[WARN ] 2026-06-02 12:05:53.011 [1005 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:05:54.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:06:01.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 12:06:01.996 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430725,ok=430725,error=0, records=41
[WARN ] 2026-06-02 12:06:08.017 [1005 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:06:09.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:06:17.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 12:06:17.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430726,ok=430726,error=0, records=41
[INFO ] 2026-06-02 12:06:17.002 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21538/300s
[WARN ] 2026-06-02 12:06:23.023 [1236 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:06:24.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:06:32.009 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 12:06:32.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430727,ok=430727,error=0, records=41
[WARN ] 2026-06-02 12:06:38.029 [1277 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:06:39.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:06:47.014 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 12:06:47.014 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430728,ok=430728,error=0, records=41
[INFO ] 2026-06-02 12:06:52.955 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21547/300s
[WARN ] 2026-06-02 12:06:53.035 [1236 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:06:54.781 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:07:02.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 12:07:02.023 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430729,ok=430729,error=0, records=41
[INFO ] 2026-06-02 12:07:07.071 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21538/300s
[WARN ] 2026-06-02 12:07:08.041 [1308 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:07:09.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:07:09.782 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21550/300s
[INFO ] 2026-06-02 12:07:17.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 12:07:17.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430730,ok=430730,error=0, records=41
[WARN ] 2026-06-02 12:07:23.046 [1324 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:07:24.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:07:32.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 12:07:32.036 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430731,ok=430731,error=0, records=41
[WARN ] 2026-06-02 12:07:38.052 [1340 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:07:39.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:07:47.044 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 12:07:47.044 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430732,ok=430732,error=0, records=41
[WARN ] 2026-06-02 12:07:52.557 [1358 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:07:54.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:08:00.205 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21548/300s
[INFO ] 2026-06-02 12:08:02.007 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21548/300s
[INFO ] 2026-06-02 12:08:02.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 12:08:02.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430733,ok=430733,error=0, records=41
[WARN ] 2026-06-02 12:08:07.564 [1329 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:08:08.813 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21548/300s
[INFO ] 2026-06-02 12:08:09.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:08:17.054 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 12:08:17.054 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430734,ok=430734,error=0, records=41
[INFO ] 2026-06-02 12:08:22.318 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844488},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:08:22.476 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:08:22.476 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 12:08:22.477 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:08:22.477 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:08:22.477 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:08:22.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:08:22.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:08:22.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:08:22.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:08:22.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:08:22.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 12:08:22.568 [1386 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:08:24.785 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:08:32.059 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 12:08:32.059 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430735,ok=430735,error=0, records=41
[WARN ] 2026-06-02 12:08:37.574 [1411 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:08:39.786 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:08:47.064 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 12:08:47.064 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430736,ok=430736,error=0, records=41
[WARN ] 2026-06-02 12:08:52.578 [1428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:08:54.786 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:08:54.786 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 12:09:02.071 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 12:09:02.071 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430737,ok=430737,error=0, records=41
[WARN ] 2026-06-02 12:09:07.584 [1428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:09:09.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:09:17.076 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 12:09:17.076 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430738,ok=430738,error=0, records=41
[WARN ] 2026-06-02 12:09:22.590 [1453 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:09:24.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:09:32.086 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 12:09:32.086 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430739,ok=430739,error=0, records=41
[WARN ] 2026-06-02 12:09:37.596 [1470 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:09:39.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:09:47.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 12:09:47.103 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430740,ok=430740,error=0, records=41
[WARN ] 2026-06-02 12:09:52.601 [1470 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:09:54.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:10:02.050 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21552/300s
[INFO ] 2026-06-02 12:10:02.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 12:10:02.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430741,ok=430741,error=0, records=41
[WARN ] 2026-06-02 12:10:07.606 [1453 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:10:09.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:10:17.114 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 12:10:17.115 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430742,ok=430742,error=0, records=41
[WARN ] 2026-06-02 12:10:22.613 [1469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:10:24.791 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:10:32.121 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 12:10:32.121 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430743,ok=430743,error=0, records=41
[INFO ] 2026-06-02 12:10:33.616 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21543/300s
[WARN ] 2026-06-02 12:10:37.618 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:10:39.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:10:43.318 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21552/300s
[INFO ] 2026-06-02 12:10:47.133 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 12:10:47.133 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430744,ok=430744,error=0, records=41
[WARN ] 2026-06-02 12:10:52.624 [1470 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:10:54.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:11:02.139 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 12:11:02.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430745,ok=430745,error=0, records=41
[WARN ] 2026-06-02 12:11:07.629 [1453 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:11:09.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:11:17.144 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 12:11:17.144 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430746,ok=430746,error=0, records=41
[INFO ] 2026-06-02 12:11:17.144 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21539/300s
[INFO ] 2026-06-02 12:11:22.477 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17943/300s
[INFO ] 2026-06-02 12:11:22.478 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844420},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-02 12:11:22.634 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:11:22.645 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:11:22.645 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 12:11:22.645 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:11:22.646 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:11:22.646 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:11:22.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:11:22.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:11:22.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:11:22.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:11:22.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:11:22.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:11:24.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=28.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:11:32.151 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 12:11:32.151 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430747,ok=430747,error=0, records=41
[WARN ] 2026-06-02 12:11:37.639 [1469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:11:39.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:11:47.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 12:11:47.158 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430748,ok=430748,error=0, records=41
[WARN ] 2026-06-02 12:11:52.644 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:11:53.011 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21548/300s
[INFO ] 2026-06-02 12:11:54.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:12:02.178 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 12:12:02.178 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430749,ok=430749,error=0, records=41
[INFO ] 2026-06-02 12:12:07.248 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21539/300s
[WARN ] 2026-06-02 12:12:07.649 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:12:09.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:12:09.796 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21551/300s
[INFO ] 2026-06-02 12:12:17.184 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 12:12:17.184 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430750,ok=430750,error=0, records=41
[WARN ] 2026-06-02 12:12:22.656 [1485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:12:24.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:12:32.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 12:12:32.189 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430751,ok=430751,error=0, records=41
[WARN ] 2026-06-02 12:12:37.661 [1485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:12:39.797 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:12:47.195 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 12:12:47.195 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430752,ok=430752,error=0, records=41
[WARN ] 2026-06-02 12:12:52.666 [1470 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:12:54.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:13:00.277 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21549/300s
[INFO ] 2026-06-02 12:13:02.079 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21549/300s
[INFO ] 2026-06-02 12:13:02.201 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 12:13:02.201 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430753,ok=430753,error=0, records=41
[WARN ] 2026-06-02 12:13:07.670 [1470 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:13:08.885 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21549/300s
[INFO ] 2026-06-02 12:13:09.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:13:17.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-02 12:13:17.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430754,ok=430754,error=0, records=41
[WARN ] 2026-06-02 12:13:22.675 [1485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:13:24.799 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:13:32.215 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-02 12:13:32.215 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430755,ok=430755,error=0, records=41
[WARN ] 2026-06-02 12:13:37.681 [1485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:13:39.799 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 12:13:39.800 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 12:13:47.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-02 12:13:47.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430756,ok=430756,error=0, records=41
[WARN ] 2026-06-02 12:13:52.687 [1485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:13:54.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:14:02.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-02 12:14:02.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430757,ok=430757,error=0, records=41
[WARN ] 2026-06-02 12:14:07.692 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:14:09.801 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:14:17.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 12:14:17.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430758,ok=430758,error=0, records=41
[INFO ] 2026-06-02 12:14:22.647 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844352},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-02 12:14:22.699 [1470 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:14:22.808 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:14:22.808 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 12:14:22.808 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:14:22.808 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:14:22.808 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:14:22.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:14:22.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:14:22.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:14:22.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:14:22.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:14:22.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:14:24.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:14:32.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 12:14:32.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430759,ok=430759,error=0, records=41
[WARN ] 2026-06-02 12:14:37.704 [1485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:14:39.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:14:47.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 12:14:47.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430760,ok=430760,error=0, records=41
[WARN ] 2026-06-02 12:14:52.710 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:14:54.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:15:02.053 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21553/300s
[INFO ] 2026-06-02 12:15:02.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 12:15:02.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430761,ok=430761,error=0, records=41
[WARN ] 2026-06-02 12:15:07.714 [1470 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:15:09.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:15:17.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10391, records=41
[INFO ] 2026-06-02 12:15:17.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430762,ok=430762,error=0, records=41
[WARN ] 2026-06-02 12:15:22.719 [1453 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:15:24.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:15:32.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-02 12:15:32.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430763,ok=430763,error=0, records=41
[INFO ] 2026-06-02 12:15:33.723 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21544/300s
[WARN ] 2026-06-02 12:15:37.724 [1485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:15:39.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:15:43.324 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21553/300s
[INFO ] 2026-06-02 12:15:47.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10376, records=41
[INFO ] 2026-06-02 12:15:47.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430764,ok=430764,error=0, records=41
[WARN ] 2026-06-02 12:15:52.730 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:15:54.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:16:02.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-02 12:16:02.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430765,ok=430765,error=0, records=41
[WARN ] 2026-06-02 12:16:07.735 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:16:09.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:16:17.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 12:16:17.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430766,ok=430766,error=0, records=41
[INFO ] 2026-06-02 12:16:17.292 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21540/300s
[WARN ] 2026-06-02 12:16:22.740 [1470 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:16:24.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:16:32.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 12:16:32.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430767,ok=430767,error=0, records=41
[WARN ] 2026-06-02 12:16:37.746 [1485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:16:39.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:16:47.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 12:16:47.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430768,ok=430768,error=0, records=41
[WARN ] 2026-06-02 12:16:52.751 [1453 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:16:53.070 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21549/300s
[INFO ] 2026-06-02 12:16:54.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:17:02.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 12:17:02.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430769,ok=430769,error=0, records=41
[INFO ] 2026-06-02 12:17:07.434 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21540/300s
[WARN ] 2026-06-02 12:17:07.757 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:17:09.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:17:09.809 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21552/300s
[INFO ] 2026-06-02 12:17:17.312 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 12:17:17.312 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430770,ok=430770,error=0, records=41
[WARN ] 2026-06-02 12:17:22.763 [1469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:17:22.808 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17944/300s
[INFO ] 2026-06-02 12:17:22.810 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844288},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:17:22.989 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:17:22.989 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 12:17:22.990 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:17:22.990 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:17:22.990 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:17:23.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:17:23.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:17:23.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:17:23.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:17:23.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:17:23.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:17:24.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:17:32.320 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 12:17:32.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430771,ok=430771,error=0, records=41
[WARN ] 2026-06-02 12:17:37.767 [1453 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:17:39.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:17:47.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 12:17:47.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430772,ok=430772,error=0, records=41
[WARN ] 2026-06-02 12:17:52.771 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:17:54.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:18:00.353 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21550/300s
[INFO ] 2026-06-02 12:18:02.155 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21550/300s
[INFO ] 2026-06-02 12:18:02.331 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 12:18:02.331 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430773,ok=430773,error=0, records=41
[WARN ] 2026-06-02 12:18:07.775 [1469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:18:08.961 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21550/300s
[INFO ] 2026-06-02 12:18:09.811 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:18:17.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 12:18:17.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430774,ok=430774,error=0, records=41
[WARN ] 2026-06-02 12:18:22.780 [1469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:18:24.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:18:32.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 12:18:32.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430775,ok=430775,error=0, records=41
[WARN ] 2026-06-02 12:18:37.785 [1485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:18:39.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:18:47.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 12:18:47.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430776,ok=430776,error=0, records=41
[WARN ] 2026-06-02 12:18:52.790 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:18:54.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:19:02.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 12:19:02.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430777,ok=430777,error=0, records=41
[WARN ] 2026-06-02 12:19:07.795 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:19:09.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:19:17.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 12:19:17.357 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430778,ok=430778,error=0, records=41
[WARN ] 2026-06-02 12:19:22.800 [1469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:19:24.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:19:32.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 12:19:32.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430779,ok=430779,error=0, records=41
[WARN ] 2026-06-02 12:19:37.806 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:19:39.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:19:47.370 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 12:19:47.370 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430780,ok=430780,error=0, records=41
[WARN ] 2026-06-02 12:19:52.812 [2031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:19:54.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:20:02.057 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21554/300s
[INFO ] 2026-06-02 12:20:02.377 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 12:20:02.377 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430781,ok=430781,error=0, records=41
[WARN ] 2026-06-02 12:20:07.817 [1469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:20:09.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:20:17.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=13329, records=49
[INFO ] 2026-06-02 12:20:17.383 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430782,ok=430782,error=0, records=49
[WARN ] 2026-06-02 12:20:22.822 [1485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:20:22.991 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844216},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:20:23.166 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:20:23.166 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 12:20:23.167 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:20:23.167 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:20:23.167 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:20:23.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:20:23.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:20:23.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:20:23.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:20:23.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:20:23.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:20:24.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:20:32.411 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 12:20:32.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430783,ok=430783,error=0, records=41
[INFO ] 2026-06-02 12:20:33.825 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21545/300s
[WARN ] 2026-06-02 12:20:37.828 [2017 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:20:39.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:20:43.330 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21554/300s
[INFO ] 2026-06-02 12:20:47.418 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 12:20:47.418 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430784,ok=430784,error=0, records=41
[WARN ] 2026-06-02 12:20:52.833 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:20:54.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:21:02.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 12:21:02.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430785,ok=430785,error=0, records=41
[WARN ] 2026-06-02 12:21:07.839 [2056 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:21:09.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:21:17.429 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 12:21:17.429 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430786,ok=430786,error=0, records=41
[INFO ] 2026-06-02 12:21:17.429 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21541/300s
[WARN ] 2026-06-02 12:21:22.843 [2017 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:21:24.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:21:32.436 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 12:21:32.436 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430787,ok=430787,error=0, records=41
[WARN ] 2026-06-02 12:21:37.848 [2056 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:21:39.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:21:47.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 12:21:47.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430788,ok=430788,error=0, records=41
[WARN ] 2026-06-02 12:21:52.854 [2051 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:21:53.128 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21550/300s
[INFO ] 2026-06-02 12:21:54.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:22:02.449 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 12:22:02.449 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430789,ok=430789,error=0, records=41
[INFO ] 2026-06-02 12:22:07.617 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21541/300s
[WARN ] 2026-06-02 12:22:07.859 [2135 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:22:09.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:22:09.821 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21553/300s
[INFO ] 2026-06-02 12:22:17.457 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 12:22:17.457 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430790,ok=430790,error=0, records=41
[WARN ] 2026-06-02 12:22:22.864 [1480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:22:24.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:22:32.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 12:22:32.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430791,ok=430791,error=0, records=41
[WARN ] 2026-06-02 12:22:37.868 [2163 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:22:39.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:22:47.534 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 12:22:47.534 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430792,ok=430792,error=0, records=41
[WARN ] 2026-06-02 12:22:52.872 [2051 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:22:54.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:23:00.427 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21551/300s
[INFO ] 2026-06-02 12:23:02.229 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21551/300s
[INFO ] 2026-06-02 12:23:02.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 12:23:02.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430793,ok=430793,error=0, records=41
[WARN ] 2026-06-02 12:23:07.879 [2226 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:23:09.036 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21551/300s
[INFO ] 2026-06-02 12:23:09.824 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:23:17.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 12:23:17.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430794,ok=430794,error=0, records=41
[WARN ] 2026-06-02 12:23:22.885 [2191 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:23:23.167 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17945/300s
[INFO ] 2026-06-02 12:23:23.168 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844148},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:23:23.323 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:23:23.324 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 12:23:23.324 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:23:23.324 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:23:23.324 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:23:23.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:23:23.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:23:23.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:23:23.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:23:23.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:23:23.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:23:24.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=29.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:23:32.550 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 12:23:32.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430795,ok=430795,error=0, records=41
[WARN ] 2026-06-02 12:23:37.892 [2261 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:23:39.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 12:23:39.825 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 12:23:47.556 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 12:23:47.556 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430796,ok=430796,error=0, records=41
[WARN ] 2026-06-02 12:23:52.897 [2278 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:23:54.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:23:54.826 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 12:24:02.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 12:24:02.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430797,ok=430797,error=0, records=41
[WARN ] 2026-06-02 12:24:07.902 [2295 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:24:09.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:24:17.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 12:24:17.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430798,ok=430798,error=0, records=41
[WARN ] 2026-06-02 12:24:22.907 [2312 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:24:24.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:24:32.574 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 12:24:32.574 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430799,ok=430799,error=0, records=41
[WARN ] 2026-06-02 12:24:37.912 [2289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:24:39.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:24:47.579 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 12:24:47.579 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430800,ok=430800,error=0, records=41
[WARN ] 2026-06-02 12:24:52.917 [2289 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:24:54.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:25:02.060 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21555/300s
[INFO ] 2026-06-02 12:25:02.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 12:25:02.586 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430801,ok=430801,error=0, records=41
[WARN ] 2026-06-02 12:25:07.922 [2337 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:25:09.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:25:17.591 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 12:25:17.591 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430802,ok=430802,error=0, records=41
[WARN ] 2026-06-02 12:25:22.929 [2378 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:25:24.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:25:32.597 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 12:25:32.597 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430803,ok=430803,error=0, records=41
[INFO ] 2026-06-02 12:25:33.932 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21546/300s
[WARN ] 2026-06-02 12:25:37.933 [2354 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:25:39.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.42MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:25:43.337 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21555/300s
[INFO ] 2026-06-02 12:25:47.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 12:25:47.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430804,ok=430804,error=0, records=41
[WARN ] 2026-06-02 12:25:52.939 [2411 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:25:54.832 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:26:02.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-02 12:26:02.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430805,ok=430805,error=0, records=41
[WARN ] 2026-06-02 12:26:07.944 [2428 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:26:09.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:26:17.622 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 12:26:17.622 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430806,ok=430806,error=0, records=41
[INFO ] 2026-06-02 12:26:17.622 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21542/300s
[WARN ] 2026-06-02 12:26:22.951 [2438 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:26:23.325 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844084},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:26:23.486 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:26:23.486 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 12:26:23.486 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:26:23.486 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:26:23.486 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:26:23.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:26:23.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:26:23.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:26:23.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:26:23.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:26:23.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:26:24.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:26:32.629 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-02 12:26:32.629 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430807,ok=430807,error=0, records=41
[WARN ] 2026-06-02 12:26:37.955 [2453 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:26:39.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:26:47.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-02 12:26:47.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430808,ok=430808,error=0, records=41
[WARN ] 2026-06-02 12:26:52.961 [2438 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:26:53.186 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21551/300s
[INFO ] 2026-06-02 12:26:54.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:27:02.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 12:27:02.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430809,ok=430809,error=0, records=41
[INFO ] 2026-06-02 12:27:07.802 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21542/300s
[WARN ] 2026-06-02 12:27:07.965 [2438 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:27:09.835 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:27:09.835 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21554/300s
[INFO ] 2026-06-02 12:27:17.648 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 12:27:17.648 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430810,ok=430810,error=0, records=41
[WARN ] 2026-06-02 12:27:22.969 [2438 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:27:24.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:27:32.654 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 12:27:32.654 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430811,ok=430811,error=0, records=41
[WARN ] 2026-06-02 12:27:37.974 [2438 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:27:39.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:27:47.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 12:27:47.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430812,ok=430812,error=0, records=41
[WARN ] 2026-06-02 12:27:52.978 [2524 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:27:54.837 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:28:00.504 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21552/300s
[INFO ] 2026-06-02 12:28:02.306 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21552/300s
[INFO ] 2026-06-02 12:28:02.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 12:28:02.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430813,ok=430813,error=0, records=41
[WARN ] 2026-06-02 12:28:07.983 [2438 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:28:09.112 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21552/300s
[INFO ] 2026-06-02 12:28:09.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:28:17.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 12:28:17.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430814,ok=430814,error=0, records=41
[WARN ] 2026-06-02 12:28:22.988 [2538 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:28:24.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:28:32.677 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 12:28:32.677 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430815,ok=430815,error=0, records=41
[WARN ] 2026-06-02 12:28:37.993 [2438 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:28:39.839 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:28:47.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 12:28:47.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430816,ok=430816,error=0, records=41
[WARN ] 2026-06-02 12:28:52.999 [2566 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:28:54.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:29:02.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 12:29:02.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430817,ok=430817,error=0, records=41
[WARN ] 2026-06-02 12:29:08.004 [2552 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:29:09.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:29:17.694 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 12:29:17.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430818,ok=430818,error=0, records=41
[WARN ] 2026-06-02 12:29:23.008 [2580 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:29:23.486 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17946/300s
[INFO ] 2026-06-02 12:29:23.488 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20844012},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:29:23.644 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:29:23.644 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 12:29:23.644 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:29:23.644 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:29:23.644 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:29:23.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:29:23.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:29:23.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:29:23.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:29:23.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:29:23.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:29:24.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:29:32.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 12:29:32.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430819,ok=430819,error=0, records=41
[WARN ] 2026-06-02 12:29:38.015 [2538 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:29:39.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:29:47.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 12:29:47.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430820,ok=430820,error=0, records=41
[WARN ] 2026-06-02 12:29:53.019 [2594 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:29:54.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:30:02.064 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21556/300s
[INFO ] 2026-06-02 12:30:02.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 12:30:02.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430821,ok=430821,error=0, records=41
[WARN ] 2026-06-02 12:30:08.023 [2580 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:30:09.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:30:17.727 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 12:30:17.727 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430822,ok=430822,error=0, records=41
[WARN ] 2026-06-02 12:30:23.028 [2608 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:30:24.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:30:32.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 12:30:32.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430823,ok=430823,error=0, records=41
[INFO ] 2026-06-02 12:30:34.031 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21547/300s
[WARN ] 2026-06-02 12:30:38.032 [2683 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:30:39.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:30:43.344 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21556/300s
[INFO ] 2026-06-02 12:30:47.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11272, records=44
[INFO ] 2026-06-02 12:30:47.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430824,ok=430824,error=0, records=44
[WARN ] 2026-06-02 12:30:53.039 [2655 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:30:54.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:31:02.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 12:31:02.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430825,ok=430825,error=0, records=41
[WARN ] 2026-06-02 12:31:08.045 [2720 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:31:09.845 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:31:17.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 12:31:17.748 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430826,ok=430826,error=0, records=41
[INFO ] 2026-06-02 12:31:17.748 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21543/300s
[WARN ] 2026-06-02 12:31:23.052 [2727 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:31:24.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:31:32.757 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 12:31:32.757 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430827,ok=430827,error=0, records=41
[WARN ] 2026-06-02 12:31:37.556 [2743 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:31:39.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:31:47.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 12:31:47.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430828,ok=430828,error=0, records=41
[WARN ] 2026-06-02 12:31:52.562 [2748 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:31:53.244 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21552/300s
[INFO ] 2026-06-02 12:31:54.847 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:32:02.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 12:32:02.767 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430829,ok=430829,error=0, records=41
[WARN ] 2026-06-02 12:32:07.568 [2743 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:32:07.985 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21543/300s
[INFO ] 2026-06-02 12:32:09.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:32:09.848 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21555/300s
[INFO ] 2026-06-02 12:32:17.773 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 12:32:17.773 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430830,ok=430830,error=0, records=41
[WARN ] 2026-06-02 12:32:22.573 [2776 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:32:23.646 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843936},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:32:23.821 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:32:23.821 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 12:32:23.821 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:32:23.821 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:32:23.821 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:32:23.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:32:23.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:32:23.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:32:23.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:32:23.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:32:23.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:32:24.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:32:32.779 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 12:32:32.779 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430831,ok=430831,error=0, records=41
[WARN ] 2026-06-02 12:32:37.579 [2815 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:32:39.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:32:47.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 12:32:47.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430832,ok=430832,error=0, records=41
[WARN ] 2026-06-02 12:32:52.585 [2837 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:32:54.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:33:00.572 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21553/300s
[INFO ] 2026-06-02 12:33:02.374 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21553/300s
[INFO ] 2026-06-02 12:33:02.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 12:33:02.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430833,ok=430833,error=0, records=41
[WARN ] 2026-06-02 12:33:07.589 [2849 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:33:09.180 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21553/300s
[INFO ] 2026-06-02 12:33:09.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:33:17.796 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 12:33:17.796 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430834,ok=430834,error=0, records=41
[WARN ] 2026-06-02 12:33:22.594 [2834 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:33:24.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:33:32.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 12:33:32.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430835,ok=430835,error=0, records=41
[WARN ] 2026-06-02 12:33:37.599 [2834 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:33:39.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 12:33:39.852 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 12:33:47.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 12:33:47.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430836,ok=430836,error=0, records=41
[WARN ] 2026-06-02 12:33:52.604 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:33:54.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:34:02.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 12:34:02.814 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430837,ok=430837,error=0, records=41
[WARN ] 2026-06-02 12:34:07.610 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:34:09.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:34:17.821 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 12:34:17.821 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430838,ok=430838,error=0, records=41
[WARN ] 2026-06-02 12:34:22.615 [2849 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:34:24.854 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:34:32.827 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 12:34:32.827 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430839,ok=430839,error=0, records=41
[WARN ] 2026-06-02 12:34:37.620 [2804 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:34:39.854 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:34:47.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 12:34:47.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430840,ok=430840,error=0, records=41
[WARN ] 2026-06-02 12:34:52.625 [2804 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:34:54.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:35:02.067 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21557/300s
[INFO ] 2026-06-02 12:35:02.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 12:35:02.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430841,ok=430841,error=0, records=41
[WARN ] 2026-06-02 12:35:07.630 [2804 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:35:09.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:35:17.846 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-02 12:35:17.846 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430842,ok=430842,error=0, records=41
[WARN ] 2026-06-02 12:35:22.636 [2804 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:35:23.821 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17947/300s
[INFO ] 2026-06-02 12:35:23.823 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843864},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:35:24.005 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:35:24.005 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 12:35:24.005 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:35:24.005 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:35:24.005 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:35:24.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:35:24.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:35:24.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:35:24.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:35:24.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:35:24.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:35:24.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:35:32.851 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 12:35:32.851 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430843,ok=430843,error=0, records=41
[INFO ] 2026-06-02 12:35:34.139 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21548/300s
[WARN ] 2026-06-02 12:35:37.640 [2848 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:35:39.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:35:43.351 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21557/300s
[INFO ] 2026-06-02 12:35:47.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 12:35:47.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430844,ok=430844,error=0, records=41
[WARN ] 2026-06-02 12:35:52.646 [2848 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:35:54.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:36:02.959 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 12:36:02.959 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430845,ok=430845,error=0, records=41
[WARN ] 2026-06-02 12:36:07.651 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:36:09.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:36:17.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 12:36:17.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430846,ok=430846,error=0, records=41
[INFO ] 2026-06-02 12:36:17.964 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21544/300s
[WARN ] 2026-06-02 12:36:22.656 [2804 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:36:24.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:36:32.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 12:36:32.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430847,ok=430847,error=0, records=41
[WARN ] 2026-06-02 12:36:37.661 [2804 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:36:39.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:36:47.978 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 12:36:47.978 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430848,ok=430848,error=0, records=41
[WARN ] 2026-06-02 12:36:52.667 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:36:53.301 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21553/300s
[INFO ] 2026-06-02 12:36:54.860 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:37:02.984 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 12:37:02.984 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430849,ok=430849,error=0, records=41
[WARN ] 2026-06-02 12:37:07.671 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:37:08.170 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21544/300s
[INFO ] 2026-06-02 12:37:09.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:37:09.861 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21556/300s
[INFO ] 2026-06-02 12:37:17.989 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 12:37:17.989 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430850,ok=430850,error=0, records=41
[WARN ] 2026-06-02 12:37:22.677 [2849 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:37:24.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:37:32.994 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 12:37:32.994 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430851,ok=430851,error=0, records=41
[WARN ] 2026-06-02 12:37:37.682 [2804 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:37:39.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:37:47.999 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 12:37:47.999 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430852,ok=430852,error=0, records=41
[WARN ] 2026-06-02 12:37:52.687 [2834 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:37:54.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:38:00.637 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21554/300s
[INFO ] 2026-06-02 12:38:02.439 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21554/300s
[INFO ] 2026-06-02 12:38:03.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 12:38:03.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430853,ok=430853,error=0, records=41
[WARN ] 2026-06-02 12:38:07.692 [2834 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:38:09.245 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21554/300s
[INFO ] 2026-06-02 12:38:09.863 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:38:18.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 12:38:18.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430854,ok=430854,error=0, records=41
[WARN ] 2026-06-02 12:38:22.698 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:38:24.007 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843800},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:38:24.168 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:38:24.168 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 12:38:24.168 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:38:24.168 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:38:24.168 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:38:24.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:38:24.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:38:24.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:38:24.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:38:24.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:38:24.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:38:24.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:38:33.022 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 12:38:33.022 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430855,ok=430855,error=0, records=41
[WARN ] 2026-06-02 12:38:37.702 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:38:39.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:38:48.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 12:38:48.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430856,ok=430856,error=0, records=41
[WARN ] 2026-06-02 12:38:52.707 [2834 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:38:54.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:38:54.865 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 12:39:03.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 12:39:03.037 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430857,ok=430857,error=0, records=41
[WARN ] 2026-06-02 12:39:07.712 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:39:09.866 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:39:18.042 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 12:39:18.042 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430858,ok=430858,error=0, records=41
[WARN ] 2026-06-02 12:39:22.718 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:39:24.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:39:33.048 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 12:39:33.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430859,ok=430859,error=0, records=41
[WARN ] 2026-06-02 12:39:37.723 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:39:39.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:39:48.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 12:39:48.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430860,ok=430860,error=0, records=41
[WARN ] 2026-06-02 12:39:52.727 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:39:54.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:40:02.070 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21558/300s
[INFO ] 2026-06-02 12:40:03.061 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 12:40:03.061 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430861,ok=430861,error=0, records=41
[WARN ] 2026-06-02 12:40:07.732 [2848 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:40:09.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:40:18.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 12:40:18.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430862,ok=430862,error=0, records=41
[WARN ] 2026-06-02 12:40:22.737 [2849 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:40:24.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:40:33.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 12:40:33.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430863,ok=430863,error=0, records=41
[INFO ] 2026-06-02 12:40:34.241 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21549/300s
[WARN ] 2026-06-02 12:40:37.743 [2834 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:40:39.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:40:43.358 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21558/300s
[INFO ] 2026-06-02 12:40:48.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 12:40:48.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430864,ok=430864,error=0, records=41
[WARN ] 2026-06-02 12:40:52.747 [2864 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:40:54.871 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:41:03.084 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 12:41:03.084 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430865,ok=430865,error=0, records=41
[WARN ] 2026-06-02 12:41:07.753 [2834 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:41:09.871 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:41:18.089 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 12:41:18.089 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430866,ok=430866,error=0, records=41
[INFO ] 2026-06-02 12:41:18.089 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21545/300s
[WARN ] 2026-06-02 12:41:22.758 [2848 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:41:24.168 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17948/300s
[INFO ] 2026-06-02 12:41:24.170 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843728},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:41:24.309 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:41:24.309 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 12:41:24.309 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:41:24.309 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:41:24.309 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:41:24.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:41:24.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:41:24.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:41:24.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:41:24.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:41:24.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:41:24.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:41:33.094 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 12:41:33.094 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430867,ok=430867,error=0, records=41
[WARN ] 2026-06-02 12:41:37.762 [2848 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:41:39.873 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:41:48.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 12:41:48.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430868,ok=430868,error=0, records=41
[WARN ] 2026-06-02 12:41:52.767 [2834 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:41:53.358 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21554/300s
[INFO ] 2026-06-02 12:41:54.873 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:42:03.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 12:42:03.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430869,ok=430869,error=0, records=41
[WARN ] 2026-06-02 12:42:07.771 [2804 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:42:08.349 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21545/300s
[INFO ] 2026-06-02 12:42:09.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:42:09.874 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21557/300s
[INFO ] 2026-06-02 12:42:18.113 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 12:42:18.113 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430870,ok=430870,error=0, records=41
[WARN ] 2026-06-02 12:42:22.776 [2834 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:42:24.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:42:33.118 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 12:42:33.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430871,ok=430871,error=0, records=41
[WARN ] 2026-06-02 12:42:37.780 [2849 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:42:39.875 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:42:48.126 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 12:42:48.126 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430872,ok=430872,error=0, records=41
[WARN ] 2026-06-02 12:42:52.785 [2804 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:42:54.876 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:43:00.699 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21555/300s
[INFO ] 2026-06-02 12:43:02.501 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21555/300s
[INFO ] 2026-06-02 12:43:03.131 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 12:43:03.131 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430873,ok=430873,error=0, records=41
[WARN ] 2026-06-02 12:43:07.791 [2834 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:43:09.307 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21555/300s
[INFO ] 2026-06-02 12:43:09.876 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:43:18.136 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10149, records=41
[INFO ] 2026-06-02 12:43:18.136 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430874,ok=430874,error=0, records=41
[WARN ] 2026-06-02 12:43:22.796 [2848 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:43:24.877 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:43:33.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-02 12:43:33.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430875,ok=430875,error=0, records=41
[WARN ] 2026-06-02 12:43:37.800 [2804 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:43:39.877 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 12:43:39.878 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 12:43:48.150 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-02 12:43:48.150 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430876,ok=430876,error=0, records=41
[WARN ] 2026-06-02 12:43:52.805 [3426 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:43:54.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:44:03.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 12:44:03.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430877,ok=430877,error=0, records=41
[WARN ] 2026-06-02 12:44:07.810 [3426 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:44:09.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:44:18.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 12:44:18.170 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430878,ok=430878,error=0, records=41
[WARN ] 2026-06-02 12:44:22.815 [2848 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:44:24.311 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843668},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:44:24.488 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:44:24.488 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 12:44:24.488 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:44:24.488 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:44:24.488 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:44:24.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:44:24.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:44:24.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:44:24.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:44:24.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:44:24.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:44:24.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:44:33.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 12:44:33.175 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430879,ok=430879,error=0, records=41
[WARN ] 2026-06-02 12:44:37.820 [3449 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:44:39.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:44:48.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 12:44:48.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430880,ok=430880,error=0, records=41
[WARN ] 2026-06-02 12:44:52.825 [2848 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:44:54.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:45:02.073 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21559/300s
[INFO ] 2026-06-02 12:45:03.223 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 12:45:03.223 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430881,ok=430881,error=0, records=41
[WARN ] 2026-06-02 12:45:07.831 [3470 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:45:09.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:45:18.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 12:45:18.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430882,ok=430882,error=0, records=41
[WARN ] 2026-06-02 12:45:22.837 [2848 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:45:24.882 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:45:33.240 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 12:45:33.240 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430883,ok=430883,error=0, records=41
[INFO ] 2026-06-02 12:45:34.340 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21550/300s
[WARN ] 2026-06-02 12:45:37.842 [3512 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:45:39.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:45:43.364 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21559/300s
[INFO ] 2026-06-02 12:45:48.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 12:45:48.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430884,ok=430884,error=0, records=41
[WARN ] 2026-06-02 12:45:52.847 [3455 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:45:54.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:46:03.252 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 12:46:03.252 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430885,ok=430885,error=0, records=41
[WARN ] 2026-06-02 12:46:07.854 [3455 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:46:09.884 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:46:18.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 12:46:18.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430886,ok=430886,error=0, records=41
[INFO ] 2026-06-02 12:46:18.259 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21546/300s
[WARN ] 2026-06-02 12:46:22.858 [3470 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:46:24.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:46:33.264 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 12:46:33.264 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430887,ok=430887,error=0, records=41
[WARN ] 2026-06-02 12:46:37.864 [3512 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:46:39.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:46:48.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 12:46:48.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430888,ok=430888,error=0, records=41
[WARN ] 2026-06-02 12:46:52.868 [3512 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:46:53.413 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21555/300s
[INFO ] 2026-06-02 12:46:54.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:47:03.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 12:47:03.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430889,ok=430889,error=0, records=41
[WARN ] 2026-06-02 12:47:07.873 [3589 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:47:08.524 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21546/300s
[INFO ] 2026-06-02 12:47:09.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:47:09.886 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21558/300s
[INFO ] 2026-06-02 12:47:18.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 12:47:18.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430890,ok=430890,error=0, records=41
[WARN ] 2026-06-02 12:47:22.877 [3619 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:47:24.488 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17949/300s
[INFO ] 2026-06-02 12:47:24.490 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843596},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:47:24.639 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:47:24.639 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 12:47:24.639 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:47:24.639 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:47:24.639 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:47:24.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:47:24.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:47:24.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:47:24.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:47:24.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:47:24.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:47:24.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:47:33.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 12:47:33.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430891,ok=430891,error=0, records=41
[WARN ] 2026-06-02 12:47:37.883 [3631 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:47:39.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:47:48.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 12:47:48.309 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430892,ok=430892,error=0, records=41
[WARN ] 2026-06-02 12:47:52.888 [3646 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:47:54.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:48:00.775 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21556/300s
[INFO ] 2026-06-02 12:48:02.577 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21556/300s
[INFO ] 2026-06-02 12:48:03.317 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 12:48:03.317 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430893,ok=430893,error=0, records=41
[WARN ] 2026-06-02 12:48:07.893 [3674 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:48:09.383 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21556/300s
[INFO ] 2026-06-02 12:48:09.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:48:18.324 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 12:48:18.324 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430894,ok=430894,error=0, records=41
[WARN ] 2026-06-02 12:48:22.898 [3691 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:48:24.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:48:33.329 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 12:48:33.329 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430895,ok=430895,error=0, records=41
[WARN ] 2026-06-02 12:48:37.903 [3701 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:48:39.890 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:48:48.334 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 12:48:48.334 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430896,ok=430896,error=0, records=41
[WARN ] 2026-06-02 12:48:52.907 [3696 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:48:54.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:49:03.369 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 12:49:03.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430897,ok=430897,error=0, records=41
[WARN ] 2026-06-02 12:49:07.912 [3696 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:49:09.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:49:18.374 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 12:49:18.374 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430898,ok=430898,error=0, records=41
[WARN ] 2026-06-02 12:49:22.918 [3696 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:49:24.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:49:33.379 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 12:49:33.379 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430899,ok=430899,error=0, records=41
[WARN ] 2026-06-02 12:49:37.923 [3701 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:49:39.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:49:48.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 12:49:48.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430900,ok=430900,error=0, records=41
[WARN ] 2026-06-02 12:49:52.928 [3768 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:49:54.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:50:02.076 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21560/300s
[INFO ] 2026-06-02 12:50:03.391 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 12:50:03.391 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430901,ok=430901,error=0, records=41
[WARN ] 2026-06-02 12:50:07.935 [3784 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:50:09.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:50:18.395 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 12:50:18.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430902,ok=430902,error=0, records=41
[WARN ] 2026-06-02 12:50:22.941 [3816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:50:24.641 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843532},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:50:24.809 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:50:24.809 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 12:50:24.809 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:50:24.809 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:50:24.809 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:50:24.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:50:24.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:50:24.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:50:24.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:50:24.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:50:24.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:50:24.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:50:33.401 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 12:50:33.401 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430903,ok=430903,error=0, records=41
[INFO ] 2026-06-02 12:50:34.445 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21551/300s
[WARN ] 2026-06-02 12:50:37.947 [3779 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:50:39.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:50:43.370 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21560/300s
[INFO ] 2026-06-02 12:50:48.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 12:50:48.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430904,ok=430904,error=0, records=41
[WARN ] 2026-06-02 12:50:52.951 [3816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:50:54.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:51:03.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-02 12:51:03.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430905,ok=430905,error=0, records=41
[WARN ] 2026-06-02 12:51:07.957 [3854 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:51:09.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:51:18.419 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-02 12:51:18.419 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430906,ok=430906,error=0, records=41
[INFO ] 2026-06-02 12:51:18.420 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21547/300s
[WARN ] 2026-06-02 12:51:22.961 [3821 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:51:24.897 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:51:33.424 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-02 12:51:33.424 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430907,ok=430907,error=0, records=41
[WARN ] 2026-06-02 12:51:37.967 [3882 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:51:39.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:51:48.439 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-02 12:51:48.439 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430908,ok=430908,error=0, records=41
[WARN ] 2026-06-02 12:51:52.972 [3821 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:51:53.470 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21556/300s
[INFO ] 2026-06-02 12:51:54.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:52:03.445 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 12:52:03.445 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430909,ok=430909,error=0, records=41
[WARN ] 2026-06-02 12:52:07.976 [3816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:52:08.711 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21547/300s
[INFO ] 2026-06-02 12:52:09.899 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:52:09.899 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21559/300s
[INFO ] 2026-06-02 12:52:18.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 12:52:18.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430910,ok=430910,error=0, records=41
[WARN ] 2026-06-02 12:52:22.981 [3816 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:52:24.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:52:33.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 12:52:33.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430911,ok=430911,error=0, records=41
[WARN ] 2026-06-02 12:52:37.985 [3923 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:52:39.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:52:48.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 12:52:48.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430912,ok=430912,error=0, records=41
[WARN ] 2026-06-02 12:52:52.989 [3923 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:52:54.901 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:53:00.848 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21557/300s
[INFO ] 2026-06-02 12:53:02.649 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21557/300s
[INFO ] 2026-06-02 12:53:03.467 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 12:53:03.467 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430913,ok=430913,error=0, records=41
[WARN ] 2026-06-02 12:53:07.995 [3923 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:53:09.455 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21557/300s
[INFO ] 2026-06-02 12:53:09.901 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:53:18.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 12:53:18.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430914,ok=430914,error=0, records=41
[WARN ] 2026-06-02 12:53:22.999 [3951 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:53:24.810 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17950/300s
[INFO ] 2026-06-02 12:53:24.811 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843468},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:53:24.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-02 12:53:24.969 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:53:24.969 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 12:53:24.969 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:53:24.969 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:53:24.969 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:53:25.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:53:25.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:53:25.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:53:25.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:53:25.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:53:25.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:53:33.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 12:53:33.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430915,ok=430915,error=0, records=41
[WARN ] 2026-06-02 12:53:38.004 [3923 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:53:39.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 12:53:39.903 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 12:53:48.485 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 12:53:48.485 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430916,ok=430916,error=0, records=41
[WARN ] 2026-06-02 12:53:53.009 [4007 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:53:54.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:53:54.903 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 12:54:03.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 12:54:03.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430917,ok=430917,error=0, records=41
[WARN ] 2026-06-02 12:54:08.014 [3937 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:54:09.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:54:18.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 12:54:18.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430918,ok=430918,error=0, records=41
[WARN ] 2026-06-02 12:54:23.019 [4007 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:54:24.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:54:33.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 12:54:33.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430919,ok=430919,error=0, records=41
[WARN ] 2026-06-02 12:54:38.024 [4021 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:54:39.906 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:54:48.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 12:54:48.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430920,ok=430920,error=0, records=41
[WARN ] 2026-06-02 12:54:53.029 [3992 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:54:54.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:55:02.080 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21561/300s
[INFO ] 2026-06-02 12:55:03.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 12:55:03.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430921,ok=430921,error=0, records=41
[WARN ] 2026-06-02 12:55:08.037 [3992 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:55:09.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:55:18.521 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 12:55:18.521 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430922,ok=430922,error=0, records=41
[WARN ] 2026-06-02 12:55:23.042 [4049 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:55:24.908 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:55:33.533 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 12:55:33.533 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430923,ok=430923,error=0, records=41
[INFO ] 2026-06-02 12:55:34.545 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21552/300s
[WARN ] 2026-06-02 12:55:38.047 [4131 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:55:39.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:55:43.377 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21561/300s
[INFO ] 2026-06-02 12:55:48.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 12:55:48.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430924,ok=430924,error=0, records=41
[WARN ] 2026-06-02 12:55:53.052 [4131 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:55:54.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:56:03.546 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 12:56:03.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430925,ok=430925,error=0, records=41
[WARN ] 2026-06-02 12:56:07.557 [4141 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:56:09.910 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:56:18.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 12:56:18.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430926,ok=430926,error=0, records=41
[INFO ] 2026-06-02 12:56:18.553 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21548/300s
[WARN ] 2026-06-02 12:56:22.563 [4141 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:56:24.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:56:24.971 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843396},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:56:25.139 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:56:25.139 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 12:56:25.139 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:56:25.139 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:56:25.139 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:56:25.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:56:25.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:56:25.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:56:25.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:56:25.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:56:25.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:56:33.558 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 12:56:33.558 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430927,ok=430927,error=0, records=41
[WARN ] 2026-06-02 12:56:37.569 [4202 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:56:39.912 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:56:48.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 12:56:48.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430928,ok=430928,error=0, records=41
[WARN ] 2026-06-02 12:56:52.574 [4175 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:56:53.531 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21557/300s
[INFO ] 2026-06-02 12:56:54.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:57:03.570 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 12:57:03.570 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430929,ok=430929,error=0, records=41
[WARN ] 2026-06-02 12:57:07.580 [4202 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:57:08.897 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21548/300s
[INFO ] 2026-06-02 12:57:09.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:57:09.913 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21560/300s
[INFO ] 2026-06-02 12:57:18.576 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 12:57:18.576 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430930,ok=430930,error=0, records=41
[WARN ] 2026-06-02 12:57:22.585 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:57:24.914 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:57:33.582 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 12:57:33.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430931,ok=430931,error=0, records=41
[WARN ] 2026-06-02 12:57:37.591 [4255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:57:39.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:57:48.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 12:57:48.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430932,ok=430932,error=0, records=41
[WARN ] 2026-06-02 12:57:52.597 [4255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:57:54.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:58:00.926 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21558/300s
[INFO ] 2026-06-02 12:58:02.727 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21558/300s
[INFO ] 2026-06-02 12:58:03.596 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 12:58:03.596 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430933,ok=430933,error=0, records=41
[WARN ] 2026-06-02 12:58:07.603 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:58:09.534 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21558/300s
[INFO ] 2026-06-02 12:58:09.916 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:58:18.602 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 12:58:18.602 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430934,ok=430934,error=0, records=41
[WARN ] 2026-06-02 12:58:22.608 [4266 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:58:24.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:58:33.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 12:58:33.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430935,ok=430935,error=0, records=41
[WARN ] 2026-06-02 12:58:37.615 [4255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:58:39.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:58:48.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 12:58:48.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430936,ok=430936,error=0, records=41
[WARN ] 2026-06-02 12:58:52.620 [4255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:58:54.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:59:03.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-02 12:59:03.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430937,ok=430937,error=0, records=41
[WARN ] 2026-06-02 12:59:07.625 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:59:09.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:59:18.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-02 12:59:18.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430938,ok=430938,error=0, records=41
[WARN ] 2026-06-02 12:59:22.630 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:59:24.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:59:25.140 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17951/300s
[INFO ] 2026-06-02 12:59:25.141 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843328},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 12:59:25.310 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 12:59:25.310 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 12:59:25.310 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 12:59:25.310 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 12:59:25.310 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 12:59:25.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:59:25.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 12:59:25.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:59:25.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 12:59:25.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:59:25.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 12:59:33.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-02 12:59:33.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430939,ok=430939,error=0, records=41
[WARN ] 2026-06-02 12:59:37.635 [4266 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:59:39.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 12:59:48.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-02 12:59:48.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430940,ok=430940,error=0, records=41
[WARN ] 2026-06-02 12:59:52.639 [4255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 12:59:54.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:00:02.083 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21562/300s
[INFO ] 2026-06-02 13:00:03.651 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-02 13:00:03.651 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430941,ok=430941,error=0, records=41
[WARN ] 2026-06-02 13:00:07.643 [4244 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:00:09.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:00:18.658 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-02 13:00:18.658 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430942,ok=430942,error=0, records=41
[WARN ] 2026-06-02 13:00:22.647 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:00:24.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:00:33.663 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-02 13:00:33.663 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430943,ok=430943,error=0, records=41
[INFO ] 2026-06-02 13:00:34.652 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21553/300s
[WARN ] 2026-06-02 13:00:37.653 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:00:39.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:00:43.383 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21562/300s
[INFO ] 2026-06-02 13:00:48.669 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-02 13:00:48.669 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430944,ok=430944,error=0, records=41
[WARN ] 2026-06-02 13:00:52.659 [4244 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:00:54.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:01:03.674 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10278, records=41
[INFO ] 2026-06-02 13:01:03.674 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430945,ok=430945,error=0, records=41
[WARN ] 2026-06-02 13:01:07.666 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:01:09.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:01:18.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10308, records=41
[INFO ] 2026-06-02 13:01:18.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430946,ok=430946,error=0, records=41
[INFO ] 2026-06-02 13:01:18.679 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21549/300s
[WARN ] 2026-06-02 13:01:22.671 [4244 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:01:24.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:01:33.684 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 13:01:33.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430947,ok=430947,error=0, records=41
[WARN ] 2026-06-02 13:01:37.675 [4266 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:01:39.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:01:48.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 13:01:48.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430948,ok=430948,error=0, records=41
[WARN ] 2026-06-02 13:01:52.681 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:01:53.587 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21558/300s
[INFO ] 2026-06-02 13:01:54.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:02:03.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-02 13:02:03.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430949,ok=430949,error=0, records=41
[WARN ] 2026-06-02 13:02:07.687 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:02:09.081 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21549/300s
[INFO ] 2026-06-02 13:02:09.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:02:09.925 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21561/300s
[INFO ] 2026-06-02 13:02:18.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-02 13:02:18.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430950,ok=430950,error=0, records=41
[WARN ] 2026-06-02 13:02:22.693 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:02:24.926 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:02:25.312 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843244},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:02:25.491 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:02:25.491 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 13:02:25.491 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:02:25.491 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:02:25.491 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:02:25.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:02:25.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:02:25.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:02:25.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:02:25.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:02:25.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:02:33.709 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-02 13:02:33.709 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430951,ok=430951,error=0, records=41
[WARN ] 2026-06-02 13:02:37.697 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:02:39.926 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:02:48.716 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-02 13:02:48.716 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430952,ok=430952,error=0, records=41
[WARN ] 2026-06-02 13:02:52.703 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:02:54.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:03:00.986 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21559/300s
[INFO ] 2026-06-02 13:03:02.788 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21559/300s
[INFO ] 2026-06-02 13:03:03.736 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 13:03:03.736 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430953,ok=430953,error=0, records=41
[WARN ] 2026-06-02 13:03:07.708 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:03:09.594 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21559/300s
[INFO ] 2026-06-02 13:03:09.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:03:18.741 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-02 13:03:18.741 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430954,ok=430954,error=0, records=41
[WARN ] 2026-06-02 13:03:22.712 [4244 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:03:24.929 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:03:33.746 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-02 13:03:33.746 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430955,ok=430955,error=0, records=41
[WARN ] 2026-06-02 13:03:37.718 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:03:39.929 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 13:03:39.929 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 13:03:48.752 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 13:03:48.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430956,ok=430956,error=0, records=41
[WARN ] 2026-06-02 13:03:52.724 [4244 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:03:54.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:04:03.759 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 13:04:03.759 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430957,ok=430957,error=0, records=41
[WARN ] 2026-06-02 13:04:07.729 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:04:09.931 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:04:18.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 13:04:18.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430958,ok=430958,error=0, records=41
[WARN ] 2026-06-02 13:04:22.735 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:04:24.931 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:04:33.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-02 13:04:33.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430959,ok=430959,error=0, records=41
[WARN ] 2026-06-02 13:04:37.740 [4266 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:04:39.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:04:48.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-02 13:04:48.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430960,ok=430960,error=0, records=41
[WARN ] 2026-06-02 13:04:52.746 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:04:54.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:05:02.087 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21563/300s
[INFO ] 2026-06-02 13:05:03.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 13:05:03.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430961,ok=430961,error=0, records=41
[WARN ] 2026-06-02 13:05:07.751 [4255 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:05:09.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:05:18.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 13:05:18.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430962,ok=430962,error=0, records=41
[WARN ] 2026-06-02 13:05:22.756 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:05:24.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:05:25.491 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17952/300s
[INFO ] 2026-06-02 13:05:25.493 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843176},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:05:25.658 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:05:25.658 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 13:05:25.658 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:05:25.658 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:05:25.658 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:05:25.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:05:25.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:05:25.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:05:25.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:05:25.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:05:25.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:05:33.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 13:05:33.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430963,ok=430963,error=0, records=41
[INFO ] 2026-06-02 13:05:34.760 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21554/300s
[WARN ] 2026-06-02 13:05:37.762 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:05:39.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:05:43.389 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21563/300s
[INFO ] 2026-06-02 13:05:48.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 13:05:48.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430964,ok=430964,error=0, records=41
[WARN ] 2026-06-02 13:05:52.766 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:05:54.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:06:03.878 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 13:06:03.878 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430965,ok=430965,error=0, records=41
[WARN ] 2026-06-02 13:06:07.771 [4244 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:06:09.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:06:18.883 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-02 13:06:18.883 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430966,ok=430966,error=0, records=41
[INFO ] 2026-06-02 13:06:18.883 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21550/300s
[WARN ] 2026-06-02 13:06:22.775 [4244 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:06:24.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:06:33.890 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10366, records=41
[INFO ] 2026-06-02 13:06:33.890 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430967,ok=430967,error=0, records=41
[WARN ] 2026-06-02 13:06:37.782 [4244 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:06:39.937 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:06:48.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 13:06:48.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430968,ok=430968,error=0, records=41
[WARN ] 2026-06-02 13:06:52.787 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:06:53.646 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21559/300s
[INFO ] 2026-06-02 13:06:54.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:07:03.903 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 13:07:03.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430969,ok=430969,error=0, records=41
[WARN ] 2026-06-02 13:07:07.792 [4266 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:07:09.265 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21550/300s
[INFO ] 2026-06-02 13:07:09.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:07:09.938 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21562/300s
[INFO ] 2026-06-02 13:07:18.910 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 13:07:18.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430970,ok=430970,error=0, records=41
[WARN ] 2026-06-02 13:07:22.796 [4266 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:07:24.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:07:33.916 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 13:07:33.916 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430971,ok=430971,error=0, records=41
[WARN ] 2026-06-02 13:07:37.802 [4266 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:07:39.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:07:48.975 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 13:07:48.975 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430972,ok=430972,error=0, records=41
[WARN ] 2026-06-02 13:07:52.807 [4815 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:07:54.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:08:01.060 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21560/300s
[INFO ] 2026-06-02 13:08:02.862 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21560/300s
[INFO ] 2026-06-02 13:08:03.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 13:08:03.981 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430973,ok=430973,error=0, records=41
[WARN ] 2026-06-02 13:08:07.812 [4229 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:08:09.667 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21560/300s
[INFO ] 2026-06-02 13:08:09.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:08:18.987 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 13:08:18.987 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430974,ok=430974,error=0, records=41
[WARN ] 2026-06-02 13:08:22.818 [4853 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:08:24.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:08:25.660 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843108},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:08:25.835 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:08:25.835 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 13:08:25.835 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:08:25.836 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:08:25.836 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:08:25.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:08:25.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:08:25.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:08:25.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:08:25.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:08:25.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:08:33.992 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 13:08:33.992 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430975,ok=430975,error=0, records=41
[WARN ] 2026-06-02 13:08:37.822 [4839 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:08:39.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:08:48.997 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 13:08:48.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430976,ok=430976,error=0, records=41
[WARN ] 2026-06-02 13:08:52.827 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:08:54.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:08:54.943 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 13:09:04.003 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 13:09:04.003 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430977,ok=430977,error=0, records=41
[WARN ] 2026-06-02 13:09:07.832 [4275 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:09:09.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=24.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:09:19.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 13:09:19.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430978,ok=430978,error=0, records=41
[WARN ] 2026-06-02 13:09:22.837 [4900 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:09:24.945 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:09:34.016 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 13:09:34.016 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430979,ok=430979,error=0, records=41
[WARN ] 2026-06-02 13:09:37.843 [4923 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:09:39.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:09:49.022 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 13:09:49.022 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430980,ok=430980,error=0, records=41
[WARN ] 2026-06-02 13:09:52.848 [4923 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:09:54.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:10:02.090 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21564/300s
[INFO ] 2026-06-02 13:10:04.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 13:10:04.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430981,ok=430981,error=0, records=41
[WARN ] 2026-06-02 13:10:07.853 [4900 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:10:09.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:10:19.034 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 13:10:19.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430982,ok=430982,error=0, records=41
[WARN ] 2026-06-02 13:10:22.858 [4937 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:10:24.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:10:34.041 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 13:10:34.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430983,ok=430983,error=0, records=41
[INFO ] 2026-06-02 13:10:34.862 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21555/300s
[WARN ] 2026-06-02 13:10:37.863 [4914 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:10:39.948 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:10:43.396 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21564/300s
[INFO ] 2026-06-02 13:10:49.046 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 13:10:49.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430984,ok=430984,error=0, records=41
[WARN ] 2026-06-02 13:10:52.867 [4900 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:10:54.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:11:04.059 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 13:11:04.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430985,ok=430985,error=0, records=41
[WARN ] 2026-06-02 13:11:07.872 [4914 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:11:09.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:11:19.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 13:11:19.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430986,ok=430986,error=0, records=41
[INFO ] 2026-06-02 13:11:19.068 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21551/300s
[WARN ] 2026-06-02 13:11:22.877 [5020 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:11:24.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:11:25.836 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17953/300s
[INFO ] 2026-06-02 13:11:25.837 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20843044},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:11:26.005 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:11:26.005 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 13:11:26.005 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:11:26.005 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:11:26.005 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:11:26.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:11:26.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:11:26.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:11:26.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:11:26.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:11:26.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:11:34.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 13:11:34.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430987,ok=430987,error=0, records=41
[WARN ] 2026-06-02 13:11:37.882 [5020 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:11:39.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:11:49.086 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 13:11:49.086 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430988,ok=430988,error=0, records=41
[WARN ] 2026-06-02 13:11:52.888 [5036 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:11:53.701 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21560/300s
[INFO ] 2026-06-02 13:11:54.951 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:12:04.092 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 13:12:04.092 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430989,ok=430989,error=0, records=41
[WARN ] 2026-06-02 13:12:07.892 [5059 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:12:09.445 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21551/300s
[INFO ] 2026-06-02 13:12:09.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:12:09.952 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21563/300s
[INFO ] 2026-06-02 13:12:19.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 13:12:19.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430990,ok=430990,error=0, records=41
[WARN ] 2026-06-02 13:12:22.898 [5075 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:12:24.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:12:34.104 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 13:12:34.104 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430991,ok=430991,error=0, records=41
[WARN ] 2026-06-02 13:12:37.902 [5087 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:12:39.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:12:49.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 13:12:49.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430992,ok=430992,error=0, records=41
[WARN ] 2026-06-02 13:12:52.907 [5129 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:12:54.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:13:01.124 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21561/300s
[INFO ] 2026-06-02 13:13:02.926 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21561/300s
[INFO ] 2026-06-02 13:13:04.118 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 13:13:04.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430993,ok=430993,error=0, records=41
[WARN ] 2026-06-02 13:13:07.913 [5145 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:13:09.732 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21561/300s
[INFO ] 2026-06-02 13:13:09.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:13:19.143 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 13:13:19.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430994,ok=430994,error=0, records=41
[WARN ] 2026-06-02 13:13:22.919 [5145 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:13:24.955 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:13:34.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 13:13:34.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430995,ok=430995,error=0, records=41
[WARN ] 2026-06-02 13:13:37.924 [5145 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:13:39.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 13:13:39.956 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 13:13:49.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 13:13:49.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430996,ok=430996,error=0, records=41
[WARN ] 2026-06-02 13:13:52.929 [5171 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:13:54.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:14:04.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 13:14:04.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430997,ok=430997,error=0, records=41
[WARN ] 2026-06-02 13:14:07.935 [5145 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:14:09.957 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:14:19.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 13:14:19.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430998,ok=430998,error=0, records=41
[WARN ] 2026-06-02 13:14:22.941 [5217 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:14:24.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:14:26.007 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842976},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:14:26.192 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:14:26.193 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 13:14:26.193 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:14:26.193 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:14:26.193 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:14:26.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:14:26.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:14:26.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:14:26.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:14:26.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:14:26.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:14:34.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 13:14:34.170 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=430999,ok=430999,error=0, records=41
[WARN ] 2026-06-02 13:14:37.947 [5217 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:14:39.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:14:49.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 13:14:49.177 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431000,ok=431000,error=0, records=41
[WARN ] 2026-06-02 13:14:52.953 [5218 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:14:54.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:15:02.094 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21565/300s
[INFO ] 2026-06-02 13:15:04.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 13:15:04.182 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431001,ok=431001,error=0, records=41
[WARN ] 2026-06-02 13:15:07.958 [5251 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:15:09.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:15:19.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-02 13:15:19.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431002,ok=431002,error=0, records=41
[WARN ] 2026-06-02 13:15:22.963 [5265 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:15:24.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:15:34.192 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10151, records=41
[INFO ] 2026-06-02 13:15:34.192 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431003,ok=431003,error=0, records=41
[INFO ] 2026-06-02 13:15:34.966 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21556/300s
[WARN ] 2026-06-02 13:15:37.967 [5279 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:15:39.961 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:15:43.402 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21565/300s
[INFO ] 2026-06-02 13:15:49.200 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 13:15:49.200 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431004,ok=431004,error=0, records=41
[WARN ] 2026-06-02 13:15:52.972 [5217 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:15:54.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:16:04.206 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 13:16:04.206 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431005,ok=431005,error=0, records=41
[WARN ] 2026-06-02 13:16:07.976 [5321 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:16:09.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:16:19.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 13:16:19.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431006,ok=431006,error=0, records=41
[INFO ] 2026-06-02 13:16:19.211 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21552/300s
[WARN ] 2026-06-02 13:16:22.981 [5230 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:16:24.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:16:34.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 13:16:34.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431007,ok=431007,error=0, records=41
[WARN ] 2026-06-02 13:16:37.986 [5230 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:16:39.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:16:49.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 13:16:49.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431008,ok=431008,error=0, records=41
[WARN ] 2026-06-02 13:16:52.991 [5217 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:16:53.761 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21561/300s
[INFO ] 2026-06-02 13:16:54.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:17:04.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-02 13:17:04.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431009,ok=431009,error=0, records=41
[WARN ] 2026-06-02 13:17:07.995 [5217 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:17:09.630 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21552/300s
[INFO ] 2026-06-02 13:17:09.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:17:09.965 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21564/300s
[INFO ] 2026-06-02 13:17:19.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-02 13:17:19.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431010,ok=431010,error=0, records=41
[WARN ] 2026-06-02 13:17:23.000 [5391 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:17:24.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:17:26.193 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17954/300s
[INFO ] 2026-06-02 13:17:26.194 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842908},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:17:26.365 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:17:26.365 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 13:17:26.365 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:17:26.365 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:17:26.365 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:17:26.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:17:26.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:17:26.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:17:26.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:17:26.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:17:26.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:17:34.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-02 13:17:34.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431011,ok=431011,error=0, records=41
[WARN ] 2026-06-02 13:17:38.006 [5217 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:17:39.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:17:49.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 13:17:49.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431012,ok=431012,error=0, records=41
[WARN ] 2026-06-02 13:17:53.010 [5377 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:17:54.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:18:01.185 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21562/300s
[INFO ] 2026-06-02 13:18:02.987 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21562/300s
[INFO ] 2026-06-02 13:18:04.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 13:18:04.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431013,ok=431013,error=0, records=41
[WARN ] 2026-06-02 13:18:08.017 [5363 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:18:09.794 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21562/300s
[INFO ] 2026-06-02 13:18:09.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:18:19.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 13:18:19.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431014,ok=431014,error=0, records=41
[WARN ] 2026-06-02 13:18:23.022 [5447 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:18:24.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:18:34.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 13:18:34.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431015,ok=431015,error=0, records=41
[WARN ] 2026-06-02 13:18:38.027 [5420 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:18:39.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:18:49.267 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 13:18:49.267 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431016,ok=431016,error=0, records=41
[WARN ] 2026-06-02 13:18:53.033 [5363 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:18:54.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:19:04.273 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 13:19:04.273 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431017,ok=431017,error=0, records=41
[WARN ] 2026-06-02 13:19:08.040 [5447 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:19:09.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:19:19.278 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 13:19:19.278 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431018,ok=431018,error=0, records=41
[WARN ] 2026-06-02 13:19:23.046 [5447 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:19:24.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:19:34.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 13:19:34.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431019,ok=431019,error=0, records=41
[WARN ] 2026-06-02 13:19:38.052 [5363 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:19:39.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:19:49.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 13:19:49.300 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431020,ok=431020,error=0, records=41
[WARN ] 2026-06-02 13:19:52.556 [5542 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:19:54.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:20:02.097 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21566/300s
[INFO ] 2026-06-02 13:20:04.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 13:20:04.306 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431021,ok=431021,error=0, records=41
[WARN ] 2026-06-02 13:20:07.562 [5568 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:20:09.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:20:19.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 13:20:19.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431022,ok=431022,error=0, records=41
[WARN ] 2026-06-02 13:20:22.567 [5579 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:20:24.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:20:26.366 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842840},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:20:26.532 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:20:26.532 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 13:20:26.532 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:20:26.532 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:20:26.532 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:20:26.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:20:26.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:20:26.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:20:26.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:20:26.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:20:26.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:20:34.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 13:20:34.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431023,ok=431023,error=0, records=41
[INFO ] 2026-06-02 13:20:35.071 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21557/300s
[WARN ] 2026-06-02 13:20:37.572 [5605 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:20:39.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:20:43.409 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21566/300s
[INFO ] 2026-06-02 13:20:49.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 13:20:49.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431024,ok=431024,error=0, records=41
[WARN ] 2026-06-02 13:20:52.576 [5616 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:20:54.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:21:04.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 13:21:04.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431025,ok=431025,error=0, records=41
[WARN ] 2026-06-02 13:21:07.581 [5641 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:21:09.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:21:19.354 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 13:21:19.354 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431026,ok=431026,error=0, records=41
[INFO ] 2026-06-02 13:21:19.354 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21553/300s
[WARN ] 2026-06-02 13:21:22.587 [5653 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:21:24.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:21:34.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 13:21:34.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431027,ok=431027,error=0, records=41
[WARN ] 2026-06-02 13:21:37.593 [5652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:21:39.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:21:49.395 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 13:21:49.395 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431028,ok=431028,error=0, records=41
[WARN ] 2026-06-02 13:21:52.597 [5684 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:21:53.818 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21562/300s
[INFO ] 2026-06-02 13:21:54.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:22:04.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 13:22:04.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431029,ok=431029,error=0, records=41
[WARN ] 2026-06-02 13:22:07.602 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:22:09.813 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21553/300s
[INFO ] 2026-06-02 13:22:09.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:22:09.977 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21565/300s
[INFO ] 2026-06-02 13:22:19.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 13:22:19.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431030,ok=431030,error=0, records=41
[WARN ] 2026-06-02 13:22:22.608 [5642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:22:24.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:22:34.413 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 13:22:34.413 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431031,ok=431031,error=0, records=41
[WARN ] 2026-06-02 13:22:37.613 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:22:39.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:22:49.419 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 13:22:49.419 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431032,ok=431032,error=0, records=41
[WARN ] 2026-06-02 13:22:52.618 [5690 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:22:54.979 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:23:01.255 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21563/300s
[INFO ] 2026-06-02 13:23:03.057 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21563/300s
[INFO ] 2026-06-02 13:23:04.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 13:23:04.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431033,ok=431033,error=0, records=41
[WARN ] 2026-06-02 13:23:07.623 [5690 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:23:09.862 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21563/300s
[INFO ] 2026-06-02 13:23:09.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:23:19.431 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 13:23:19.431 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431034,ok=431034,error=0, records=41
[WARN ] 2026-06-02 13:23:22.629 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:23:24.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:23:26.533 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17955/300s
[INFO ] 2026-06-02 13:23:26.534 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842772},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:23:26.683 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:23:26.683 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 13:23:26.683 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:23:26.683 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:23:26.683 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:23:26.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:23:26.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:23:26.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:23:26.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:23:26.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:23:26.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:23:34.436 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 13:23:34.436 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431035,ok=431035,error=0, records=41
[WARN ] 2026-06-02 13:23:37.635 [5642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:23:39.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 13:23:39.981 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 13:23:49.533 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 13:23:49.533 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431036,ok=431036,error=0, records=41
[WARN ] 2026-06-02 13:23:52.640 [5684 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:23:54.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:23:54.982 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 13:24:04.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 13:24:04.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431037,ok=431037,error=0, records=41
[WARN ] 2026-06-02 13:24:07.646 [5642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:24:09.983 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:24:19.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 13:24:19.545 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431038,ok=431038,error=0, records=41
[WARN ] 2026-06-02 13:24:22.651 [5684 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:24:24.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:24:34.550 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 13:24:34.550 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431039,ok=431039,error=0, records=41
[WARN ] 2026-06-02 13:24:37.656 [5684 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:24:39.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:24:49.556 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 13:24:49.556 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431040,ok=431040,error=0, records=41
[WARN ] 2026-06-02 13:24:52.661 [5642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:24:54.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:25:02.101 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21567/300s
[INFO ] 2026-06-02 13:25:04.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 13:25:04.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431041,ok=431041,error=0, records=41
[WARN ] 2026-06-02 13:25:07.665 [5652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:25:09.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:25:19.566 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 13:25:19.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431042,ok=431042,error=0, records=41
[WARN ] 2026-06-02 13:25:22.670 [5684 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:25:24.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:25:34.573 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 13:25:34.573 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431043,ok=431043,error=0, records=41
[INFO ] 2026-06-02 13:25:35.173 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21558/300s
[WARN ] 2026-06-02 13:25:37.674 [5684 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:25:39.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:25:43.416 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21567/300s
[INFO ] 2026-06-02 13:25:49.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 13:25:49.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431044,ok=431044,error=0, records=41
[WARN ] 2026-06-02 13:25:52.680 [5642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:25:54.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:26:04.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 13:26:04.584 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431045,ok=431045,error=0, records=41
[WARN ] 2026-06-02 13:26:07.684 [5652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:26:09.988 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:26:19.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 13:26:19.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431046,ok=431046,error=0, records=41
[INFO ] 2026-06-02 13:26:19.589 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21554/300s
[WARN ] 2026-06-02 13:26:22.690 [5690 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:26:24.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:26:26.685 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842708},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:26:26.830 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:26:26.830 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 13:26:26.830 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:26:26.831 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:26:26.831 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:26:26.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:26:26.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:26:26.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:26:26.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:26:26.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:26:26.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:26:34.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 13:26:34.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431047,ok=431047,error=0, records=41
[WARN ] 2026-06-02 13:26:37.695 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:26:39.989 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:26:49.602 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 13:26:49.602 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431048,ok=431048,error=0, records=41
[WARN ] 2026-06-02 13:26:52.700 [5684 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:26:53.875 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21563/300s
[INFO ] 2026-06-02 13:26:54.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:27:04.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-02 13:27:04.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431049,ok=431049,error=0, records=41
[WARN ] 2026-06-02 13:27:07.706 [5642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:27:09.986 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21554/300s
[INFO ] 2026-06-02 13:27:09.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:27:09.991 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21566/300s
[INFO ] 2026-06-02 13:27:19.623 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-02 13:27:19.623 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431050,ok=431050,error=0, records=41
[WARN ] 2026-06-02 13:27:22.710 [5642 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:27:24.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:27:34.628 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-02 13:27:34.628 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431051,ok=431051,error=0, records=41
[WARN ] 2026-06-02 13:27:37.715 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:27:39.992 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:27:49.632 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-02 13:27:49.632 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431052,ok=431052,error=0, records=41
[WARN ] 2026-06-02 13:27:52.720 [5652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:27:54.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:28:01.320 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21564/300s
[INFO ] 2026-06-02 13:28:03.122 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21564/300s
[INFO ] 2026-06-02 13:28:04.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 13:28:04.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431053,ok=431053,error=0, records=41
[WARN ] 2026-06-02 13:28:07.724 [5652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:28:09.928 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21564/300s
[INFO ] 2026-06-02 13:28:09.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:28:19.666 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 13:28:19.666 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431054,ok=431054,error=0, records=41
[WARN ] 2026-06-02 13:28:22.730 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:28:24.994 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:28:34.672 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 13:28:34.672 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431055,ok=431055,error=0, records=41
[WARN ] 2026-06-02 13:28:37.735 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:28:39.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:28:49.678 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 13:28:49.678 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431056,ok=431056,error=0, records=41
[WARN ] 2026-06-02 13:28:52.741 [5684 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:28:54.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:29:04.683 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 13:29:04.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431057,ok=431057,error=0, records=41
[WARN ] 2026-06-02 13:29:07.746 [5684 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:29:09.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:29:19.693 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 13:29:19.693 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431058,ok=431058,error=0, records=41
[WARN ] 2026-06-02 13:29:22.752 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:29:24.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:29:26.831 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17956/300s
[INFO ] 2026-06-02 13:29:26.832 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842644},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:29:26.984 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:29:26.985 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 13:29:26.985 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:29:26.985 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:29:26.985 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:29:27.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:29:27.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:29:27.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:29:27.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:29:27.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:29:27.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:29:34.699 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 13:29:34.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431059,ok=431059,error=0, records=41
[WARN ] 2026-06-02 13:29:37.757 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:29:39.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:29:49.705 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 13:29:49.705 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431060,ok=431060,error=0, records=41
[WARN ] 2026-06-02 13:29:52.762 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:29:54.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:30:02.104 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21568/300s
[INFO ] 2026-06-02 13:30:04.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 13:30:04.711 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431061,ok=431061,error=0, records=41
[WARN ] 2026-06-02 13:30:07.767 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:30:09.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:30:19.717 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-02 13:30:19.717 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431062,ok=431062,error=0, records=41
[WARN ] 2026-06-02 13:30:22.772 [5684 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:30:24.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:30:34.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-02 13:30:34.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431063,ok=431063,error=0, records=41
[INFO ] 2026-06-02 13:30:35.277 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21559/300s
[WARN ] 2026-06-02 13:30:37.778 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:30:39.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:30:43.422 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21568/300s
[INFO ] 2026-06-02 13:30:49.728 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10316, records=41
[INFO ] 2026-06-02 13:30:49.728 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431064,ok=431064,error=0, records=41
[WARN ] 2026-06-02 13:30:52.783 [5652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:30:55.000 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:31:04.736 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 13:31:04.736 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431065,ok=431065,error=0, records=41
[WARN ] 2026-06-02 13:31:07.788 [5675 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:31:10.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:31:19.746 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 13:31:19.746 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431066,ok=431066,error=0, records=41
[INFO ] 2026-06-02 13:31:19.746 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21555/300s
[WARN ] 2026-06-02 13:31:22.793 [5690 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:31:25.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:31:34.751 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-02 13:31:34.751 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431067,ok=431067,error=0, records=41
[WARN ] 2026-06-02 13:31:37.798 [5652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:31:40.002 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:31:49.757 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10171, records=41
[INFO ] 2026-06-02 13:31:49.757 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431068,ok=431068,error=0, records=41
[WARN ] 2026-06-02 13:31:52.803 [5690 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:31:53.931 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21564/300s
[INFO ] 2026-06-02 13:31:55.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:32:04.762 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 13:32:04.762 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431069,ok=431069,error=0, records=41
[WARN ] 2026-06-02 13:32:07.808 [5652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:32:10.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:32:10.003 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21567/300s
[INFO ] 2026-06-02 13:32:10.169 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21555/300s
[INFO ] 2026-06-02 13:32:19.768 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 13:32:19.768 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431070,ok=431070,error=0, records=41
[WARN ] 2026-06-02 13:32:22.814 [5652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:32:25.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:32:26.986 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842560},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:32:27.141 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:32:27.142 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 13:32:27.142 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:32:27.142 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:32:27.142 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:32:27.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:32:27.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:32:27.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:32:27.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:32:27.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:32:27.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:32:34.773 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 13:32:34.773 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431071,ok=431071,error=0, records=41
[WARN ] 2026-06-02 13:32:37.819 [6277 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:32:40.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:32:49.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 13:32:49.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431072,ok=431072,error=0, records=41
[WARN ] 2026-06-02 13:32:52.825 [6291 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:32:55.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:33:01.373 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21565/300s
[INFO ] 2026-06-02 13:33:03.175 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21565/300s
[INFO ] 2026-06-02 13:33:04.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 13:33:04.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431073,ok=431073,error=0, records=41
[WARN ] 2026-06-02 13:33:07.830 [6256 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:33:09.981 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21565/300s
[INFO ] 2026-06-02 13:33:10.006 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:33:19.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 13:33:19.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431074,ok=431074,error=0, records=41
[WARN ] 2026-06-02 13:33:22.835 [6291 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:33:25.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:33:34.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 13:33:34.804 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431075,ok=431075,error=0, records=41
[WARN ] 2026-06-02 13:33:37.841 [5652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:33:40.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 13:33:40.008 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 13:33:49.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 13:33:49.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431076,ok=431076,error=0, records=41
[WARN ] 2026-06-02 13:33:52.846 [6319 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:33:55.008 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:34:04.819 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 13:34:04.819 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431077,ok=431077,error=0, records=41
[WARN ] 2026-06-02 13:34:07.852 [6291 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:34:10.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:34:19.825 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-02 13:34:19.825 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431078,ok=431078,error=0, records=41
[WARN ] 2026-06-02 13:34:22.858 [6356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:34:25.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:34:34.830 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 13:34:34.830 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431079,ok=431079,error=0, records=41
[WARN ] 2026-06-02 13:34:37.864 [6356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:34:40.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:34:49.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10151, records=41
[INFO ] 2026-06-02 13:34:49.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431080,ok=431080,error=0, records=41
[WARN ] 2026-06-02 13:34:52.869 [6328 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:34:55.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:35:02.107 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21569/300s
[INFO ] 2026-06-02 13:35:04.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 13:35:04.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431081,ok=431081,error=0, records=41
[WARN ] 2026-06-02 13:35:07.875 [6418 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:35:10.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:35:19.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11566, records=49
[INFO ] 2026-06-02 13:35:19.907 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431082,ok=431082,error=0, records=49
[WARN ] 2026-06-02 13:35:22.879 [6356 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:35:25.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:35:27.142 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17957/300s
[INFO ] 2026-06-02 13:35:27.144 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842488},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:35:27.303 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:35:27.304 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 13:35:27.304 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:35:27.304 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:35:27.304 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:35:27.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:35:27.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:35:27.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:35:27.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:35:27.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:35:27.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:35:34.912 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 13:35:34.912 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431083,ok=431083,error=0, records=41
[INFO ] 2026-06-02 13:35:35.383 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21560/300s
[WARN ] 2026-06-02 13:35:37.884 [6441 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:35:40.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:35:43.428 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21569/300s
[INFO ] 2026-06-02 13:35:50.004 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-02 13:35:50.004 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431084,ok=431084,error=0, records=41
[WARN ] 2026-06-02 13:35:52.890 [6466 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:35:55.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:36:05.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 13:36:05.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431085,ok=431085,error=0, records=41
[WARN ] 2026-06-02 13:36:07.896 [6487 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:36:10.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:36:20.016 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 13:36:20.016 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431086,ok=431086,error=0, records=41
[INFO ] 2026-06-02 13:36:20.016 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21556/300s
[WARN ] 2026-06-02 13:36:22.901 [6504 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:36:25.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:36:35.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 13:36:35.023 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431087,ok=431087,error=0, records=41
[WARN ] 2026-06-02 13:36:37.907 [6515 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:36:40.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:36:50.027 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 13:36:50.027 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431088,ok=431088,error=0, records=41
[WARN ] 2026-06-02 13:36:52.913 [6498 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:36:53.982 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21565/300s
[INFO ] 2026-06-02 13:36:55.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:37:05.041 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10305, records=41
[INFO ] 2026-06-02 13:37:05.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431089,ok=431089,error=0, records=41
[WARN ] 2026-06-02 13:37:07.918 [6553 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:37:10.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:37:10.015 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21568/300s
[INFO ] 2026-06-02 13:37:10.347 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21556/300s
[INFO ] 2026-06-02 13:37:20.047 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 13:37:20.047 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431090,ok=431090,error=0, records=41
[WARN ] 2026-06-02 13:37:22.923 [6563 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:37:25.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:37:35.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 13:37:35.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431091,ok=431091,error=0, records=41
[WARN ] 2026-06-02 13:37:37.929 [6575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:37:40.017 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:37:50.058 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 13:37:50.058 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431092,ok=431092,error=0, records=41
[WARN ] 2026-06-02 13:37:52.935 [6604 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:37:55.017 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:38:01.414 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21566/300s
[INFO ] 2026-06-02 13:38:03.216 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21566/300s
[INFO ] 2026-06-02 13:38:05.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 13:38:05.064 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431093,ok=431093,error=0, records=41
[WARN ] 2026-06-02 13:38:07.942 [6575 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:38:10.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:38:10.022 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21566/300s
[INFO ] 2026-06-02 13:38:20.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 13:38:20.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431094,ok=431094,error=0, records=41
[WARN ] 2026-06-02 13:38:22.949 [6620 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:38:25.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:38:27.305 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842428},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:38:27.456 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:38:27.456 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 13:38:27.456 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:38:27.456 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:38:27.456 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:38:27.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:38:27.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:38:27.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:38:27.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:38:27.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:38:27.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:38:35.074 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=12151, records=49
[INFO ] 2026-06-02 13:38:35.074 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431095,ok=431095,error=0, records=49
[WARN ] 2026-06-02 13:38:37.955 [6630 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:38:40.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:38:50.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 13:38:50.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431096,ok=431096,error=0, records=41
[WARN ] 2026-06-02 13:38:52.960 [6629 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:38:55.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:38:55.020 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 13:39:05.090 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 13:39:05.090 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431097,ok=431097,error=0, records=41
[WARN ] 2026-06-02 13:39:07.965 [6658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:39:10.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:39:20.095 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 13:39:20.095 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431098,ok=431098,error=0, records=41
[WARN ] 2026-06-02 13:39:22.969 [6685 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:39:25.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:39:35.102 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 13:39:35.102 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431099,ok=431099,error=0, records=41
[WARN ] 2026-06-02 13:39:37.974 [6658 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:39:40.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:39:50.110 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 13:39:50.110 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431100,ok=431100,error=0, records=41
[WARN ] 2026-06-02 13:39:52.979 [6712 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:39:55.023 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:40:02.111 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21570/300s
[INFO ] 2026-06-02 13:40:05.116 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 13:40:05.116 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431101,ok=431101,error=0, records=41
[WARN ] 2026-06-02 13:40:07.985 [6630 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:40:10.023 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:40:20.124 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 13:40:20.124 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431102,ok=431102,error=0, records=41
[WARN ] 2026-06-02 13:40:22.989 [6712 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:40:25.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:40:35.129 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 13:40:35.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431103,ok=431103,error=0, records=41
[INFO ] 2026-06-02 13:40:35.493 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21561/300s
[WARN ] 2026-06-02 13:40:37.994 [6629 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:40:40.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:40:43.435 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21570/300s
[INFO ] 2026-06-02 13:40:50.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 13:40:50.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431104,ok=431104,error=0, records=41
[WARN ] 2026-06-02 13:40:52.998 [6744 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:40:55.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:41:05.141 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 13:41:05.141 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431105,ok=431105,error=0, records=41
[WARN ] 2026-06-02 13:41:08.004 [6758 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:41:10.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:41:20.146 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 13:41:20.146 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431106,ok=431106,error=0, records=41
[INFO ] 2026-06-02 13:41:20.146 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21557/300s
[WARN ] 2026-06-02 13:41:23.009 [6800 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:41:25.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:41:27.456 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17958/300s
[INFO ] 2026-06-02 13:41:27.458 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842360},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:41:27.597 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:41:27.597 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 13:41:27.597 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:41:27.597 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:41:27.597 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:41:27.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:41:27.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:41:27.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:41:27.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:41:27.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:41:27.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:41:35.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 13:41:35.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431107,ok=431107,error=0, records=41
[WARN ] 2026-06-02 13:41:38.013 [6800 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:41:40.027 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:41:50.160 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 13:41:50.160 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431108,ok=431108,error=0, records=41
[WARN ] 2026-06-02 13:41:53.017 [6815 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:41:54.040 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21566/300s
[INFO ] 2026-06-02 13:41:55.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:42:05.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 13:42:05.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431109,ok=431109,error=0, records=41
[WARN ] 2026-06-02 13:42:08.023 [6800 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:42:10.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:42:10.028 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21569/300s
[INFO ] 2026-06-02 13:42:10.530 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21557/300s
[INFO ] 2026-06-02 13:42:20.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 13:42:20.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431110,ok=431110,error=0, records=41
[WARN ] 2026-06-02 13:42:23.029 [6772 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:42:25.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:42:35.182 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 13:42:35.182 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431111,ok=431111,error=0, records=41
[WARN ] 2026-06-02 13:42:38.034 [6843 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:42:40.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:42:50.189 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 13:42:50.189 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431112,ok=431112,error=0, records=41
[WARN ] 2026-06-02 13:42:53.039 [6870 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:42:55.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:43:01.484 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21567/300s
[INFO ] 2026-06-02 13:43:03.285 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21567/300s
[INFO ] 2026-06-02 13:43:05.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 13:43:05.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431113,ok=431113,error=0, records=41
[WARN ] 2026-06-02 13:43:08.045 [6898 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:43:10.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:43:10.092 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21567/300s
[INFO ] 2026-06-02 13:43:20.203 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 13:43:20.203 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431114,ok=431114,error=0, records=41
[WARN ] 2026-06-02 13:43:23.050 [6913 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:43:25.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:43:35.209 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 13:43:35.209 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431115,ok=431115,error=0, records=41
[WARN ] 2026-06-02 13:43:37.556 [6934 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:43:40.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 13:43:40.032 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 13:43:50.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 13:43:50.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431116,ok=431116,error=0, records=41
[WARN ] 2026-06-02 13:43:52.561 [6942 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:43:55.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:44:05.223 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 13:44:05.223 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431117,ok=431117,error=0, records=41
[WARN ] 2026-06-02 13:44:07.566 [6960 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:44:10.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:44:20.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 13:44:20.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431118,ok=431118,error=0, records=41
[WARN ] 2026-06-02 13:44:22.570 [6994 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:44:25.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:44:27.598 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842296},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:44:27.784 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:44:27.784 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 13:44:27.784 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:44:27.784 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:44:27.784 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:44:27.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:44:27.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:44:27.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:44:27.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:44:27.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:44:27.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:44:35.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 13:44:35.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431119,ok=431119,error=0, records=41
[WARN ] 2026-06-02 13:44:37.574 [7005 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:44:40.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:44:50.241 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 13:44:50.241 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431120,ok=431120,error=0, records=41
[WARN ] 2026-06-02 13:44:52.578 [7018 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:44:55.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:45:02.114 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21571/300s
[INFO ] 2026-06-02 13:45:05.247 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-02 13:45:05.247 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431121,ok=431121,error=0, records=41
[WARN ] 2026-06-02 13:45:07.583 [7036 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:45:10.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:45:20.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-02 13:45:20.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431122,ok=431122,error=0, records=41
[WARN ] 2026-06-02 13:45:22.590 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:45:25.037 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:45:35.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10338, records=41
[INFO ] 2026-06-02 13:45:35.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431123,ok=431123,error=0, records=41
[INFO ] 2026-06-02 13:45:35.594 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21562/300s
[WARN ] 2026-06-02 13:45:37.595 [7074 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:45:40.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:45:43.441 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21571/300s
[INFO ] 2026-06-02 13:45:50.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-02 13:45:50.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431124,ok=431124,error=0, records=41
[WARN ] 2026-06-02 13:45:52.600 [7031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:45:55.038 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:46:05.273 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 13:46:05.273 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431125,ok=431125,error=0, records=41
[WARN ] 2026-06-02 13:46:07.605 [7089 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:46:10.039 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:46:20.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 13:46:20.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431126,ok=431126,error=0, records=41
[INFO ] 2026-06-02 13:46:20.280 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21558/300s
[WARN ] 2026-06-02 13:46:22.611 [7031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:46:25.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:46:35.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 13:46:35.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431127,ok=431127,error=0, records=41
[WARN ] 2026-06-02 13:46:37.616 [7031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:46:40.040 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:46:50.291 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 13:46:50.291 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431128,ok=431128,error=0, records=41
[WARN ] 2026-06-02 13:46:52.620 [7012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:46:54.117 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21567/300s
[INFO ] 2026-06-02 13:46:55.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:47:05.296 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 13:47:05.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431129,ok=431129,error=0, records=41
[WARN ] 2026-06-02 13:47:07.624 [7089 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:47:10.041 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:47:10.041 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21570/300s
[INFO ] 2026-06-02 13:47:10.714 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21558/300s
[INFO ] 2026-06-02 13:47:20.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 13:47:20.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431130,ok=431130,error=0, records=41
[WARN ] 2026-06-02 13:47:22.630 [7074 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:47:25.042 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:47:27.784 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17959/300s
[INFO ] 2026-06-02 13:47:27.785 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842228},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:47:27.941 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:47:27.941 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 13:47:27.942 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:47:27.942 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:47:27.942 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:47:27.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:47:27.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:47:27.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:47:27.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:47:27.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:47:27.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:47:35.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 13:47:35.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431131,ok=431131,error=0, records=41
[WARN ] 2026-06-02 13:47:37.637 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:47:40.042 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:47:50.326 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 13:47:50.326 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431132,ok=431132,error=0, records=41
[WARN ] 2026-06-02 13:47:52.642 [7012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:47:55.043 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:48:01.544 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21568/300s
[INFO ] 2026-06-02 13:48:03.347 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21568/300s
[INFO ] 2026-06-02 13:48:05.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 13:48:05.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431133,ok=431133,error=0, records=41
[WARN ] 2026-06-02 13:48:07.647 [7031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:48:10.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:48:10.153 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21568/300s
[INFO ] 2026-06-02 13:48:20.340 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 13:48:20.340 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431134,ok=431134,error=0, records=41
[WARN ] 2026-06-02 13:48:22.652 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:48:25.044 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:48:35.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 13:48:35.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431135,ok=431135,error=0, records=41
[WARN ] 2026-06-02 13:48:37.658 [7074 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:48:40.045 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:48:50.352 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 13:48:50.352 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431136,ok=431136,error=0, records=41
[WARN ] 2026-06-02 13:48:52.662 [7012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:48:55.046 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:49:05.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 13:49:05.357 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431137,ok=431137,error=0, records=41
[WARN ] 2026-06-02 13:49:07.667 [7089 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:49:10.046 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:49:20.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 13:49:20.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431138,ok=431138,error=0, records=41
[WARN ] 2026-06-02 13:49:22.673 [7031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:49:25.047 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:49:35.369 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 13:49:35.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431139,ok=431139,error=0, records=41
[WARN ] 2026-06-02 13:49:37.677 [7031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:49:40.047 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:49:50.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 13:49:50.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431140,ok=431140,error=0, records=41
[WARN ] 2026-06-02 13:49:52.683 [7012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:49:55.048 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:50:02.118 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21572/300s
[INFO ] 2026-06-02 13:50:05.384 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 13:50:05.384 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431141,ok=431141,error=0, records=41
[WARN ] 2026-06-02 13:50:07.689 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:50:10.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:50:20.392 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 13:50:20.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431142,ok=431142,error=0, records=41
[WARN ] 2026-06-02 13:50:22.694 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:50:25.049 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:50:27.943 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842164},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:50:28.098 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:50:28.098 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 13:50:28.099 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:50:28.099 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:50:28.099 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:50:28.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:50:28.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:50:28.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:50:28.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:50:28.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:50:28.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:50:35.397 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 13:50:35.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431143,ok=431143,error=0, records=41
[INFO ] 2026-06-02 13:50:35.699 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21563/300s
[WARN ] 2026-06-02 13:50:37.700 [7089 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:50:40.050 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:50:43.448 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21572/300s
[INFO ] 2026-06-02 13:50:50.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 13:50:50.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431144,ok=431144,error=0, records=41
[WARN ] 2026-06-02 13:50:52.704 [7089 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:50:55.051 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:51:05.472 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 13:51:05.472 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431145,ok=431145,error=0, records=41
[WARN ] 2026-06-02 13:51:07.709 [7074 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:51:10.051 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:51:20.478 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 13:51:20.478 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431146,ok=431146,error=0, records=41
[INFO ] 2026-06-02 13:51:20.478 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21559/300s
[WARN ] 2026-06-02 13:51:22.714 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:51:25.052 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:51:35.483 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 13:51:35.483 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431147,ok=431147,error=0, records=41
[WARN ] 2026-06-02 13:51:37.719 [7089 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:51:40.052 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:51:50.489 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 13:51:50.489 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431148,ok=431148,error=0, records=41
[WARN ] 2026-06-02 13:51:52.724 [7012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:51:54.171 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21568/300s
[INFO ] 2026-06-02 13:51:55.053 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:52:05.551 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 13:52:05.551 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431149,ok=431149,error=0, records=41
[WARN ] 2026-06-02 13:52:07.728 [7012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:52:10.054 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:52:10.054 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21571/300s
[INFO ] 2026-06-02 13:52:10.896 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21559/300s
[INFO ] 2026-06-02 13:52:20.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 13:52:20.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431150,ok=431150,error=0, records=41
[WARN ] 2026-06-02 13:52:22.734 [7012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:52:25.055 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:52:35.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 13:52:35.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431151,ok=431151,error=0, records=41
[WARN ] 2026-06-02 13:52:37.740 [7031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:52:40.055 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:52:50.570 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 13:52:50.570 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431152,ok=431152,error=0, records=41
[WARN ] 2026-06-02 13:52:52.746 [7074 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:52:55.056 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:53:01.612 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21569/300s
[INFO ] 2026-06-02 13:53:03.414 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21569/300s
[INFO ] 2026-06-02 13:53:05.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 13:53:05.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431153,ok=431153,error=0, records=41
[WARN ] 2026-06-02 13:53:07.750 [7089 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:53:10.057 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:53:10.220 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21569/300s
[INFO ] 2026-06-02 13:53:20.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 13:53:20.581 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431154,ok=431154,error=0, records=41
[WARN ] 2026-06-02 13:53:22.757 [7012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:53:25.057 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:53:28.099 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17960/300s
[INFO ] 2026-06-02 13:53:28.100 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20842096},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:53:28.264 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:53:28.264 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 13:53:28.265 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:53:28.265 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:53:28.265 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:53:28.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:53:28.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:53:28.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:53:28.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:53:28.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:53:28.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:53:35.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 13:53:35.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431155,ok=431155,error=0, records=41
[WARN ] 2026-06-02 13:53:37.761 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:53:40.058 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 13:53:40.058 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 13:53:50.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 13:53:50.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431156,ok=431156,error=0, records=41
[WARN ] 2026-06-02 13:53:52.767 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:53:55.059 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:53:55.059 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 13:54:05.604 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 13:54:05.604 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431157,ok=431157,error=0, records=41
[WARN ] 2026-06-02 13:54:07.773 [7012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:54:10.060 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:54:20.609 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 13:54:20.609 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431158,ok=431158,error=0, records=41
[WARN ] 2026-06-02 13:54:22.778 [7012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:54:25.060 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:54:35.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 13:54:35.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431159,ok=431159,error=0, records=41
[WARN ] 2026-06-02 13:54:37.783 [7074 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:54:40.061 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:54:50.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 13:54:50.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431160,ok=431160,error=0, records=41
[WARN ] 2026-06-02 13:54:52.788 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:54:55.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:55:02.122 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21573/300s
[INFO ] 2026-06-02 13:55:05.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-02 13:55:05.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431161,ok=431161,error=0, records=41
[WARN ] 2026-06-02 13:55:07.793 [7012 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:55:10.062 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:55:20.629 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-02 13:55:20.629 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431162,ok=431162,error=0, records=41
[WARN ] 2026-06-02 13:55:22.814 [7031 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:55:25.063 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 13:55:32.318 [7031 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/32052/stat), No such file or directory
[WARN ] 2026-06-02 13:55:32.319 [7031 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/32069/stat), No such file or directory
[INFO ] 2026-06-02 13:55:35.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-02 13:55:35.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431163,ok=431163,error=0, records=41
[INFO ] 2026-06-02 13:55:35.818 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21564/300s
[WARN ] 2026-06-02 13:55:37.819 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:55:40.063 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:55:43.454 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21573/300s
[WARN ] 2026-06-02 13:55:47.323 [7031 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/32052/stat), No such file or directory
[WARN ] 2026-06-02 13:55:47.324 [7031 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/32069/stat), No such file or directory
[INFO ] 2026-06-02 13:55:50.641 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-02 13:55:50.641 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431164,ok=431164,error=0, records=41
[WARN ] 2026-06-02 13:55:52.824 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:55:55.064 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:56:05.647 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 13:56:05.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431165,ok=431165,error=0, records=41
[WARN ] 2026-06-02 13:56:07.829 [7700 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:56:10.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:56:20.652 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 13:56:20.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431166,ok=431166,error=0, records=41
[INFO ] 2026-06-02 13:56:20.653 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21560/300s
[WARN ] 2026-06-02 13:56:22.834 [7700 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:56:25.065 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:56:28.266 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841968},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:56:28.443 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:56:28.443 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 13:56:28.443 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:56:28.443 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:56:28.443 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:56:28.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:56:28.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:56:28.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:56:28.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:56:28.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:56:28.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 13:56:32.340 [7700 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7630/stat), No such file or directory
[INFO ] 2026-06-02 13:56:35.726 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-02 13:56:35.726 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431167,ok=431167,error=0, records=41
[WARN ] 2026-06-02 13:56:37.839 [7670 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:56:40.066 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 13:56:47.346 [7757 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7630/stat), No such file or directory
[INFO ] 2026-06-02 13:56:50.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 13:56:50.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431168,ok=431168,error=0, records=41
[WARN ] 2026-06-02 13:56:52.845 [7670 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:56:54.228 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21569/300s
[INFO ] 2026-06-02 13:56:55.067 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:57:05.737 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-02 13:57:05.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431169,ok=431169,error=0, records=41
[WARN ] 2026-06-02 13:57:07.850 [7716 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:57:10.067 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:57:10.067 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21572/300s
[INFO ] 2026-06-02 13:57:11.087 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21560/300s
[INFO ] 2026-06-02 13:57:20.742 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-02 13:57:20.742 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431170,ok=431170,error=0, records=41
[WARN ] 2026-06-02 13:57:22.855 [7065 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:57:25.068 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:57:35.748 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 13:57:35.748 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431171,ok=431171,error=0, records=41
[WARN ] 2026-06-02 13:57:37.860 [7716 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:57:40.069 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:57:50.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-02 13:57:50.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431172,ok=431172,error=0, records=41
[WARN ] 2026-06-02 13:57:52.865 [7716 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:57:55.069 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:58:01.706 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21570/300s
[INFO ] 2026-06-02 13:58:03.513 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21570/300s
[INFO ] 2026-06-02 13:58:05.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 13:58:05.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431173,ok=431173,error=0, records=41
[WARN ] 2026-06-02 13:58:07.870 [7798 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:58:10.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:58:10.318 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21570/300s
[INFO ] 2026-06-02 13:58:20.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 13:58:20.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431174,ok=431174,error=0, records=41
[WARN ] 2026-06-02 13:58:22.876 [7716 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:58:25.070 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:58:35.772 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 13:58:35.772 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431175,ok=431175,error=0, records=41
[WARN ] 2026-06-02 13:58:37.880 [7840 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:58:40.071 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:58:50.778 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 13:58:50.778 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431176,ok=431176,error=0, records=41
[WARN ] 2026-06-02 13:58:52.886 [7868 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:58:55.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:59:05.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 13:59:05.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431177,ok=431177,error=0, records=41
[WARN ] 2026-06-02 13:59:07.892 [7841 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:59:10.072 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:59:20.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 13:59:20.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431178,ok=431178,error=0, records=41
[WARN ] 2026-06-02 13:59:22.899 [7905 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:59:25.073 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:59:28.443 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17961/300s
[INFO ] 2026-06-02 13:59:28.445 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841908},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 13:59:28.602 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 13:59:28.602 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 13:59:28.602 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 13:59:28.602 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 13:59:28.602 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 13:59:28.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:59:28.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 13:59:28.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:59:28.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 13:59:28.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:59:28.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 13:59:35.801 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 13:59:35.801 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431179,ok=431179,error=0, records=41
[WARN ] 2026-06-02 13:59:37.904 [7928 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:59:40.073 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 13:59:50.807 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 13:59:50.807 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431180,ok=431180,error=0, records=41
[WARN ] 2026-06-02 13:59:52.909 [7939 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 13:59:55.074 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:00:02.125 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21574/300s
[INFO ] 2026-06-02 14:00:05.812 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10373, records=41
[INFO ] 2026-06-02 14:00:05.812 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431181,ok=431181,error=0, records=41
[WARN ] 2026-06-02 14:00:07.915 [7945 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:00:10.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:00:20.819 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-02 14:00:20.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431182,ok=431182,error=0, records=41
[WARN ] 2026-06-02 14:00:22.921 [7945 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:00:25.075 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:00:35.831 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10394, records=41
[INFO ] 2026-06-02 14:00:35.831 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431183,ok=431183,error=0, records=41
[INFO ] 2026-06-02 14:00:35.927 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21565/300s
[WARN ] 2026-06-02 14:00:37.928 [7960 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:00:40.076 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.94MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:00:43.461 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21574/300s
[INFO ] 2026-06-02 14:00:50.837 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10405, records=41
[INFO ] 2026-06-02 14:00:50.837 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431184,ok=431184,error=0, records=41
[WARN ] 2026-06-02 14:00:52.934 [8027 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:00:55.076 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:01:05.852 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 14:01:05.852 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431185,ok=431185,error=0, records=41
[WARN ] 2026-06-02 14:01:07.940 [7960 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:01:10.077 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:01:20.857 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-02 14:01:20.857 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431186,ok=431186,error=0, records=41
[INFO ] 2026-06-02 14:01:20.857 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21561/300s
[WARN ] 2026-06-02 14:01:22.946 [8073 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:01:25.078 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:01:35.863 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 14:01:35.863 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431187,ok=431187,error=0, records=41
[WARN ] 2026-06-02 14:01:37.952 [8088 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:01:40.078 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:01:50.869 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-02 14:01:50.869 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431188,ok=431188,error=0, records=41
[WARN ] 2026-06-02 14:01:52.958 [8088 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:01:54.285 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21570/300s
[INFO ] 2026-06-02 14:01:55.079 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:02:05.874 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 14:02:05.874 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431189,ok=431189,error=0, records=41
[WARN ] 2026-06-02 14:02:07.963 [8116 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:02:10.080 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:02:10.080 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21573/300s
[INFO ] 2026-06-02 14:02:11.269 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21561/300s
[INFO ] 2026-06-02 14:02:20.879 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 14:02:20.879 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431190,ok=431190,error=0, records=41
[WARN ] 2026-06-02 14:02:22.968 [8088 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:02:25.080 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:02:28.604 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841836},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:02:28.760 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:02:28.760 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 14:02:28.760 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:02:28.760 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:02:28.760 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:02:28.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:02:28.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:02:28.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:02:28.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:02:28.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:02:28.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:02:35.885 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 14:02:35.885 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431191,ok=431191,error=0, records=41
[WARN ] 2026-06-02 14:02:37.972 [8088 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:02:40.081 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:02:50.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 14:02:50.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431192,ok=431192,error=0, records=41
[WARN ] 2026-06-02 14:02:52.977 [8062 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:02:55.082 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:03:01.776 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21571/300s
[INFO ] 2026-06-02 14:03:03.578 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21571/300s
[INFO ] 2026-06-02 14:03:05.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 14:03:05.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431193,ok=431193,error=0, records=41
[WARN ] 2026-06-02 14:03:07.981 [8062 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:03:10.082 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:03:10.384 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21571/300s
[INFO ] 2026-06-02 14:03:20.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 14:03:20.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431194,ok=431194,error=0, records=41
[WARN ] 2026-06-02 14:03:22.987 [8062 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:03:25.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:03:35.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 14:03:35.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431195,ok=431195,error=0, records=41
[WARN ] 2026-06-02 14:03:37.992 [8187 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:03:40.083 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 14:03:40.084 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 14:03:50.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 14:03:50.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431196,ok=431196,error=0, records=41
[WARN ] 2026-06-02 14:03:52.997 [8145 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:03:55.084 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:04:05.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-02 14:04:05.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431197,ok=431197,error=0, records=41
[WARN ] 2026-06-02 14:04:08.002 [8201 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:04:10.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:04:20.995 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-02 14:04:20.995 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431198,ok=431198,error=0, records=41
[WARN ] 2026-06-02 14:04:23.007 [8130 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:04:25.085 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:04:36.008 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 14:04:36.008 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431199,ok=431199,error=0, records=41
[WARN ] 2026-06-02 14:04:38.013 [8243 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:04:40.086 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:04:51.016 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 14:04:51.016 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431200,ok=431200,error=0, records=41
[WARN ] 2026-06-02 14:04:53.018 [8215 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:04:55.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:05:02.129 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21575/300s
[INFO ] 2026-06-02 14:05:06.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 14:05:06.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431201,ok=431201,error=0, records=41
[WARN ] 2026-06-02 14:05:08.023 [8271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:05:10.087 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:05:21.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 14:05:21.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431202,ok=431202,error=0, records=41
[WARN ] 2026-06-02 14:05:23.028 [8271 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:05:25.088 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:05:28.760 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17962/300s
[INFO ] 2026-06-02 14:05:28.762 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841772},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:05:28.908 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:05:28.908 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 14:05:28.908 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:05:28.908 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:05:28.908 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:05:28.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:05:28.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:05:28.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:05:28.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:05:28.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:05:28.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:05:36.033 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21566/300s
[INFO ] 2026-06-02 14:05:36.035 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 14:05:36.035 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431203,ok=431203,error=0, records=41
[WARN ] 2026-06-02 14:05:38.034 [8314 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:05:40.089 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:05:43.467 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21575/300s
[INFO ] 2026-06-02 14:05:51.040 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 14:05:51.040 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431204,ok=431204,error=0, records=41
[WARN ] 2026-06-02 14:05:53.038 [8201 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:05:55.089 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:06:06.046 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 14:06:06.046 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431205,ok=431205,error=0, records=41
[WARN ] 2026-06-02 14:06:08.044 [8353 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:06:10.090 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:06:21.056 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 14:06:21.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431206,ok=431206,error=0, records=41
[INFO ] 2026-06-02 14:06:21.056 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21562/300s
[WARN ] 2026-06-02 14:06:23.049 [8364 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:06:25.091 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:06:36.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 14:06:36.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431207,ok=431207,error=0, records=41
[WARN ] 2026-06-02 14:06:37.554 [8201 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:06:40.091 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:06:51.066 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 14:06:51.066 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431208,ok=431208,error=0, records=41
[WARN ] 2026-06-02 14:06:52.559 [8397 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:06:54.345 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21571/300s
[INFO ] 2026-06-02 14:06:55.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:07:06.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 14:07:06.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431209,ok=431209,error=0, records=41
[WARN ] 2026-06-02 14:07:07.565 [8414 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:07:10.092 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:07:10.092 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21574/300s
[INFO ] 2026-06-02 14:07:11.453 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21562/300s
[INFO ] 2026-06-02 14:07:21.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 14:07:21.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431210,ok=431210,error=0, records=41
[WARN ] 2026-06-02 14:07:22.570 [8414 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:07:25.093 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:07:36.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 14:07:36.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431211,ok=431211,error=0, records=41
[WARN ] 2026-06-02 14:07:37.575 [8450 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:07:40.093 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:07:51.092 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 14:07:51.092 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431212,ok=431212,error=0, records=41
[WARN ] 2026-06-02 14:07:52.581 [8449 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:07:55.094 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:08:01.841 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21572/300s
[INFO ] 2026-06-02 14:08:03.643 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21572/300s
[INFO ] 2026-06-02 14:08:06.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 14:08:06.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431213,ok=431213,error=0, records=41
[WARN ] 2026-06-02 14:08:07.587 [8468 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:08:10.095 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:08:10.450 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21572/300s
[INFO ] 2026-06-02 14:08:21.115 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 14:08:21.115 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431214,ok=431214,error=0, records=41
[WARN ] 2026-06-02 14:08:22.594 [8462 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:08:25.095 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:08:28.910 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841704},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:08:29.083 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:08:29.084 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 14:08:29.084 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:08:29.084 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:08:29.084 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:08:29.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:08:29.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:08:29.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:08:29.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:08:29.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:08:29.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:08:36.119 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11358, records=45
[INFO ] 2026-06-02 14:08:36.119 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431215,ok=431215,error=0, records=45
[WARN ] 2026-06-02 14:08:37.599 [8507 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:08:40.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:08:51.126 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 14:08:51.126 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431216,ok=431216,error=0, records=41
[WARN ] 2026-06-02 14:08:52.605 [8485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:08:55.096 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:08:55.096 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 14:09:06.131 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 14:09:06.131 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431217,ok=431217,error=0, records=41
[WARN ] 2026-06-02 14:09:07.611 [8485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:09:10.098 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:09:21.137 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 14:09:21.137 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431218,ok=431218,error=0, records=41
[WARN ] 2026-06-02 14:09:22.616 [8496 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:09:25.098 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:09:36.142 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 14:09:36.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431219,ok=431219,error=0, records=41
[WARN ] 2026-06-02 14:09:37.621 [8468 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:09:40.099 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:09:51.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 14:09:51.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431220,ok=431220,error=0, records=41
[WARN ] 2026-06-02 14:09:52.625 [8507 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:09:55.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:10:02.132 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21576/300s
[INFO ] 2026-06-02 14:10:06.154 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 14:10:06.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431221,ok=431221,error=0, records=41
[WARN ] 2026-06-02 14:10:07.630 [8517 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:10:10.100 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:10:21.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 14:10:21.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431222,ok=431222,error=0, records=41
[WARN ] 2026-06-02 14:10:22.635 [8468 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:10:25.101 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:10:36.139 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21567/300s
[INFO ] 2026-06-02 14:10:36.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 14:10:36.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431223,ok=431223,error=0, records=41
[WARN ] 2026-06-02 14:10:37.640 [8496 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:10:40.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:10:43.473 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21576/300s
[INFO ] 2026-06-02 14:10:51.170 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 14:10:51.170 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431224,ok=431224,error=0, records=41
[WARN ] 2026-06-02 14:10:52.646 [8507 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:10:55.102 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:11:06.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-02 14:11:06.175 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431225,ok=431225,error=0, records=41
[WARN ] 2026-06-02 14:11:07.652 [8485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:11:10.103 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:11:21.182 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 14:11:21.182 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431226,ok=431226,error=0, records=41
[INFO ] 2026-06-02 14:11:21.182 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21563/300s
[WARN ] 2026-06-02 14:11:22.658 [8507 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:11:25.103 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:11:29.084 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17963/300s
[INFO ] 2026-06-02 14:11:29.085 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841628},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:11:29.244 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:11:29.244 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 14:11:29.244 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:11:29.244 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:11:29.244 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:11:29.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:11:29.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:11:29.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:11:29.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:11:29.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:11:29.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:11:36.188 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 14:11:36.188 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431227,ok=431227,error=0, records=41
[WARN ] 2026-06-02 14:11:37.664 [8468 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:11:40.104 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:11:51.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-02 14:11:51.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431228,ok=431228,error=0, records=41
[WARN ] 2026-06-02 14:11:52.670 [8468 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:11:54.404 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21572/300s
[INFO ] 2026-06-02 14:11:55.105 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:12:06.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-02 14:12:06.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431229,ok=431229,error=0, records=41
[WARN ] 2026-06-02 14:12:07.678 [8507 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:12:10.105 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:12:10.105 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21575/300s
[INFO ] 2026-06-02 14:12:11.646 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21563/300s
[INFO ] 2026-06-02 14:12:21.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-02 14:12:21.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431230,ok=431230,error=0, records=41
[WARN ] 2026-06-02 14:12:22.684 [8485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:12:25.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:12:36.209 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-02 14:12:36.209 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431231,ok=431231,error=0, records=41
[WARN ] 2026-06-02 14:12:37.689 [8507 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:12:40.106 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:12:51.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 14:12:51.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431232,ok=431232,error=0, records=41
[WARN ] 2026-06-02 14:12:52.694 [8507 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:12:55.107 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:13:01.929 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21573/300s
[INFO ] 2026-06-02 14:13:03.720 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21573/300s
[INFO ] 2026-06-02 14:13:06.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-02 14:13:06.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431233,ok=431233,error=0, records=41
[WARN ] 2026-06-02 14:13:07.700 [8468 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:13:10.107 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:13:10.534 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21573/300s
[INFO ] 2026-06-02 14:13:21.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 14:13:21.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431234,ok=431234,error=0, records=41
[WARN ] 2026-06-02 14:13:22.706 [8468 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:13:25.108 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:13:36.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10155, records=41
[INFO ] 2026-06-02 14:13:36.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431235,ok=431235,error=0, records=41
[WARN ] 2026-06-02 14:13:37.711 [8496 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:13:40.109 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 14:13:40.109 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 14:13:51.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-02 14:13:51.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431236,ok=431236,error=0, records=41
[WARN ] 2026-06-02 14:13:52.717 [8507 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:13:55.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:14:06.241 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-02 14:14:06.241 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431237,ok=431237,error=0, records=41
[WARN ] 2026-06-02 14:14:07.722 [8485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:14:10.110 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:14:21.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 14:14:21.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431238,ok=431238,error=0, records=41
[WARN ] 2026-06-02 14:14:22.727 [8496 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:14:25.111 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:14:29.246 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841548},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:14:29.423 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:14:29.424 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 14:14:29.424 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:14:29.424 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:14:29.424 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:14:29.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:14:29.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:14:29.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:14:29.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:14:29.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:14:29.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:14:36.252 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 14:14:36.252 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431239,ok=431239,error=0, records=41
[WARN ] 2026-06-02 14:14:37.732 [8517 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:14:40.112 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:14:51.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 14:14:51.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431240,ok=431240,error=0, records=41
[WARN ] 2026-06-02 14:14:52.738 [8485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:14:55.112 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:15:02.136 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21577/300s
[INFO ] 2026-06-02 14:15:06.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-02 14:15:06.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431241,ok=431241,error=0, records=41
[WARN ] 2026-06-02 14:15:07.743 [8496 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:15:10.113 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:15:21.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-02 14:15:21.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431242,ok=431242,error=0, records=41
[WARN ] 2026-06-02 14:15:22.748 [8496 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:15:25.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:15:36.252 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21568/300s
[INFO ] 2026-06-02 14:15:36.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 14:15:36.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431243,ok=431243,error=0, records=41
[WARN ] 2026-06-02 14:15:37.753 [8496 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:15:40.114 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:15:43.479 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21577/300s
[INFO ] 2026-06-02 14:15:51.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-02 14:15:51.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431244,ok=431244,error=0, records=41
[WARN ] 2026-06-02 14:15:52.758 [8496 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:15:55.115 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:16:06.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 14:16:06.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431245,ok=431245,error=0, records=41
[WARN ] 2026-06-02 14:16:07.763 [8517 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:16:10.115 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:16:21.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 14:16:21.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431246,ok=431246,error=0, records=41
[INFO ] 2026-06-02 14:16:21.302 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21564/300s
[WARN ] 2026-06-02 14:16:22.769 [8485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:16:25.116 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:16:36.323 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 14:16:36.323 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431247,ok=431247,error=0, records=41
[WARN ] 2026-06-02 14:16:37.775 [8507 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:16:40.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:16:51.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 14:16:51.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431248,ok=431248,error=0, records=41
[WARN ] 2026-06-02 14:16:52.779 [8496 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:16:54.466 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21573/300s
[INFO ] 2026-06-02 14:16:55.117 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:17:06.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 14:17:06.337 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431249,ok=431249,error=0, records=41
[WARN ] 2026-06-02 14:17:07.784 [8517 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:17:10.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:17:10.118 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21576/300s
[INFO ] 2026-06-02 14:17:11.828 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21564/300s
[INFO ] 2026-06-02 14:17:21.345 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 14:17:21.345 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431250,ok=431250,error=0, records=41
[WARN ] 2026-06-02 14:17:22.790 [8485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:17:25.118 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:17:29.424 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17964/300s
[INFO ] 2026-06-02 14:17:29.426 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841480},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:17:29.603 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:17:29.603 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 14:17:29.603 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:17:29.603 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:17:29.603 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:17:29.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:17:29.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:17:29.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:17:29.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:17:29.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:17:29.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:17:36.350 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 14:17:36.350 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431251,ok=431251,error=0, records=41
[WARN ] 2026-06-02 14:17:37.796 [8485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:17:40.119 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:17:51.356 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 14:17:51.356 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431252,ok=431252,error=0, records=41
[WARN ] 2026-06-02 14:17:52.801 [8485 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:17:55.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:18:01.992 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21574/300s
[INFO ] 2026-06-02 14:18:03.794 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21574/300s
[INFO ] 2026-06-02 14:18:06.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 14:18:06.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431253,ok=431253,error=0, records=41
[WARN ] 2026-06-02 14:18:07.807 [8468 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:18:10.120 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:18:10.606 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21574/300s
[INFO ] 2026-06-02 14:18:21.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-02 14:18:21.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431254,ok=431254,error=0, records=41
[WARN ] 2026-06-02 14:18:22.814 [9030 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:18:25.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:18:36.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 14:18:36.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431255,ok=431255,error=0, records=41
[WARN ] 2026-06-02 14:18:37.820 [8468 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:18:40.121 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:18:47.324 [8517 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7594/stat), No such file or directory
[WARN ] 2026-06-02 14:18:47.324 [8517 ] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/7587/stat), No such file or directory
[INFO ] 2026-06-02 14:18:51.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 14:18:51.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431256,ok=431256,error=0, records=41
[WARN ] 2026-06-02 14:18:52.825 [8517 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:18:55.122 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:19:06.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 14:19:06.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431257,ok=431257,error=0, records=41
[WARN ] 2026-06-02 14:19:07.832 [9185 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:19:10.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:19:21.405 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 14:19:21.405 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431258,ok=431258,error=0, records=41
[WARN ] 2026-06-02 14:19:22.837 [9185 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:19:25.123 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:19:36.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 14:19:36.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431259,ok=431259,error=0, records=41
[WARN ] 2026-06-02 14:19:37.842 [9185 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:19:40.124 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:19:51.421 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 14:19:51.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431260,ok=431260,error=0, records=41
[WARN ] 2026-06-02 14:19:52.848 [9171 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:19:55.124 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:20:02.139 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21578/300s
[INFO ] 2026-06-02 14:20:06.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 14:20:06.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431261,ok=431261,error=0, records=41
[WARN ] 2026-06-02 14:20:07.854 [9201 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:20:10.125 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:20:21.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 14:20:21.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431262,ok=431262,error=0, records=41
[WARN ] 2026-06-02 14:20:22.860 [9144 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:20:25.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:20:29.605 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841400},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:20:29.777 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:20:29.777 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 14:20:29.777 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:20:29.777 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:20:29.777 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:20:29.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:20:29.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:20:29.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:20:29.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:20:29.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:20:29.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:20:36.363 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21569/300s
[INFO ] 2026-06-02 14:20:36.441 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 14:20:36.441 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431263,ok=431263,error=0, records=41
[WARN ] 2026-06-02 14:20:37.864 [9171 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:20:40.126 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:20:43.486 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21578/300s
[INFO ] 2026-06-02 14:20:51.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 14:20:51.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431264,ok=431264,error=0, records=41
[WARN ] 2026-06-02 14:20:52.868 [9201 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:20:55.127 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:21:06.451 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 14:21:06.451 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431265,ok=431265,error=0, records=41
[WARN ] 2026-06-02 14:21:07.873 [9286 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:21:10.127 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:21:21.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 14:21:21.457 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431266,ok=431266,error=0, records=41
[INFO ] 2026-06-02 14:21:21.457 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21565/300s
[WARN ] 2026-06-02 14:21:22.879 [9310 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:21:25.128 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:21:36.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 14:21:36.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431267,ok=431267,error=0, records=41
[WARN ] 2026-06-02 14:21:37.885 [9315 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:21:40.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:21:51.501 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 14:21:51.501 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431268,ok=431268,error=0, records=41
[WARN ] 2026-06-02 14:21:52.890 [9349 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:21:54.524 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21574/300s
[INFO ] 2026-06-02 14:21:55.129 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:22:06.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-02 14:22:06.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431269,ok=431269,error=0, records=41
[WARN ] 2026-06-02 14:22:07.896 [9372 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:22:10.130 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:22:10.130 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21577/300s
[INFO ] 2026-06-02 14:22:12.011 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21565/300s
[INFO ] 2026-06-02 14:22:21.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-02 14:22:21.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431270,ok=431270,error=0, records=41
[WARN ] 2026-06-02 14:22:22.903 [9372 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:22:25.130 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:22:36.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 14:22:36.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431271,ok=431271,error=0, records=41
[WARN ] 2026-06-02 14:22:37.910 [9383 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:22:40.131 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:22:51.529 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-02 14:22:51.529 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431272,ok=431272,error=0, records=41
[WARN ] 2026-06-02 14:22:52.916 [9412 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:22:55.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:23:02.062 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21575/300s
[INFO ] 2026-06-02 14:23:03.863 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21575/300s
[INFO ] 2026-06-02 14:23:06.534 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 14:23:06.534 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431273,ok=431273,error=0, records=41
[WARN ] 2026-06-02 14:23:07.921 [9435 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:23:10.132 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:23:10.669 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21575/300s
[INFO ] 2026-06-02 14:23:21.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 14:23:21.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431274,ok=431274,error=0, records=41
[WARN ] 2026-06-02 14:23:22.927 [9452 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:23:25.133 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:23:29.777 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17965/300s
[INFO ] 2026-06-02 14:23:29.779 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841324},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:23:29.973 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:23:29.973 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 14:23:36.545 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 14:23:36.545 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431275,ok=431275,error=0, records=41
[WARN ] 2026-06-02 14:23:37.932 [9469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:23:40.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 14:23:40.134 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 14:23:51.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 14:23:51.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431276,ok=431276,error=0, records=41
[WARN ] 2026-06-02 14:23:52.937 [9480 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:23:55.134 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:23:55.134 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 14:24:06.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 14:24:06.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431277,ok=431277,error=0, records=41
[WARN ] 2026-06-02 14:24:07.943 [9469 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:24:10.136 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.39MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:24:21.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 14:24:21.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431278,ok=431278,error=0, records=41
[WARN ] 2026-06-02 14:24:22.948 [9512 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:24:25.137 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:24:36.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 14:24:36.584 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431279,ok=431279,error=0, records=41
[WARN ] 2026-06-02 14:24:37.953 [9506 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:24:40.137 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:24:51.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 14:24:51.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431280,ok=431280,error=0, records=41
[WARN ] 2026-06-02 14:24:52.958 [9512 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:24:55.138 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:25:02.143 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21579/300s
[INFO ] 2026-06-02 14:25:06.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 14:25:06.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431281,ok=431281,error=0, records=41
[WARN ] 2026-06-02 14:25:07.964 [9479 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:25:10.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:25:21.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 14:25:21.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431282,ok=431282,error=0, records=41
[WARN ] 2026-06-02 14:25:22.968 [9568 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:25:25.139 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.49MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:25:36.472 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21570/300s
[INFO ] 2026-06-02 14:25:36.663 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 14:25:36.663 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431283,ok=431283,error=0, records=41
[WARN ] 2026-06-02 14:25:37.973 [9501 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:25:40.140 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:25:43.492 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21579/300s
[INFO ] 2026-06-02 14:25:51.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 14:25:51.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431284,ok=431284,error=0, records=41
[WARN ] 2026-06-02 14:25:52.979 [9506 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:25:55.141 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:26:06.682 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 14:26:06.682 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431285,ok=431285,error=0, records=41
[WARN ] 2026-06-02 14:26:07.984 [9609 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:26:10.141 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:26:21.693 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 14:26:21.693 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431286,ok=431286,error=0, records=41
[INFO ] 2026-06-02 14:26:21.693 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21566/300s
[WARN ] 2026-06-02 14:26:22.989 [9506 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:26:25.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:26:29.974 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841260},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:26:30.124 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:26:30.124 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 14:26:30.125 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:26:30.125 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:26:30.125 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:26:30.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:26:30.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:26:30.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:26:30.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:26:30.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:26:30.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:26:36.698 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 14:26:36.698 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431287,ok=431287,error=0, records=41
[WARN ] 2026-06-02 14:26:37.995 [9506 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:26:40.142 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:26:51.704 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 14:26:51.704 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431288,ok=431288,error=0, records=41
[WARN ] 2026-06-02 14:26:52.999 [9638 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:26:54.581 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21575/300s
[INFO ] 2026-06-02 14:26:55.143 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:27:06.710 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 14:27:06.710 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431289,ok=431289,error=0, records=41
[WARN ] 2026-06-02 14:27:08.004 [9479 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:27:10.144 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:27:10.144 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21578/300s
[INFO ] 2026-06-02 14:27:12.197 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21566/300s
[INFO ] 2026-06-02 14:27:21.718 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 14:27:21.718 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431290,ok=431290,error=0, records=41
[WARN ] 2026-06-02 14:27:23.009 [9652 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:27:25.144 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:27:36.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 14:27:36.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431291,ok=431291,error=0, records=41
[WARN ] 2026-06-02 14:27:38.014 [9694 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:27:40.145 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:27:51.728 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 14:27:51.728 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431292,ok=431292,error=0, records=41
[WARN ] 2026-06-02 14:27:53.019 [9666 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:27:55.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:28:02.142 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21576/300s
[INFO ] 2026-06-02 14:28:03.943 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21576/300s
[INFO ] 2026-06-02 14:28:06.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 14:28:06.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431293,ok=431293,error=0, records=41
[WARN ] 2026-06-02 14:28:08.024 [9694 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:28:10.146 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:28:10.750 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21576/300s
[INFO ] 2026-06-02 14:28:21.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 14:28:21.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431294,ok=431294,error=0, records=41
[WARN ] 2026-06-02 14:28:23.029 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:28:25.147 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:28:36.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 14:28:36.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431295,ok=431295,error=0, records=41
[WARN ] 2026-06-02 14:28:38.034 [9708 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:28:40.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:28:51.750 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 14:28:51.750 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431296,ok=431296,error=0, records=41
[WARN ] 2026-06-02 14:28:53.039 [9750 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:28:55.148 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:29:06.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 14:29:06.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431297,ok=431297,error=0, records=41
[WARN ] 2026-06-02 14:29:08.045 [9781 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:29:10.149 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:29:21.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 14:29:21.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431298,ok=431298,error=0, records=41
[WARN ] 2026-06-02 14:29:23.050 [9722 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:29:25.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:29:30.125 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17966/300s
[INFO ] 2026-06-02 14:29:30.126 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841188},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:29:30.292 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:29:30.292 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 14:29:30.292 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:29:30.292 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:29:30.292 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:29:30.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:29:30.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:29:30.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:29:30.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:29:30.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:29:30.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:29:36.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-02 14:29:36.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431299,ok=431299,error=0, records=41
[WARN ] 2026-06-02 14:29:37.555 [9776 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:29:40.150 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:29:51.801 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-02 14:29:51.801 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431300,ok=431300,error=0, records=41
[WARN ] 2026-06-02 14:29:52.560 [9833 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:29:55.151 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:30:02.147 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21580/300s
[INFO ] 2026-06-02 14:30:06.806 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 14:30:06.806 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431301,ok=431301,error=0, records=41
[WARN ] 2026-06-02 14:30:07.565 [9818 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:30:10.151 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:30:21.897 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 14:30:21.897 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431302,ok=431302,error=0, records=41
[WARN ] 2026-06-02 14:30:22.569 [9871 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:30:25.152 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:30:36.575 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21571/300s
[INFO ] 2026-06-02 14:30:36.903 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 14:30:36.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431303,ok=431303,error=0, records=41
[WARN ] 2026-06-02 14:30:37.576 [9887 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:30:40.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:30:43.499 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21580/300s
[INFO ] 2026-06-02 14:30:51.907 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 14:30:51.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431304,ok=431304,error=0, records=41
[WARN ] 2026-06-02 14:30:52.581 [9904 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:30:55.153 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:31:06.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 14:31:06.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431305,ok=431305,error=0, records=41
[WARN ] 2026-06-02 14:31:07.586 [9912 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:31:10.154 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:31:21.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 14:31:21.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431306,ok=431306,error=0, records=41
[INFO ] 2026-06-02 14:31:21.919 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21567/300s
[WARN ] 2026-06-02 14:31:22.591 [9929 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:31:25.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:31:36.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 14:31:36.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431307,ok=431307,error=0, records=41
[WARN ] 2026-06-02 14:31:37.596 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:31:40.155 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:31:51.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-02 14:31:51.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431308,ok=431308,error=0, records=41
[WARN ] 2026-06-02 14:31:52.602 [9924 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:31:54.641 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21576/300s
[INFO ] 2026-06-02 14:31:55.156 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:32:06.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-02 14:32:06.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431309,ok=431309,error=0, records=41
[WARN ] 2026-06-02 14:32:07.608 [9940 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:32:10.156 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:32:10.156 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21579/300s
[INFO ] 2026-06-02 14:32:12.382 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21567/300s
[INFO ] 2026-06-02 14:32:21.943 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 14:32:21.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431310,ok=431310,error=0, records=41
[WARN ] 2026-06-02 14:32:22.613 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:32:25.157 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:32:30.294 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841116},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:32:30.437 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:32:30.437 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-02 14:32:30.437 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:32:30.437 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:32:30.437 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:32:30.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:32:30.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:32:30.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:32:30.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:32:30.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:32:30.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:32:36.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-02 14:32:36.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431311,ok=431311,error=0, records=41
[WARN ] 2026-06-02 14:32:37.618 [9924 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:32:40.158 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:32:51.953 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-02 14:32:51.953 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431312,ok=431312,error=0, records=41
[WARN ] 2026-06-02 14:32:52.623 [9940 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:32:55.158 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:33:02.211 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21577/300s
[INFO ] 2026-06-02 14:33:04.012 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21577/300s
[INFO ] 2026-06-02 14:33:07.061 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 14:33:07.061 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431313,ok=431313,error=0, records=41
[WARN ] 2026-06-02 14:33:07.629 [9934 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:33:10.159 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:33:10.819 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21577/300s
[INFO ] 2026-06-02 14:33:22.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 14:33:22.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431314,ok=431314,error=0, records=41
[WARN ] 2026-06-02 14:33:22.634 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:33:25.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:33:37.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 14:33:37.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431315,ok=431315,error=0, records=41
[WARN ] 2026-06-02 14:33:37.638 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:33:40.160 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 14:33:40.160 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 14:33:52.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 14:33:52.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431316,ok=431316,error=0, records=41
[WARN ] 2026-06-02 14:33:52.645 [9928 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:33:55.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:34:07.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 14:34:07.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431317,ok=431317,error=0, records=41
[WARN ] 2026-06-02 14:34:07.650 [9928 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:34:10.161 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:34:22.100 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 14:34:22.100 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431318,ok=431318,error=0, records=41
[WARN ] 2026-06-02 14:34:22.654 [9934 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:34:25.162 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:34:37.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 14:34:37.105 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431319,ok=431319,error=0, records=41
[WARN ] 2026-06-02 14:34:37.659 [9928 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:34:40.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:34:52.111 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 14:34:52.111 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431320,ok=431320,error=0, records=41
[WARN ] 2026-06-02 14:34:52.666 [9934 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:34:55.163 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:35:02.150 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21581/300s
[INFO ] 2026-06-02 14:35:07.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 14:35:07.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431321,ok=431321,error=0, records=41
[WARN ] 2026-06-02 14:35:07.673 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:35:10.164 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:35:22.123 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 14:35:22.123 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431322,ok=431322,error=0, records=41
[WARN ] 2026-06-02 14:35:22.678 [9940 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:35:25.164 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:35:30.437 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17967/300s
[INFO ] 2026-06-02 14:35:30.439 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20841044},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:35:30.610 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:35:30.610 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 14:35:30.610 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:35:30.610 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:35:30.610 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:35:30.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:35:30.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:35:30.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:35:30.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:35:30.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:35:30.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:35:36.682 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21572/300s
[INFO ] 2026-06-02 14:35:37.128 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 14:35:37.128 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431323,ok=431323,error=0, records=41
[WARN ] 2026-06-02 14:35:37.682 [9928 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:35:40.165 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:35:43.505 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21581/300s
[INFO ] 2026-06-02 14:35:52.133 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 14:35:52.133 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431324,ok=431324,error=0, records=41
[WARN ] 2026-06-02 14:35:52.688 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:35:55.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:36:07.139 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 14:36:07.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431325,ok=431325,error=0, records=41
[WARN ] 2026-06-02 14:36:07.693 [9934 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:36:10.166 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:36:22.145 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-02 14:36:22.145 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431326,ok=431326,error=0, records=41
[INFO ] 2026-06-02 14:36:22.145 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21568/300s
[WARN ] 2026-06-02 14:36:22.698 [9924 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:36:25.167 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:36:37.150 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-02 14:36:37.150 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431327,ok=431327,error=0, records=41
[WARN ] 2026-06-02 14:36:37.703 [9934 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:36:40.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:36:52.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-02 14:36:52.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431328,ok=431328,error=0, records=41
[WARN ] 2026-06-02 14:36:52.708 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:36:54.698 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21577/300s
[INFO ] 2026-06-02 14:36:55.168 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:37:07.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10299, records=41
[INFO ] 2026-06-02 14:37:07.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431329,ok=431329,error=0, records=41
[WARN ] 2026-06-02 14:37:07.713 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:37:10.169 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:37:10.169 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21580/300s
[INFO ] 2026-06-02 14:37:12.571 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21568/300s
[INFO ] 2026-06-02 14:37:22.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 14:37:22.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431330,ok=431330,error=0, records=41
[WARN ] 2026-06-02 14:37:22.718 [9924 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:37:25.170 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:37:37.269 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 14:37:37.269 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431331,ok=431331,error=0, records=41
[WARN ] 2026-06-02 14:37:37.724 [9928 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:37:40.171 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:37:52.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 14:37:52.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431332,ok=431332,error=0, records=41
[WARN ] 2026-06-02 14:37:52.729 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:37:55.171 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:38:02.286 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21578/300s
[INFO ] 2026-06-02 14:38:04.088 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21578/300s
[INFO ] 2026-06-02 14:38:07.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 14:38:07.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431333,ok=431333,error=0, records=41
[WARN ] 2026-06-02 14:38:07.735 [9924 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:38:10.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:38:10.894 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21578/300s
[INFO ] 2026-06-02 14:38:22.291 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 14:38:22.291 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431334,ok=431334,error=0, records=41
[WARN ] 2026-06-02 14:38:22.741 [9934 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:38:25.172 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:38:30.612 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840976},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:38:30.782 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:38:30.782 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 14:38:30.782 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:38:30.782 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:38:30.782 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:38:30.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:38:30.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:38:30.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:38:30.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:38:30.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:38:30.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:38:37.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 14:38:37.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431335,ok=431335,error=0, records=41
[WARN ] 2026-06-02 14:38:37.747 [9940 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:38:40.173 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:38:52.301 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 14:38:52.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431336,ok=431336,error=0, records=41
[WARN ] 2026-06-02 14:38:52.752 [9924 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:38:55.174 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:38:55.174 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 14:39:07.308 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 14:39:07.308 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431337,ok=431337,error=0, records=41
[WARN ] 2026-06-02 14:39:07.758 [9934 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:39:10.175 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:39:22.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 14:39:22.314 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431338,ok=431338,error=0, records=41
[WARN ] 2026-06-02 14:39:22.764 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:39:25.176 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:39:37.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 14:39:37.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431339,ok=431339,error=0, records=41
[WARN ] 2026-06-02 14:39:37.770 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:39:40.176 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:39:52.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 14:39:52.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431340,ok=431340,error=0, records=41
[WARN ] 2026-06-02 14:39:52.776 [9934 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:39:55.177 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:40:02.154 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21582/300s
[INFO ] 2026-06-02 14:40:07.336 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10286, records=41
[INFO ] 2026-06-02 14:40:07.336 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431341,ok=431341,error=0, records=41
[WARN ] 2026-06-02 14:40:07.781 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:40:10.178 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:40:22.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 14:40:22.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431342,ok=431342,error=0, records=41
[WARN ] 2026-06-02 14:40:22.787 [9928 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:40:25.178 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:40:36.792 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21573/300s
[INFO ] 2026-06-02 14:40:37.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 14:40:37.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431343,ok=431343,error=0, records=41
[WARN ] 2026-06-02 14:40:37.792 [9934 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:40:40.179 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:40:43.512 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21582/300s
[INFO ] 2026-06-02 14:40:52.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-02 14:40:52.355 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431344,ok=431344,error=0, records=41
[WARN ] 2026-06-02 14:40:52.799 [9924 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:40:55.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:41:07.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 14:41:07.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431345,ok=431345,error=0, records=41
[WARN ] 2026-06-02 14:41:07.804 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:41:10.180 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:41:22.366 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10149, records=41
[INFO ] 2026-06-02 14:41:22.366 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431346,ok=431346,error=0, records=41
[INFO ] 2026-06-02 14:41:22.366 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21569/300s
[WARN ] 2026-06-02 14:41:22.810 [10604] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:41:25.181 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:41:30.782 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17968/300s
[INFO ] 2026-06-02 14:41:30.784 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840872},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:41:30.950 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:41:30.950 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 14:41:30.950 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:41:30.950 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:41:30.950 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:41:30.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:41:30.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:41:30.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:41:30.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:41:30.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:41:30.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:41:37.371 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-02 14:41:37.371 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431347,ok=431347,error=0, records=41
[WARN ] 2026-06-02 14:41:37.817 [10620] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:41:40.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:41:52.377 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-02 14:41:52.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431348,ok=431348,error=0, records=41
[WARN ] 2026-06-02 14:41:52.822 [10626] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:41:54.759 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21578/300s
[INFO ] 2026-06-02 14:41:55.182 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:42:07.384 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10269, records=41
[INFO ] 2026-06-02 14:42:07.384 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431349,ok=431349,error=0, records=41
[WARN ] 2026-06-02 14:42:07.827 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:42:10.183 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:42:10.183 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21581/300s
[INFO ] 2026-06-02 14:42:12.760 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21569/300s
[INFO ] 2026-06-02 14:42:22.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-02 14:42:22.390 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431350,ok=431350,error=0, records=41
[WARN ] 2026-06-02 14:42:22.832 [10677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:42:25.183 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:42:37.396 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-02 14:42:37.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431351,ok=431351,error=0, records=41
[WARN ] 2026-06-02 14:42:37.837 [10604] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:42:40.184 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:42:52.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 14:42:52.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431352,ok=431352,error=0, records=41
[WARN ] 2026-06-02 14:42:52.843 [10700] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:42:55.185 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:43:02.324 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21579/300s
[INFO ] 2026-06-02 14:43:04.125 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21579/300s
[INFO ] 2026-06-02 14:43:07.409 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 14:43:07.409 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431353,ok=431353,error=0, records=41
[WARN ] 2026-06-02 14:43:07.848 [10620] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:43:10.185 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:43:10.932 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21579/300s
[INFO ] 2026-06-02 14:43:22.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 14:43:22.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431354,ok=431354,error=0, records=41
[WARN ] 2026-06-02 14:43:22.853 [10700] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:43:25.186 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:43:37.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 14:43:37.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431355,ok=431355,error=0, records=41
[WARN ] 2026-06-02 14:43:37.857 [10620] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:43:40.187 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 14:43:40.187 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 14:43:52.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 14:43:52.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431356,ok=431356,error=0, records=41
[WARN ] 2026-06-02 14:43:52.862 [10742] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:43:55.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:44:07.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 14:44:07.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431357,ok=431357,error=0, records=41
[WARN ] 2026-06-02 14:44:07.866 [9947 ] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:44:10.188 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:44:22.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 14:44:22.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431358,ok=431358,error=0, records=41
[WARN ] 2026-06-02 14:44:22.872 [10677] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:44:25.189 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:44:30.952 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840796},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:44:31.137 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:44:31.138 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 14:44:31.138 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:44:31.138 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:44:31.138 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:44:31.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:44:31.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:44:31.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:44:31.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:44:31.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:44:31.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:44:37.466 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 14:44:37.466 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431359,ok=431359,error=0, records=41
[WARN ] 2026-06-02 14:44:37.878 [10770] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:44:40.189 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:44:52.471 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 14:44:52.471 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431360,ok=431360,error=0, records=41
[WARN ] 2026-06-02 14:44:52.884 [10800] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:44:55.190 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:45:02.157 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21583/300s
[INFO ] 2026-06-02 14:45:07.479 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 14:45:07.479 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431361,ok=431361,error=0, records=41
[WARN ] 2026-06-02 14:45:07.890 [10838] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:45:10.191 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:45:22.484 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 14:45:22.484 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431362,ok=431362,error=0, records=41
[WARN ] 2026-06-02 14:45:22.895 [10849] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:45:25.191 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:45:36.900 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21574/300s
[INFO ] 2026-06-02 14:45:37.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-02 14:45:37.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431363,ok=431363,error=0, records=41
[WARN ] 2026-06-02 14:45:37.901 [10816] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:45:40.192 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:45:43.519 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21583/300s
[INFO ] 2026-06-02 14:45:52.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-02 14:45:52.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431364,ok=431364,error=0, records=41
[WARN ] 2026-06-02 14:45:52.906 [10884] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:45:55.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:46:07.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-02 14:46:07.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431365,ok=431365,error=0, records=41
[WARN ] 2026-06-02 14:46:07.912 [10895] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:46:10.193 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:46:22.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-02 14:46:22.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431366,ok=431366,error=0, records=41
[INFO ] 2026-06-02 14:46:22.511 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21570/300s
[WARN ] 2026-06-02 14:46:22.918 [10916] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:46:25.194 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:46:37.517 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10327, records=41
[INFO ] 2026-06-02 14:46:37.517 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431367,ok=431367,error=0, records=41
[WARN ] 2026-06-02 14:46:37.924 [10938] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:46:40.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:46:52.523 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 14:46:52.523 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431368,ok=431368,error=0, records=41
[WARN ] 2026-06-02 14:46:52.929 [10932] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:46:54.819 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21579/300s
[INFO ] 2026-06-02 14:46:55.195 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:47:07.529 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 14:47:07.529 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431369,ok=431369,error=0, records=41
[WARN ] 2026-06-02 14:47:07.935 [10965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:47:10.196 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:47:10.196 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21582/300s
[INFO ] 2026-06-02 14:47:12.943 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21570/300s
[INFO ] 2026-06-02 14:47:22.534 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 14:47:22.534 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431370,ok=431370,error=0, records=41
[WARN ] 2026-06-02 14:47:22.941 [10976] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:47:25.196 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:47:31.138 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17969/300s
[INFO ] 2026-06-02 14:47:31.140 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840724},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:47:31.296 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:47:31.296 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 14:47:31.296 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:47:31.296 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:47:31.296 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:47:31.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:47:31.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:47:31.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:47:31.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:47:31.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:47:31.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:47:37.543 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 14:47:37.543 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431371,ok=431371,error=0, records=41
[WARN ] 2026-06-02 14:47:37.948 [10991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:47:40.197 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:47:52.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 14:47:52.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431372,ok=431372,error=0, records=41
[WARN ] 2026-06-02 14:47:52.954 [10965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:47:55.198 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:48:02.410 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21580/300s
[INFO ] 2026-06-02 14:48:04.212 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21580/300s
[INFO ] 2026-06-02 14:48:07.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 14:48:07.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431373,ok=431373,error=0, records=41
[WARN ] 2026-06-02 14:48:07.959 [10976] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:48:10.198 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:48:11.019 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21580/300s
[INFO ] 2026-06-02 14:48:22.558 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 14:48:22.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431374,ok=431374,error=0, records=41
[WARN ] 2026-06-02 14:48:22.965 [10966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:48:25.199 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:48:37.566 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 14:48:37.566 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431375,ok=431375,error=0, records=41
[WARN ] 2026-06-02 14:48:37.970 [11052] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:48:40.200 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:48:52.571 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 14:48:52.571 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431376,ok=431376,error=0, records=41
[WARN ] 2026-06-02 14:48:52.975 [10976] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:48:55.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:49:07.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 14:49:07.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431377,ok=431377,error=0, records=41
[WARN ] 2026-06-02 14:49:07.979 [11066] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:49:10.201 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:49:22.583 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 14:49:22.583 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431378,ok=431378,error=0, records=41
[WARN ] 2026-06-02 14:49:22.985 [10966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:49:25.202 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:49:37.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 14:49:37.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431379,ok=431379,error=0, records=41
[WARN ] 2026-06-02 14:49:37.990 [10966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:49:40.202 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:49:52.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 14:49:52.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431380,ok=431380,error=0, records=41
[WARN ] 2026-06-02 14:49:52.994 [11123] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:49:55.203 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:50:02.161 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21584/300s
[INFO ] 2026-06-02 14:50:07.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 14:50:07.601 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431381,ok=431381,error=0, records=41
[WARN ] 2026-06-02 14:50:08.000 [11066] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:50:10.204 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:50:22.607 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 14:50:22.607 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431382,ok=431382,error=0, records=41
[WARN ] 2026-06-02 14:50:23.004 [10966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:50:25.204 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:50:31.298 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840648},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:50:31.461 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:50:31.461 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 14:50:31.461 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:50:31.461 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:50:31.461 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:50:31.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:50:31.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:50:31.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:50:31.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:50:31.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:50:31.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 14:50:32.508 [11142] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10506/stat), No such file or directory
[WARN ] 2026-06-02 14:50:32.508 [11142] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10544/stat), No such file or directory
[INFO ] 2026-06-02 14:50:37.009 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21575/300s
[INFO ] 2026-06-02 14:50:37.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 14:50:37.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431383,ok=431383,error=0, records=41
[WARN ] 2026-06-02 14:50:38.010 [11156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:50:40.205 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:50:43.526 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21584/300s
[WARN ] 2026-06-02 14:50:47.514 [10966] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10506/stat), No such file or directory
[WARN ] 2026-06-02 14:50:47.514 [10966] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10544/stat), No such file or directory
[INFO ] 2026-06-02 14:50:52.616 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 14:50:52.616 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431384,ok=431384,error=0, records=41
[WARN ] 2026-06-02 14:50:53.015 [11108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:50:55.205 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:51:07.622 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 14:51:07.622 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431385,ok=431385,error=0, records=41
[WARN ] 2026-06-02 14:51:08.021 [10966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:51:10.206 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:51:22.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 14:51:22.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431386,ok=431386,error=0, records=41
[INFO ] 2026-06-02 14:51:22.627 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21571/300s
[WARN ] 2026-06-02 14:51:23.026 [10966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:51:25.207 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:51:37.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 14:51:37.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431387,ok=431387,error=0, records=41
[WARN ] 2026-06-02 14:51:38.031 [11156] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:51:40.207 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:51:52.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 14:51:52.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431388,ok=431388,error=0, records=41
[WARN ] 2026-06-02 14:51:53.036 [11243] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:51:54.877 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21580/300s
[INFO ] 2026-06-02 14:51:55.208 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:52:07.645 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 14:52:07.645 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431389,ok=431389,error=0, records=41
[WARN ] 2026-06-02 14:52:08.043 [10966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:52:10.209 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:52:10.209 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21583/300s
[INFO ] 2026-06-02 14:52:13.129 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21571/300s
[INFO ] 2026-06-02 14:52:22.650 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10309, records=41
[INFO ] 2026-06-02 14:52:22.650 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431390,ok=431390,error=0, records=41
[WARN ] 2026-06-02 14:52:23.048 [11260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:52:25.209 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:52:37.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 14:52:37.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431391,ok=431391,error=0, records=41
[WARN ] 2026-06-02 14:52:38.052 [11297] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:52:40.210 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:52:52.557 [11303] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:52:52.662 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-02 14:52:52.662 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431392,ok=431392,error=0, records=41
[INFO ] 2026-06-02 14:52:55.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:53:02.484 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21581/300s
[INFO ] 2026-06-02 14:53:04.285 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21581/300s
[WARN ] 2026-06-02 14:53:07.562 [11322] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:53:07.667 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 14:53:07.667 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431393,ok=431393,error=0, records=41
[INFO ] 2026-06-02 14:53:10.211 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:53:11.092 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21581/300s
[WARN ] 2026-06-02 14:53:22.569 [11350] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:53:22.673 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 14:53:22.673 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431394,ok=431394,error=0, records=41
[INFO ] 2026-06-02 14:53:25.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:53:31.461 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17970/300s
[INFO ] 2026-06-02 14:53:31.463 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840568},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:53:31.623 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:53:31.623 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 14:53:31.623 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:53:31.623 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:53:31.623 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:53:31.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:53:31.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:53:31.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:53:31.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:53:31.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:53:31.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 14:53:37.573 [11364] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:53:37.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 14:53:37.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431395,ok=431395,error=0, records=41
[INFO ] 2026-06-02 14:53:40.212 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 14:53:40.213 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 14:53:52.579 [11364] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:53:52.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 14:53:52.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431396,ok=431396,error=0, records=41
[INFO ] 2026-06-02 14:53:55.214 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:53:55.214 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-02 14:54:07.584 [11387] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:54:07.697 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-02 14:54:07.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431397,ok=431397,error=0, records=41
[INFO ] 2026-06-02 14:54:10.215 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:54:22.589 [11393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:54:22.712 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-02 14:54:22.712 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431398,ok=431398,error=0, records=41
[INFO ] 2026-06-02 14:54:25.216 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:54:37.597 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:54:37.718 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-02 14:54:37.718 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431399,ok=431399,error=0, records=41
[INFO ] 2026-06-02 14:54:40.216 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:54:52.602 [11418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:54:52.723 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-02 14:54:52.723 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431400,ok=431400,error=0, records=41
[INFO ] 2026-06-02 14:54:55.217 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:55:02.164 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21585/300s
[WARN ] 2026-06-02 14:55:07.607 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:55:07.798 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10375, records=41
[INFO ] 2026-06-02 14:55:07.798 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431401,ok=431401,error=0, records=41
[INFO ] 2026-06-02 14:55:10.218 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:55:22.612 [11436] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:55:22.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-02 14:55:22.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431402,ok=431402,error=0, records=41
[INFO ] 2026-06-02 14:55:25.218 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:55:37.116 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21576/300s
[WARN ] 2026-06-02 14:55:37.617 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:55:37.809 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-02 14:55:37.809 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431403,ok=431403,error=0, records=41
[INFO ] 2026-06-02 14:55:40.219 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:55:43.532 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21585/300s
[WARN ] 2026-06-02 14:55:52.622 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:55:52.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-02 14:55:52.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431404,ok=431404,error=0, records=41
[INFO ] 2026-06-02 14:55:55.220 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:56:07.628 [11418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:56:07.820 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 14:56:07.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431405,ok=431405,error=0, records=41
[INFO ] 2026-06-02 14:56:10.221 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:56:22.633 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:56:22.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 14:56:22.839 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431406,ok=431406,error=0, records=41
[INFO ] 2026-06-02 14:56:22.840 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21572/300s
[INFO ] 2026-06-02 14:56:25.221 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:56:31.625 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840488},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:56:31.799 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:56:31.799 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 14:56:31.799 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:56:31.799 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:56:31.799 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:56:31.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:56:31.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:56:31.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:56:31.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:56:31.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:56:31.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 14:56:37.639 [11418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:56:37.846 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 14:56:37.846 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431407,ok=431407,error=0, records=41
[INFO ] 2026-06-02 14:56:40.222 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:56:52.645 [11418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:56:52.851 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 14:56:52.851 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431408,ok=431408,error=0, records=41
[INFO ] 2026-06-02 14:56:54.936 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21581/300s
[INFO ] 2026-06-02 14:56:55.222 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:57:07.651 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:57:07.856 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-02 14:57:07.856 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431409,ok=431409,error=0, records=41
[INFO ] 2026-06-02 14:57:10.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:57:10.223 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21584/300s
[INFO ] 2026-06-02 14:57:13.316 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21572/300s
[WARN ] 2026-06-02 14:57:22.656 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:57:22.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10311, records=41
[INFO ] 2026-06-02 14:57:22.865 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431410,ok=431410,error=0, records=41
[INFO ] 2026-06-02 14:57:25.223 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:57:37.663 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:57:37.872 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-02 14:57:37.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431411,ok=431411,error=0, records=41
[INFO ] 2026-06-02 14:57:40.224 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:57:52.667 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:57:52.879 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-02 14:57:52.879 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431412,ok=431412,error=0, records=41
[INFO ] 2026-06-02 14:57:55.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:58:02.547 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21582/300s
[INFO ] 2026-06-02 14:58:04.349 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21582/300s
[WARN ] 2026-06-02 14:58:07.672 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:58:07.884 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 14:58:07.884 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431413,ok=431413,error=0, records=41
[INFO ] 2026-06-02 14:58:10.225 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:58:11.155 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21582/300s
[WARN ] 2026-06-02 14:58:22.677 [11418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:58:22.889 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 14:58:22.889 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431414,ok=431414,error=0, records=41
[INFO ] 2026-06-02 14:58:25.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:58:37.683 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:58:37.895 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 14:58:37.895 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431415,ok=431415,error=0, records=41
[INFO ] 2026-06-02 14:58:40.226 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:58:52.687 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:58:52.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 14:58:52.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431416,ok=431416,error=0, records=41
[INFO ] 2026-06-02 14:58:55.227 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:59:07.693 [11436] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:59:07.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 14:59:07.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431417,ok=431417,error=0, records=41
[INFO ] 2026-06-02 14:59:10.228 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:59:22.698 [11418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:59:22.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 14:59:22.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431418,ok=431418,error=0, records=41
[INFO ] 2026-06-02 14:59:25.228 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 14:59:31.799 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17971/300s
[INFO ] 2026-06-02 14:59:31.801 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840416},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 14:59:31.978 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 14:59:31.978 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 14:59:31.979 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 14:59:31.979 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 14:59:31.979 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 14:59:32.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:59:32.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 14:59:32.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:59:32.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 14:59:32.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 14:59:32.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 14:59:37.703 [11418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:59:37.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 14:59:37.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431419,ok=431419,error=0, records=41
[INFO ] 2026-06-02 14:59:40.229 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 14:59:52.711 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 14:59:52.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 14:59:52.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431420,ok=431420,error=0, records=41
[INFO ] 2026-06-02 14:59:55.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:00:02.167 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21586/300s
[WARN ] 2026-06-02 15:00:07.715 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:00:07.944 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 15:00:07.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431421,ok=431421,error=0, records=41
[INFO ] 2026-06-02 15:00:10.230 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:00:22.720 [11418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:00:22.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 15:00:22.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431422,ok=431422,error=0, records=41
[INFO ] 2026-06-02 15:00:25.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:00:37.225 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21577/300s
[WARN ] 2026-06-02 15:00:37.725 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:00:37.956 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 15:00:37.956 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431423,ok=431423,error=0, records=41
[INFO ] 2026-06-02 15:00:40.231 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:00:43.539 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21586/300s
[WARN ] 2026-06-02 15:00:52.730 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:00:52.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 15:00:52.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431424,ok=431424,error=0, records=41
[INFO ] 2026-06-02 15:00:55.232 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:01:07.735 [11393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:01:07.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 15:01:07.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431425,ok=431425,error=0, records=41
[INFO ] 2026-06-02 15:01:10.233 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:01:22.741 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:01:22.976 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 15:01:22.976 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431426,ok=431426,error=0, records=41
[INFO ] 2026-06-02 15:01:22.976 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21573/300s
[INFO ] 2026-06-02 15:01:25.233 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:01:37.746 [11393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:01:38.003 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 15:01:38.003 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431427,ok=431427,error=0, records=41
[INFO ] 2026-06-02 15:01:40.234 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:01:52.751 [11418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:01:53.007 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 15:01:53.007 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431428,ok=431428,error=0, records=41
[INFO ] 2026-06-02 15:01:54.991 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21582/300s
[INFO ] 2026-06-02 15:01:55.235 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:02:07.757 [11393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:02:08.017 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 15:02:08.017 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431429,ok=431429,error=0, records=41
[INFO ] 2026-06-02 15:02:10.235 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:02:10.235 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21585/300s
[INFO ] 2026-06-02 15:02:13.502 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21573/300s
[WARN ] 2026-06-02 15:02:22.762 [11436] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:02:23.022 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 15:02:23.022 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431430,ok=431430,error=0, records=41
[INFO ] 2026-06-02 15:02:25.236 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:02:31.980 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840324},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:02:32.137 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:02:32.137 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 15:02:32.137 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:02:32.137 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:02:32.137 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:02:32.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:02:32.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:02:32.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:02:32.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:02:32.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:02:32.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:02:37.767 [11436] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:02:38.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 15:02:38.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431431,ok=431431,error=0, records=41
[INFO ] 2026-06-02 15:02:40.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:02:52.772 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:02:53.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 15:02:53.032 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431432,ok=431432,error=0, records=41
[INFO ] 2026-06-02 15:02:55.237 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:03:02.622 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21583/300s
[INFO ] 2026-06-02 15:03:04.424 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21583/300s
[WARN ] 2026-06-02 15:03:07.779 [11404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:03:08.038 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 15:03:08.038 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431433,ok=431433,error=0, records=41
[INFO ] 2026-06-02 15:03:10.238 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:03:11.231 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21583/300s
[WARN ] 2026-06-02 15:03:22.784 [11436] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:03:23.044 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 15:03:23.044 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431434,ok=431434,error=0, records=41
[INFO ] 2026-06-02 15:03:25.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:03:37.789 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:03:38.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 15:03:38.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431435,ok=431435,error=0, records=41
[INFO ] 2026-06-02 15:03:40.239 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 15:03:40.239 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 15:03:52.794 [11418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:03:53.087 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 15:03:53.087 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431436,ok=431436,error=0, records=41
[INFO ] 2026-06-02 15:03:55.240 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:04:07.799 [11393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:04:08.092 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-02 15:04:08.092 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431437,ok=431437,error=0, records=41
[INFO ] 2026-06-02 15:04:10.241 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:04:22.806 [11393] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:04:23.098 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-02 15:04:23.098 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431438,ok=431438,error=0, records=41
[INFO ] 2026-06-02 15:04:25.241 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:04:37.810 [11982] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:04:38.104 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10345, records=41
[INFO ] 2026-06-02 15:04:38.104 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431439,ok=431439,error=0, records=41
[INFO ] 2026-06-02 15:04:40.242 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:04:52.817 [11992] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:04:53.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-02 15:04:53.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431440,ok=431440,error=0, records=41
[INFO ] 2026-06-02 15:04:55.242 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:05:02.170 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21587/300s
[WARN ] 2026-06-02 15:05:07.823 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:05:08.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 15:05:08.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431441,ok=431441,error=0, records=41
[INFO ] 2026-06-02 15:05:10.243 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:05:22.828 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:05:23.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 15:05:23.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431442,ok=431442,error=0, records=41
[INFO ] 2026-06-02 15:05:25.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:05:32.137 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17972/300s
[INFO ] 2026-06-02 15:05:32.138 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840252},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:05:32.309 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:05:32.309 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 15:05:32.310 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:05:32.310 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:05:32.310 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:05:32.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:05:32.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:05:32.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:05:32.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:05:32.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:05:32.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:05:37.334 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21578/300s
[WARN ] 2026-06-02 15:05:37.834 [11422] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:05:38.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 15:05:38.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431443,ok=431443,error=0, records=41
[INFO ] 2026-06-02 15:05:40.244 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:05:43.546 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21587/300s
[WARN ] 2026-06-02 15:05:52.839 [11967] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:05:53.135 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11288, records=50
[INFO ] 2026-06-02 15:05:53.135 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431444,ok=431444,error=0, records=50
[INFO ] 2026-06-02 15:05:55.245 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:06:07.844 [11997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:06:08.141 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 15:06:08.141 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431445,ok=431445,error=0, records=41
[INFO ] 2026-06-02 15:06:10.245 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:06:22.849 [11997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:06:23.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 15:06:23.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431446,ok=431446,error=0, records=41
[INFO ] 2026-06-02 15:06:23.148 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21574/300s
[INFO ] 2026-06-02 15:06:25.246 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:06:37.854 [12089] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:06:38.157 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 15:06:38.157 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431447,ok=431447,error=0, records=41
[INFO ] 2026-06-02 15:06:40.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:06:52.859 [12089] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:06:53.162 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 15:06:53.162 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431448,ok=431448,error=0, records=41
[INFO ] 2026-06-02 15:06:55.048 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21583/300s
[INFO ] 2026-06-02 15:06:55.247 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:07:07.863 [12103] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:07:08.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 15:07:08.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431449,ok=431449,error=0, records=41
[INFO ] 2026-06-02 15:07:10.248 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:07:10.248 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21586/300s
[INFO ] 2026-06-02 15:07:13.684 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21574/300s
[WARN ] 2026-06-02 15:07:22.869 [12089] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:07:23.177 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 15:07:23.177 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431450,ok=431450,error=0, records=41
[INFO ] 2026-06-02 15:07:25.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:07:37.873 [12103] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:07:38.183 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 15:07:38.183 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431451,ok=431451,error=0, records=41
[INFO ] 2026-06-02 15:07:40.249 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:07:52.878 [12144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:07:53.191 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 15:07:53.191 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431452,ok=431452,error=0, records=41
[INFO ] 2026-06-02 15:07:55.250 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:08:02.684 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21584/300s
[INFO ] 2026-06-02 15:08:04.486 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21584/300s
[WARN ] 2026-06-02 15:08:07.882 [12159] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:08:08.197 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-02 15:08:08.197 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431453,ok=431453,error=0, records=41
[INFO ] 2026-06-02 15:08:10.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:08:11.291 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21584/300s
[WARN ] 2026-06-02 15:08:22.889 [12188] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:08:23.202 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-02 15:08:23.203 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431454,ok=431454,error=0, records=41
[INFO ] 2026-06-02 15:08:25.251 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:08:32.311 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840172},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:08:32.481 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:08:32.482 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 15:08:32.482 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:08:32.482 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:08:32.482 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:08:32.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:08:32.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:08:32.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:08:32.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:08:32.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:08:32.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:08:37.894 [12217] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:08:38.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-02 15:08:38.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431455,ok=431455,error=0, records=41
[INFO ] 2026-06-02 15:08:40.252 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:08:52.898 [12234] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:08:53.214 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-02 15:08:53.214 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431456,ok=431456,error=0, records=41
[INFO ] 2026-06-02 15:08:55.252 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:08:55.253 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-02 15:09:07.905 [12245] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:09:08.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 15:09:08.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431457,ok=431457,error=0, records=41
[INFO ] 2026-06-02 15:09:10.254 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:09:22.910 [12234] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:09:23.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 15:09:23.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431458,ok=431458,error=0, records=41
[INFO ] 2026-06-02 15:09:25.255 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:09:37.918 [12276] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:09:38.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 15:09:38.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431459,ok=431459,error=0, records=41
[INFO ] 2026-06-02 15:09:40.256 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:09:52.924 [12277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:09:53.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 15:09:53.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431460,ok=431460,error=0, records=41
[INFO ] 2026-06-02 15:09:55.256 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:10:02.173 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21588/300s
[WARN ] 2026-06-02 15:10:07.930 [12305] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:10:08.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 15:10:08.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431461,ok=431461,error=0, records=41
[INFO ] 2026-06-02 15:10:10.257 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:10:22.936 [12299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:10:23.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-02 15:10:23.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431462,ok=431462,error=0, records=41
[INFO ] 2026-06-02 15:10:25.257 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:10:37.442 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21579/300s
[WARN ] 2026-06-02 15:10:37.942 [12277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:10:38.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-02 15:10:38.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431463,ok=431463,error=0, records=41
[INFO ] 2026-06-02 15:10:40.258 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:10:43.553 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21588/300s
[WARN ] 2026-06-02 15:10:52.948 [12378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:10:53.261 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 15:10:53.261 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431464,ok=431464,error=0, records=41
[INFO ] 2026-06-02 15:10:55.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:11:07.952 [12277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:11:08.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-02 15:11:08.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431465,ok=431465,error=0, records=41
[INFO ] 2026-06-02 15:11:10.259 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:11:22.957 [12373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:11:23.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-02 15:11:23.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431466,ok=431466,error=0, records=41
[INFO ] 2026-06-02 15:11:23.275 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21575/300s
[INFO ] 2026-06-02 15:11:25.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:11:32.482 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17973/300s
[INFO ] 2026-06-02 15:11:32.483 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840096},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:11:32.707 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:11:32.708 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 15:11:32.708 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:11:32.708 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:11:32.708 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:11:32.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:11:32.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:11:32.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:11:32.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:11:32.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:11:32.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:11:37.962 [12402] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:11:38.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-02 15:11:38.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431467,ok=431467,error=0, records=41
[INFO ] 2026-06-02 15:11:40.260 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:11:52.966 [12416] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:11:53.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-02 15:11:53.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431468,ok=431468,error=0, records=41
[INFO ] 2026-06-02 15:11:55.101 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21584/300s
[INFO ] 2026-06-02 15:11:55.261 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:12:07.971 [12378] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:12:08.291 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 15:12:08.291 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431469,ok=431469,error=0, records=41
[INFO ] 2026-06-02 15:12:10.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.15MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:12:10.262 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21587/300s
[INFO ] 2026-06-02 15:12:13.865 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21575/300s
[WARN ] 2026-06-02 15:12:22.976 [12277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:12:23.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 15:12:23.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431470,ok=431470,error=0, records=41
[INFO ] 2026-06-02 15:12:25.262 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:12:37.980 [12277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:12:38.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 15:12:38.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431471,ok=431471,error=0, records=41
[INFO ] 2026-06-02 15:12:40.263 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:12:52.986 [12445] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:12:53.309 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 15:12:53.309 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431472,ok=431472,error=0, records=41
[INFO ] 2026-06-02 15:12:55.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:13:02.742 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21585/300s
[INFO ] 2026-06-02 15:13:04.544 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21585/300s
[WARN ] 2026-06-02 15:13:07.991 [12277] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:13:08.315 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 15:13:08.315 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431473,ok=431473,error=0, records=41
[INFO ] 2026-06-02 15:13:10.264 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:13:11.350 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21585/300s
[WARN ] 2026-06-02 15:13:22.995 [12388] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:13:23.320 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-02 15:13:23.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431474,ok=431474,error=0, records=41
[INFO ] 2026-06-02 15:13:25.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:13:38.000 [12388] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:13:38.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 15:13:38.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431475,ok=431475,error=0, records=41
[INFO ] 2026-06-02 15:13:40.265 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 15:13:40.265 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 15:13:53.005 [12544] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:13:53.334 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-02 15:13:53.334 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431476,ok=431476,error=0, records=41
[INFO ] 2026-06-02 15:13:55.266 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:14:08.010 [12558] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:14:08.340 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 15:14:08.340 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431477,ok=431477,error=0, records=41
[INFO ] 2026-06-02 15:14:10.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:14:23.016 [12487] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:14:23.345 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 15:14:23.345 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431478,ok=431478,error=0, records=41
[INFO ] 2026-06-02 15:14:25.267 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:14:32.710 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20840016},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:14:32.882 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:14:32.882 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 15:14:32.882 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:14:32.882 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:14:32.882 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:14:32.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:14:32.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:14:32.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:14:32.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:14:32.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:14:32.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:14:38.020 [12558] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:14:38.351 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 15:14:38.351 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431479,ok=431479,error=0, records=41
[INFO ] 2026-06-02 15:14:40.268 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:14:53.026 [12558] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:14:53.357 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 15:14:53.357 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431480,ok=431480,error=0, records=41
[INFO ] 2026-06-02 15:14:55.269 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:15:02.176 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21589/300s
[WARN ] 2026-06-02 15:15:08.030 [12601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:15:08.362 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 15:15:08.362 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431481,ok=431481,error=0, records=41
[INFO ] 2026-06-02 15:15:10.269 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:15:23.035 [12635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:15:23.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 15:15:23.381 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431482,ok=431482,error=0, records=41
[INFO ] 2026-06-02 15:15:25.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:15:37.546 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21580/300s
[WARN ] 2026-06-02 15:15:38.047 [12646] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:15:38.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-02 15:15:38.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431483,ok=431483,error=0, records=41
[INFO ] 2026-06-02 15:15:40.270 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:15:43.559 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21589/300s
[WARN ] 2026-06-02 15:15:53.051 [12664] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:15:53.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 15:15:53.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431484,ok=431484,error=0, records=41
[INFO ] 2026-06-02 15:15:55.271 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:16:07.557 [12686] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:16:08.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-02 15:16:08.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431485,ok=431485,error=0, records=41
[INFO ] 2026-06-02 15:16:10.271 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:16:22.562 [12697] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:16:23.404 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10149, records=41
[INFO ] 2026-06-02 15:16:23.404 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431486,ok=431486,error=0, records=41
[INFO ] 2026-06-02 15:16:23.404 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21576/300s
[INFO ] 2026-06-02 15:16:25.272 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:16:37.567 [12692] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:16:38.409 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 15:16:38.409 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431487,ok=431487,error=0, records=41
[INFO ] 2026-06-02 15:16:40.273 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:16:52.572 [12732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:16:53.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10147, records=41
[INFO ] 2026-06-02 15:16:53.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431488,ok=431488,error=0, records=41
[INFO ] 2026-06-02 15:16:55.166 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21585/300s
[INFO ] 2026-06-02 15:16:55.273 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:17:07.579 [12737] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:17:08.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-02 15:17:08.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431489,ok=431489,error=0, records=41
[INFO ] 2026-06-02 15:17:10.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:17:10.274 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21588/300s
[INFO ] 2026-06-02 15:17:14.045 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21576/300s
[WARN ] 2026-06-02 15:17:22.586 [12771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:17:23.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-02 15:17:23.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431490,ok=431490,error=0, records=41
[INFO ] 2026-06-02 15:17:25.274 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:17:32.882 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17974/300s
[INFO ] 2026-06-02 15:17:32.884 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839932},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:17:33.041 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:17:33.041 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 15:17:33.041 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:17:33.041 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:17:33.042 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:17:33.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:17:33.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:17:33.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:17:33.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:17:33.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:17:33.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:17:37.591 [12753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:17:38.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11037, records=42
[INFO ] 2026-06-02 15:17:38.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431491,ok=431491,error=0, records=42
[INFO ] 2026-06-02 15:17:40.275 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:17:52.596 [12789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:17:53.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10357, records=41
[INFO ] 2026-06-02 15:17:53.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431492,ok=431492,error=0, records=41
[INFO ] 2026-06-02 15:17:55.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:18:02.800 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21586/300s
[INFO ] 2026-06-02 15:18:04.602 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21586/300s
[WARN ] 2026-06-02 15:18:07.602 [12804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:18:08.451 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 15:18:08.451 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431493,ok=431493,error=0, records=41
[INFO ] 2026-06-02 15:18:10.276 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:18:11.408 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21586/300s
[WARN ] 2026-06-02 15:18:22.607 [12789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:18:23.455 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-02 15:18:23.455 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431494,ok=431494,error=0, records=41
[INFO ] 2026-06-02 15:18:25.277 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:18:37.612 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:18:38.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-02 15:18:38.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431495,ok=431495,error=0, records=41
[INFO ] 2026-06-02 15:18:40.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:18:52.618 [12799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:18:53.465 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-02 15:18:53.465 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431496,ok=431496,error=0, records=41
[INFO ] 2026-06-02 15:18:55.278 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:19:07.624 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:19:08.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 15:19:08.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431497,ok=431497,error=0, records=41
[INFO ] 2026-06-02 15:19:10.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:19:22.629 [12804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:19:23.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 15:19:23.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431498,ok=431498,error=0, records=41
[INFO ] 2026-06-02 15:19:25.279 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:19:37.634 [12799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:19:38.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 15:19:38.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431499,ok=431499,error=0, records=41
[INFO ] 2026-06-02 15:19:40.280 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:19:52.639 [12804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:19:53.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 15:19:53.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431500,ok=431500,error=0, records=41
[INFO ] 2026-06-02 15:19:55.281 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:20:02.180 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21590/300s
[WARN ] 2026-06-02 15:20:07.645 [12804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:20:08.505 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 15:20:08.505 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431501,ok=431501,error=0, records=41
[INFO ] 2026-06-02 15:20:10.281 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:20:22.651 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:20:23.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 15:20:23.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431502,ok=431502,error=0, records=41
[INFO ] 2026-06-02 15:20:25.282 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:20:33.043 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839856},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:20:33.226 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:20:33.226 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[]}
[INFO ] 2026-06-02 15:20:33.226 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:20:33.226 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:20:33.226 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:20:33.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:20:33.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:20:33.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:20:33.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:20:33.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:20:33.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:20:37.655 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21581/300s
[WARN ] 2026-06-02 15:20:37.656 [12789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:20:38.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 15:20:38.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431503,ok=431503,error=0, records=41
[INFO ] 2026-06-02 15:20:40.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:20:43.565 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21590/300s
[WARN ] 2026-06-02 15:20:52.661 [12804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:20:53.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 15:20:53.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431504,ok=431504,error=0, records=41
[INFO ] 2026-06-02 15:20:55.283 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:21:07.667 [12804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:21:08.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-06-02 15:21:08.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431505,ok=431505,error=0, records=41
[INFO ] 2026-06-02 15:21:10.284 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:21:22.671 [12753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:21:23.534 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-02 15:21:23.534 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431506,ok=431506,error=0, records=41
[INFO ] 2026-06-02 15:21:23.534 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21577/300s
[INFO ] 2026-06-02 15:21:25.284 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:21:37.675 [12789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:21:38.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-02 15:21:38.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431507,ok=431507,error=0, records=41
[INFO ] 2026-06-02 15:21:40.285 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:21:52.681 [12789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:21:53.562 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-02 15:21:53.562 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431508,ok=431508,error=0, records=41
[INFO ] 2026-06-02 15:21:55.220 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21586/300s
[INFO ] 2026-06-02 15:21:55.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:22:07.686 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:22:08.568 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 15:22:08.568 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431509,ok=431509,error=0, records=41
[INFO ] 2026-06-02 15:22:10.286 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:22:10.286 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21589/300s
[INFO ] 2026-06-02 15:22:14.224 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21577/300s
[WARN ] 2026-06-02 15:22:22.691 [12753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:22:23.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 15:22:23.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431510,ok=431510,error=0, records=41
[INFO ] 2026-06-02 15:22:25.287 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:22:37.697 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:22:38.580 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 15:22:38.580 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431511,ok=431511,error=0, records=41
[INFO ] 2026-06-02 15:22:40.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:22:52.702 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:22:53.616 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 15:22:53.616 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431512,ok=431512,error=0, records=41
[INFO ] 2026-06-02 15:22:55.288 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:23:02.858 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21587/300s
[INFO ] 2026-06-02 15:23:04.659 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21587/300s
[WARN ] 2026-06-02 15:23:07.708 [12753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:23:08.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 15:23:08.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431513,ok=431513,error=0, records=41
[INFO ] 2026-06-02 15:23:10.289 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:23:11.464 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21587/300s
[WARN ] 2026-06-02 15:23:22.714 [12753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:23:23.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 15:23:23.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431514,ok=431514,error=0, records=41
[INFO ] 2026-06-02 15:23:25.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:23:33.227 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17975/300s
[INFO ] 2026-06-02 15:23:33.228 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839780},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:23:33.379 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:23:33.379 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 15:23:33.379 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:23:33.379 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:23:33.379 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:23:33.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:23:33.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:23:33.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:23:33.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:23:33.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:23:33.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:23:37.718 [12753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:23:38.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 15:23:38.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431515,ok=431515,error=0, records=41
[INFO ] 2026-06-02 15:23:40.290 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 15:23:40.290 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 15:23:52.723 [12804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:23:53.639 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 15:23:53.639 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431516,ok=431516,error=0, records=41
[INFO ] 2026-06-02 15:23:55.291 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:23:55.291 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-02 15:24:07.727 [12753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:24:08.644 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 15:24:08.644 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431517,ok=431517,error=0, records=41
[INFO ] 2026-06-02 15:24:10.293 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:24:22.732 [12789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:24:23.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 15:24:23.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431518,ok=431518,error=0, records=41
[INFO ] 2026-06-02 15:24:25.293 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:24:37.736 [12789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:24:38.655 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 15:24:38.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431519,ok=431519,error=0, records=41
[INFO ] 2026-06-02 15:24:40.294 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:24:52.741 [12799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:24:53.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 15:24:53.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431520,ok=431520,error=0, records=41
[INFO ] 2026-06-02 15:24:55.294 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:25:02.183 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21591/300s
[WARN ] 2026-06-02 15:25:07.746 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:25:08.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 15:25:08.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431521,ok=431521,error=0, records=41
[INFO ] 2026-06-02 15:25:10.295 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:25:22.751 [12753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:25:23.671 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 15:25:23.671 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431522,ok=431522,error=0, records=41
[INFO ] 2026-06-02 15:25:25.296 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:25:37.756 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21582/300s
[WARN ] 2026-06-02 15:25:37.756 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:25:38.677 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 15:25:38.677 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431523,ok=431523,error=0, records=41
[INFO ] 2026-06-02 15:25:40.296 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:25:43.571 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21591/300s
[WARN ] 2026-06-02 15:25:52.761 [12804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:25:53.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 15:25:53.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431524,ok=431524,error=0, records=41
[INFO ] 2026-06-02 15:25:55.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:26:07.765 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:26:08.692 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-02 15:26:08.692 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431525,ok=431525,error=0, records=41
[INFO ] 2026-06-02 15:26:10.297 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:26:22.769 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:26:23.698 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 15:26:23.698 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431526,ok=431526,error=0, records=41
[INFO ] 2026-06-02 15:26:23.698 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21578/300s
[INFO ] 2026-06-02 15:26:25.298 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:26:33.380 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839716},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:26:33.544 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:26:33.544 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 15:26:33.544 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:26:33.544 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:26:33.544 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:26:33.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:26:33.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:26:33.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:26:33.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:26:33.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:26:33.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:26:37.774 [12789] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:26:38.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-02 15:26:38.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431527,ok=431527,error=0, records=41
[INFO ] 2026-06-02 15:26:40.298 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:26:52.779 [12804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:26:53.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 15:26:53.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431528,ok=431528,error=0, records=41
[INFO ] 2026-06-02 15:26:55.281 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21587/300s
[INFO ] 2026-06-02 15:26:55.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:27:07.785 [12753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:27:08.772 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 15:27:08.772 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431529,ok=431529,error=0, records=41
[INFO ] 2026-06-02 15:27:10.299 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:27:10.299 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21590/300s
[INFO ] 2026-06-02 15:27:14.404 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21578/300s
[WARN ] 2026-06-02 15:27:22.790 [12753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:27:23.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 15:27:23.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431530,ok=431530,error=0, records=41
[INFO ] 2026-06-02 15:27:25.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:27:37.794 [12799] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:27:38.783 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 15:27:38.783 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431531,ok=431531,error=0, records=41
[INFO ] 2026-06-02 15:27:40.300 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:27:52.800 [12804] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:27:53.788 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 15:27:53.788 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431532,ok=431532,error=0, records=41
[INFO ] 2026-06-02 15:27:55.301 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:28:02.904 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21588/300s
[INFO ] 2026-06-02 15:28:04.706 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21588/300s
[WARN ] 2026-06-02 15:28:07.806 [13376] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:28:08.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 15:28:08.794 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431533,ok=431533,error=0, records=41
[INFO ] 2026-06-02 15:28:10.302 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:28:11.512 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21588/300s
[WARN ] 2026-06-02 15:28:22.811 [13371] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:28:23.849 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 15:28:23.849 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431534,ok=431534,error=0, records=41
[INFO ] 2026-06-02 15:28:25.302 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:28:37.816 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:28:38.854 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 15:28:38.854 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431535,ok=431535,error=0, records=41
[INFO ] 2026-06-02 15:28:40.303 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:28:52.822 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:28:53.860 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 15:28:53.860 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431536,ok=431536,error=0, records=41
[INFO ] 2026-06-02 15:28:55.304 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:29:07.827 [12753] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:29:08.865 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 15:29:08.866 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431537,ok=431537,error=0, records=41
[INFO ] 2026-06-02 15:29:10.304 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:29:22.832 [13432] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:29:23.872 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 15:29:23.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431538,ok=431538,error=0, records=41
[INFO ] 2026-06-02 15:29:25.305 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:29:33.544 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17976/300s
[INFO ] 2026-06-02 15:29:33.546 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839640},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:29:33.715 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:29:33.715 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 15:29:33.715 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:29:33.715 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:29:33.715 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:29:33.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:29:33.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:29:33.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:29:33.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:29:33.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:29:33.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:29:37.838 [13461] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:29:38.881 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-02 15:29:38.881 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431539,ok=431539,error=0, records=41
[INFO ] 2026-06-02 15:29:40.306 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:29:52.843 [13371] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:29:53.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-02 15:29:53.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431540,ok=431540,error=0, records=41
[INFO ] 2026-06-02 15:29:55.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:30:02.186 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21592/300s
[WARN ] 2026-06-02 15:30:07.848 [12783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:30:08.891 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 15:30:08.891 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431541,ok=431541,error=0, records=41
[INFO ] 2026-06-02 15:30:10.307 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:30:22.854 [13471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:30:23.897 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 15:30:23.897 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431542,ok=431542,error=0, records=41
[INFO ] 2026-06-02 15:30:25.308 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:30:37.860 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21583/300s
[WARN ] 2026-06-02 15:30:37.860 [13371] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:30:38.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 15:30:38.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431543,ok=431543,error=0, records=41
[INFO ] 2026-06-02 15:30:40.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:30:43.578 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21592/300s
[WARN ] 2026-06-02 15:30:52.865 [13531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:30:53.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 15:30:53.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431544,ok=431544,error=0, records=41
[INFO ] 2026-06-02 15:30:55.309 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:31:07.870 [13471] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:31:08.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 15:31:08.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431545,ok=431545,error=0, records=41
[INFO ] 2026-06-02 15:31:10.310 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:31:22.877 [13558] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:31:23.918 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 15:31:23.918 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431546,ok=431546,error=0, records=41
[INFO ] 2026-06-02 15:31:23.918 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21579/300s
[INFO ] 2026-06-02 15:31:25.310 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:31:37.883 [13517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:31:38.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 15:31:38.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431547,ok=431547,error=0, records=41
[INFO ] 2026-06-02 15:31:40.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:31:52.888 [13584] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:31:53.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 15:31:53.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431548,ok=431548,error=0, records=41
[INFO ] 2026-06-02 15:31:55.311 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:31:55.340 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21588/300s
[WARN ] 2026-06-02 15:32:07.893 [13595] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:32:08.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10369, records=41
[INFO ] 2026-06-02 15:32:08.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431549,ok=431549,error=0, records=41
[INFO ] 2026-06-02 15:32:10.312 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:32:10.312 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21591/300s
[INFO ] 2026-06-02 15:32:14.587 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21579/300s
[WARN ] 2026-06-02 15:32:22.898 [13623] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:32:23.942 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-02 15:32:23.942 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431550,ok=431550,error=0, records=41
[INFO ] 2026-06-02 15:32:25.313 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.00MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:32:33.717 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839568},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:32:33.889 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:32:33.889 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 15:32:33.889 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:32:33.889 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:32:33.889 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:32:33.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:32:33.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:32:33.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:32:33.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:32:33.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:32:33.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:32:37.903 [13531] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:32:38.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-02 15:32:38.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431551,ok=431551,error=0, records=41
[INFO ] 2026-06-02 15:32:40.313 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:32:52.908 [13646] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:32:53.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-02 15:32:53.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431552,ok=431552,error=0, records=41
[INFO ] 2026-06-02 15:32:55.314 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:33:02.958 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21589/300s
[INFO ] 2026-06-02 15:33:04.760 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21589/300s
[WARN ] 2026-06-02 15:33:07.914 [13675] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:33:08.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 15:33:08.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431553,ok=431553,error=0, records=41
[INFO ] 2026-06-02 15:33:10.315 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:33:11.565 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21589/300s
[WARN ] 2026-06-02 15:33:22.919 [13686] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:33:23.966 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 15:33:23.966 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431554,ok=431554,error=0, records=41
[INFO ] 2026-06-02 15:33:25.315 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:33:37.925 [13697] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:33:38.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 15:33:38.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431555,ok=431555,error=0, records=41
[INFO ] 2026-06-02 15:33:40.316 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 15:33:40.316 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 15:33:52.930 [13697] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:33:53.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 15:33:53.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431556,ok=431556,error=0, records=41
[INFO ] 2026-06-02 15:33:55.317 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:34:07.934 [13747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:34:08.983 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 15:34:08.983 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431557,ok=431557,error=0, records=41
[INFO ] 2026-06-02 15:34:10.317 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:34:22.941 [13761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:34:23.989 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 15:34:23.989 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431558,ok=431558,error=0, records=41
[INFO ] 2026-06-02 15:34:25.318 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:34:37.947 [13736] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:34:38.996 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 15:34:38.996 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431559,ok=431559,error=0, records=41
[INFO ] 2026-06-02 15:34:40.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:34:52.952 [13787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:34:54.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 15:34:54.003 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431560,ok=431560,error=0, records=41
[INFO ] 2026-06-02 15:34:55.319 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:35:02.190 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21593/300s
[WARN ] 2026-06-02 15:35:07.957 [13771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:35:09.009 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 15:35:09.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431561,ok=431561,error=0, records=41
[INFO ] 2026-06-02 15:35:10.320 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:35:22.962 [13771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:35:24.018 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 15:35:24.018 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431562,ok=431562,error=0, records=41
[INFO ] 2026-06-02 15:35:25.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:35:33.889 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17977/300s
[INFO ] 2026-06-02 15:35:33.891 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839488},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:35:34.065 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:35:34.065 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 15:35:34.065 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:35:34.065 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:35:34.065 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:35:34.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:35:34.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:35:34.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:35:34.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:35:34.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:35:34.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:35:37.966 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21584/300s
[WARN ] 2026-06-02 15:35:37.966 [13771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:35:39.025 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 15:35:39.025 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431563,ok=431563,error=0, records=41
[INFO ] 2026-06-02 15:35:40.321 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:35:43.584 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21593/300s
[WARN ] 2026-06-02 15:35:52.970 [13815] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:35:54.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 15:35:54.036 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431564,ok=431564,error=0, records=41
[INFO ] 2026-06-02 15:35:55.322 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:36:07.976 [13815] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:36:09.041 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 15:36:09.041 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431565,ok=431565,error=0, records=41
[INFO ] 2026-06-02 15:36:10.322 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:36:22.981 [13815] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:36:24.048 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 15:36:24.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431566,ok=431566,error=0, records=41
[INFO ] 2026-06-02 15:36:24.048 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21580/300s
[INFO ] 2026-06-02 15:36:25.323 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:36:37.986 [13886] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:36:39.053 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 15:36:39.053 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431567,ok=431567,error=0, records=41
[INFO ] 2026-06-02 15:36:40.324 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:36:52.991 [13844] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:36:54.058 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 15:36:54.058 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431568,ok=431568,error=0, records=41
[INFO ] 2026-06-02 15:36:55.324 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:36:55.395 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21589/300s
[WARN ] 2026-06-02 15:37:07.996 [13777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:37:09.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 15:37:09.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431569,ok=431569,error=0, records=41
[INFO ] 2026-06-02 15:37:10.325 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:37:10.325 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21592/300s
[INFO ] 2026-06-02 15:37:14.758 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21580/300s
[WARN ] 2026-06-02 15:37:23.000 [13858] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:37:24.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 15:37:24.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431570,ok=431570,error=0, records=41
[INFO ] 2026-06-02 15:37:25.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:37:38.004 [13928] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:37:39.081 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 15:37:39.081 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431571,ok=431571,error=0, records=41
[INFO ] 2026-06-02 15:37:40.326 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:37:53.010 [13928] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:37:54.086 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 15:37:54.086 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431572,ok=431572,error=0, records=41
[INFO ] 2026-06-02 15:37:55.327 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:38:03.022 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21590/300s
[INFO ] 2026-06-02 15:38:04.824 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21590/300s
[WARN ] 2026-06-02 15:38:08.014 [13970] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:38:09.092 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 15:38:09.092 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431573,ok=431573,error=0, records=41
[INFO ] 2026-06-02 15:38:10.328 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:38:11.630 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21590/300s
[WARN ] 2026-06-02 15:38:23.020 [13970] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:38:24.097 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 15:38:24.097 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431574,ok=431574,error=0, records=41
[INFO ] 2026-06-02 15:38:25.328 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:38:34.067 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839424},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:38:34.220 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:38:34.221 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 15:38:34.221 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:38:34.221 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:38:34.221 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:38:34.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:38:34.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:38:34.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:38:34.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:38:34.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:38:34.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:38:38.024 [13998] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:38:39.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 15:38:39.103 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431575,ok=431575,error=0, records=41
[INFO ] 2026-06-02 15:38:40.329 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:38:53.029 [13858] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:38:54.109 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 15:38:54.109 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431576,ok=431576,error=0, records=41
[INFO ] 2026-06-02 15:38:55.330 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:38:55.330 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-02 15:39:08.034 [13777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:39:09.118 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-02 15:39:09.118 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431577,ok=431577,error=0, records=41
[INFO ] 2026-06-02 15:39:10.331 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:39:23.039 [13928] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:39:24.125 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 15:39:24.125 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431578,ok=431578,error=0, records=41
[INFO ] 2026-06-02 15:39:25.332 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:39:38.045 [14043] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:39:39.130 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 15:39:39.130 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431579,ok=431579,error=0, records=41
[INFO ] 2026-06-02 15:39:40.332 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:39:53.051 [13928] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:39:54.136 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 15:39:54.136 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431580,ok=431580,error=0, records=41
[INFO ] 2026-06-02 15:39:55.333 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.59MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:40:02.193 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21594/300s
[WARN ] 2026-06-02 15:40:07.556 [14079] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:40:09.142 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 15:40:09.142 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431581,ok=431581,error=0, records=41
[INFO ] 2026-06-02 15:40:10.334 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:40:22.561 [14095] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:40:24.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 15:40:24.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431582,ok=431582,error=0, records=41
[INFO ] 2026-06-02 15:40:25.334 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:40:37.565 [14108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:40:38.065 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21585/300s
[INFO ] 2026-06-02 15:40:39.152 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 15:40:39.152 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431583,ok=431583,error=0, records=41
[INFO ] 2026-06-02 15:40:40.335 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:40:43.590 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21594/300s
[WARN ] 2026-06-02 15:40:52.571 [14147] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:40:54.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 15:40:54.158 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431584,ok=431584,error=0, records=41
[INFO ] 2026-06-02 15:40:55.336 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:41:07.576 [14166] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:41:09.163 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 15:41:09.163 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431585,ok=431585,error=0, records=41
[INFO ] 2026-06-02 15:41:10.336 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:41:22.582 [14183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:41:24.168 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-02 15:41:24.168 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431586,ok=431586,error=0, records=41
[INFO ] 2026-06-02 15:41:24.168 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21581/300s
[INFO ] 2026-06-02 15:41:25.337 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:41:34.221 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17978/300s
[INFO ] 2026-06-02 15:41:34.223 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839348},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:41:34.385 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:41:34.385 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 15:41:34.386 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:41:34.386 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:41:34.386 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:41:34.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:41:34.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:41:34.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:41:34.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:41:34.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:41:34.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:41:37.586 [14202] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:41:39.173 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10333, records=41
[INFO ] 2026-06-02 15:41:39.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431587,ok=431587,error=0, records=41
[INFO ] 2026-06-02 15:41:40.338 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:41:52.593 [14201] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:41:54.178 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 15:41:54.178 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431588,ok=431588,error=0, records=41
[INFO ] 2026-06-02 15:41:55.338 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:41:55.454 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21590/300s
[WARN ] 2026-06-02 15:42:07.598 [14190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:42:09.183 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-02 15:42:09.183 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431589,ok=431589,error=0, records=41
[INFO ] 2026-06-02 15:42:10.339 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:42:10.339 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21593/300s
[INFO ] 2026-06-02 15:42:14.943 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21581/300s
[WARN ] 2026-06-02 15:42:22.603 [14190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:42:24.188 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-02 15:42:24.188 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431590,ok=431590,error=0, records=41
[INFO ] 2026-06-02 15:42:25.340 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:42:37.608 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:42:39.195 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-02 15:42:39.195 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431591,ok=431591,error=0, records=41
[INFO ] 2026-06-02 15:42:40.340 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:42:52.614 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:42:54.306 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10303, records=41
[INFO ] 2026-06-02 15:42:54.306 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431592,ok=431592,error=0, records=41
[INFO ] 2026-06-02 15:42:55.341 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:43:03.089 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21591/300s
[INFO ] 2026-06-02 15:43:04.891 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21591/300s
[WARN ] 2026-06-02 15:43:07.620 [14201] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:43:09.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 15:43:09.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431593,ok=431593,error=0, records=41
[INFO ] 2026-06-02 15:43:10.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:43:11.697 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21591/300s
[WARN ] 2026-06-02 15:43:22.625 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:43:24.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 15:43:24.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431594,ok=431594,error=0, records=41
[INFO ] 2026-06-02 15:43:25.342 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:43:37.630 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:43:39.322 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 15:43:39.322 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431595,ok=431595,error=0, records=41
[INFO ] 2026-06-02 15:43:40.343 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 15:43:40.343 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 15:43:52.635 [14213] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:43:54.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 15:43:54.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431596,ok=431596,error=0, records=41
[INFO ] 2026-06-02 15:43:55.344 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:44:07.640 [14202] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:44:09.334 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 15:44:09.334 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431597,ok=431597,error=0, records=41
[INFO ] 2026-06-02 15:44:10.344 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:44:22.645 [14202] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:44:24.340 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 15:44:24.340 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431598,ok=431598,error=0, records=41
[INFO ] 2026-06-02 15:44:25.345 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:44:34.387 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839280},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:44:34.555 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:44:34.555 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 15:44:34.555 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:44:34.555 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:44:34.555 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:44:34.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:44:34.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:44:34.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:44:34.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:44:34.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:44:34.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:44:37.651 [14190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:44:39.346 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 15:44:39.346 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431599,ok=431599,error=0, records=41
[INFO ] 2026-06-02 15:44:40.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:44:52.656 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:44:54.351 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 15:44:54.351 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431600,ok=431600,error=0, records=41
[INFO ] 2026-06-02 15:44:55.346 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:45:02.197 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21595/300s
[WARN ] 2026-06-02 15:45:07.661 [14190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:45:09.358 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 15:45:09.358 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431601,ok=431601,error=0, records=41
[INFO ] 2026-06-02 15:45:10.347 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:45:22.667 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:45:24.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 15:45:24.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431602,ok=431602,error=0, records=41
[INFO ] 2026-06-02 15:45:25.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:45:37.673 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:45:38.173 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21586/300s
[INFO ] 2026-06-02 15:45:39.368 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 15:45:39.368 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431603,ok=431603,error=0, records=41
[INFO ] 2026-06-02 15:45:40.348 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:45:43.597 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21595/300s
[WARN ] 2026-06-02 15:45:52.678 [14201] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:45:54.373 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 15:45:54.373 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431604,ok=431604,error=0, records=41
[INFO ] 2026-06-02 15:45:55.349 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:46:07.683 [14202] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:46:09.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 15:46:09.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431605,ok=431605,error=0, records=41
[INFO ] 2026-06-02 15:46:10.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:46:22.688 [14201] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:46:24.384 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 15:46:24.384 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431606,ok=431606,error=0, records=41
[INFO ] 2026-06-02 15:46:24.384 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21582/300s
[INFO ] 2026-06-02 15:46:25.350 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:46:37.695 [14201] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:46:39.388 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 15:46:39.389 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431607,ok=431607,error=0, records=41
[INFO ] 2026-06-02 15:46:40.351 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:46:52.699 [14202] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:46:54.394 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 15:46:54.394 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431608,ok=431608,error=0, records=41
[INFO ] 2026-06-02 15:46:55.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:46:55.510 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21591/300s
[WARN ] 2026-06-02 15:47:07.704 [14201] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:47:09.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 15:47:09.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431609,ok=431609,error=0, records=41
[INFO ] 2026-06-02 15:47:10.352 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:47:10.352 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21594/300s
[INFO ] 2026-06-02 15:47:15.127 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21582/300s
[WARN ] 2026-06-02 15:47:22.710 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:47:24.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 15:47:24.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431610,ok=431610,error=0, records=41
[INFO ] 2026-06-02 15:47:25.353 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:47:34.555 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17979/300s
[INFO ] 2026-06-02 15:47:34.557 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839204},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:47:34.726 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:47:34.726 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 15:47:34.727 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:47:34.727 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:47:34.727 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:47:34.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:47:34.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:47:34.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:47:34.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:47:34.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:47:34.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:47:37.714 [14213] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:47:39.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 15:47:39.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431611,ok=431611,error=0, records=41
[INFO ] 2026-06-02 15:47:40.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:47:52.719 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:47:54.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-02 15:47:54.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431612,ok=431612,error=0, records=41
[INFO ] 2026-06-02 15:47:55.354 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:48:03.159 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21592/300s
[INFO ] 2026-06-02 15:48:04.961 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21592/300s
[WARN ] 2026-06-02 15:48:07.724 [14190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:48:09.425 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 15:48:09.425 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431613,ok=431613,error=0, records=41
[INFO ] 2026-06-02 15:48:10.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:48:11.766 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21592/300s
[WARN ] 2026-06-02 15:48:22.730 [14213] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:48:24.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 15:48:24.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431614,ok=431614,error=0, records=41
[INFO ] 2026-06-02 15:48:25.355 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:48:37.735 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:48:39.482 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 15:48:39.482 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431615,ok=431615,error=0, records=41
[INFO ] 2026-06-02 15:48:40.356 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:48:52.741 [14190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:48:54.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 15:48:54.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431616,ok=431616,error=0, records=41
[INFO ] 2026-06-02 15:48:55.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:49:07.746 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:49:09.493 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 15:49:09.493 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431617,ok=431617,error=0, records=41
[INFO ] 2026-06-02 15:49:10.357 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:49:22.752 [14213] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:49:24.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 15:49:24.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431618,ok=431618,error=0, records=41
[INFO ] 2026-06-02 15:49:25.358 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:49:37.756 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:49:39.504 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 15:49:39.504 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431619,ok=431619,error=0, records=41
[INFO ] 2026-06-02 15:49:40.359 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:49:52.761 [14202] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:49:54.511 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 15:49:54.511 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431620,ok=431620,error=0, records=41
[INFO ] 2026-06-02 15:49:55.359 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:50:02.200 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21596/300s
[WARN ] 2026-06-02 15:50:07.765 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:50:09.518 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 15:50:09.518 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431621,ok=431621,error=0, records=41
[INFO ] 2026-06-02 15:50:10.360 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:50:22.769 [14190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:50:24.524 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 15:50:24.524 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431622,ok=431622,error=0, records=41
[INFO ] 2026-06-02 15:50:25.361 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:50:34.728 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839136},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:50:34.893 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:50:34.893 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 15:50:34.894 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:50:34.894 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:50:34.894 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:50:34.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:50:34.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:50:34.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:50:34.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:50:34.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:50:34.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:50:37.774 [14190] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:50:38.274 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21587/300s
[INFO ] 2026-06-02 15:50:39.529 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 15:50:39.529 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431623,ok=431623,error=0, records=41
[INFO ] 2026-06-02 15:50:40.361 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:50:43.604 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21596/300s
[WARN ] 2026-06-02 15:50:52.780 [14201] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:50:54.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 15:50:54.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431624,ok=431624,error=0, records=41
[INFO ] 2026-06-02 15:50:55.362 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:51:07.785 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:51:09.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 15:51:09.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431625,ok=431625,error=0, records=41
[INFO ] 2026-06-02 15:51:10.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:51:22.791 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:51:24.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 15:51:24.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431626,ok=431626,error=0, records=41
[INFO ] 2026-06-02 15:51:24.547 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21583/300s
[INFO ] 2026-06-02 15:51:25.363 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:51:37.795 [14202] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:51:39.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 15:51:39.552 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431627,ok=431627,error=0, records=41
[INFO ] 2026-06-02 15:51:40.364 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:51:52.800 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:51:54.557 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 15:51:54.557 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431628,ok=431628,error=0, records=41
[INFO ] 2026-06-02 15:51:55.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:51:55.568 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21592/300s
[WARN ] 2026-06-02 15:52:07.805 [14235] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:52:09.563 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 15:52:09.563 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431629,ok=431629,error=0, records=41
[INFO ] 2026-06-02 15:52:10.365 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:52:10.365 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21595/300s
[INFO ] 2026-06-02 15:52:15.310 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21583/300s
[WARN ] 2026-06-02 15:52:22.810 [14785] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:52:24.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 15:52:24.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431630,ok=431630,error=0, records=41
[INFO ] 2026-06-02 15:52:25.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:52:37.815 [14201] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:52:39.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 15:52:39.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431631,ok=431631,error=0, records=41
[INFO ] 2026-06-02 15:52:40.366 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:52:52.820 [14201] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:52:54.580 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 15:52:54.580 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431632,ok=431632,error=0, records=41
[INFO ] 2026-06-02 15:52:55.367 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:53:03.226 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21593/300s
[INFO ] 2026-06-02 15:53:05.027 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21593/300s
[WARN ] 2026-06-02 15:53:07.825 [14201] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:53:09.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 15:53:09.587 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431633,ok=431633,error=0, records=41
[INFO ] 2026-06-02 15:53:10.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:53:11.834 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21593/300s
[WARN ] 2026-06-02 15:53:22.831 [14795] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:53:24.591 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 15:53:24.591 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431634,ok=431634,error=0, records=41
[INFO ] 2026-06-02 15:53:25.368 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:53:34.894 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17980/300s
[INFO ] 2026-06-02 15:53:34.896 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839064},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:53:35.063 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:53:35.063 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 15:53:35.063 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:53:35.063 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:53:35.063 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:53:35.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:53:35.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:53:35.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:53:35.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:53:35.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:53:35.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:53:37.835 [14827] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:53:39.597 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 15:53:39.597 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431635,ok=431635,error=0, records=41
[INFO ] 2026-06-02 15:53:40.369 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 15:53:40.369 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 15:53:52.841 [14866] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:53:54.602 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 15:53:54.602 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431636,ok=431636,error=0, records=41
[INFO ] 2026-06-02 15:53:55.370 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:53:55.370 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-02 15:54:07.846 [14880] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:54:09.609 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 15:54:09.609 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431637,ok=431637,error=0, records=41
[INFO ] 2026-06-02 15:54:10.371 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:54:22.852 [14894] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:54:24.617 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 15:54:24.617 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431638,ok=431638,error=0, records=41
[INFO ] 2026-06-02 15:54:25.372 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:54:37.857 [14866] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:54:39.622 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 15:54:39.622 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431639,ok=431639,error=0, records=41
[INFO ] 2026-06-02 15:54:40.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:54:52.861 [14909] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:54:54.630 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 15:54:54.630 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431640,ok=431640,error=0, records=41
[INFO ] 2026-06-02 15:54:55.373 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:55:02.204 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21597/300s
[WARN ] 2026-06-02 15:55:07.866 [14866] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:55:09.635 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 15:55:09.635 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431641,ok=431641,error=0, records=41
[INFO ] 2026-06-02 15:55:10.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:55:22.871 [14866] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:55:24.641 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 15:55:24.641 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431642,ok=431642,error=0, records=41
[INFO ] 2026-06-02 15:55:25.374 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:55:37.876 [14909] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:55:38.376 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21588/300s
[INFO ] 2026-06-02 15:55:39.647 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 15:55:39.647 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431643,ok=431643,error=0, records=41
[INFO ] 2026-06-02 15:55:40.375 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:55:43.610 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21597/300s
[WARN ] 2026-06-02 15:55:52.881 [14966] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:55:54.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 15:55:54.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431644,ok=431644,error=0, records=41
[INFO ] 2026-06-02 15:55:55.375 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:56:07.886 [14993] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:56:09.659 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 15:56:09.659 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431645,ok=431645,error=0, records=41
[INFO ] 2026-06-02 15:56:10.376 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:56:22.891 [15019] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:56:24.755 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 15:56:24.755 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431646,ok=431646,error=0, records=41
[INFO ] 2026-06-02 15:56:24.755 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21584/300s
[INFO ] 2026-06-02 15:56:25.377 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.58MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:56:35.064 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20839000},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:56:35.256 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:56:35.256 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 15:56:35.256 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:56:35.256 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:56:35.256 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:56:35.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:56:35.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:56:35.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:56:35.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:56:35.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:56:35.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:56:37.897 [15019] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:56:39.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 15:56:39.765 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431647,ok=431647,error=0, records=41
[INFO ] 2026-06-02 15:56:40.377 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:56:52.901 [15019] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:56:54.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 15:56:54.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431648,ok=431648,error=0, records=41
[INFO ] 2026-06-02 15:56:55.378 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:56:55.624 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21593/300s
[WARN ] 2026-06-02 15:57:07.907 [15049] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:57:09.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 15:57:09.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431649,ok=431649,error=0, records=41
[INFO ] 2026-06-02 15:57:10.379 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:57:10.379 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21596/300s
[INFO ] 2026-06-02 15:57:15.491 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21584/300s
[WARN ] 2026-06-02 15:57:22.913 [15081] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:57:24.785 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 15:57:24.785 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431650,ok=431650,error=0, records=41
[INFO ] 2026-06-02 15:57:25.379 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:57:37.919 [15104] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:57:39.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10318, records=41
[INFO ] 2026-06-02 15:57:39.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431651,ok=431651,error=0, records=41
[INFO ] 2026-06-02 15:57:40.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:57:52.924 [15115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:57:54.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-06-02 15:57:54.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431652,ok=431652,error=0, records=41
[INFO ] 2026-06-02 15:57:55.380 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:58:03.290 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21594/300s
[INFO ] 2026-06-02 15:58:05.092 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21594/300s
[WARN ] 2026-06-02 15:58:07.929 [15138] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:58:09.858 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 15:58:09.858 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431653,ok=431653,error=0, records=41
[INFO ] 2026-06-02 15:58:10.381 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:58:11.898 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21594/300s
[WARN ] 2026-06-02 15:58:22.935 [15155] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:58:24.864 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 15:58:24.864 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431654,ok=431654,error=0, records=41
[INFO ] 2026-06-02 15:58:25.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:58:37.942 [15161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:58:39.870 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 15:58:39.870 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431655,ok=431655,error=0, records=41
[INFO ] 2026-06-02 15:58:40.382 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:58:52.949 [15150] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:58:54.875 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 15:58:54.875 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431656,ok=431656,error=0, records=41
[INFO ] 2026-06-02 15:58:55.383 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:59:07.953 [15161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:59:09.881 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 15:59:09.881 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431657,ok=431657,error=0, records=41
[INFO ] 2026-06-02 15:59:10.384 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:59:22.958 [15180] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:59:24.886 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 15:59:24.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431658,ok=431658,error=0, records=41
[INFO ] 2026-06-02 15:59:25.384 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 15:59:35.257 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17981/300s
[INFO ] 2026-06-02 15:59:35.258 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838932},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 15:59:35.417 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 15:59:35.417 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 15:59:35.417 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 15:59:35.417 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 15:59:35.417 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 15:59:35.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:59:35.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 15:59:35.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:59:35.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 15:59:35.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 15:59:35.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 15:59:37.963 [15223] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:59:39.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-02 15:59:39.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431659,ok=431659,error=0, records=41
[INFO ] 2026-06-02 15:59:40.385 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 15:59:52.967 [15150] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 15:59:54.899 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-02 15:59:54.899 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431660,ok=431660,error=0, records=41
[INFO ] 2026-06-02 15:59:55.385 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:00:02.207 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21598/300s
[WARN ] 2026-06-02 16:00:07.972 [15209] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:00:09.905 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 16:00:09.905 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431661,ok=431661,error=0, records=41
[INFO ] 2026-06-02 16:00:10.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:00:22.977 [15260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:00:24.910 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 16:00:24.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431662,ok=431662,error=0, records=41
[INFO ] 2026-06-02 16:00:25.386 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:00:37.983 [15238] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:00:38.483 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21589/300s
[INFO ] 2026-06-02 16:00:39.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 16:00:39.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431663,ok=431663,error=0, records=41
[INFO ] 2026-06-02 16:00:40.387 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:00:43.617 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21598/300s
[WARN ] 2026-06-02 16:00:52.988 [15161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:00:54.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 16:00:54.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431664,ok=431664,error=0, records=41
[INFO ] 2026-06-02 16:00:55.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:01:07.993 [15161] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:01:09.925 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 16:01:09.925 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431665,ok=431665,error=0, records=41
[INFO ] 2026-06-02 16:01:10.388 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:01:22.998 [15343] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:01:24.930 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 16:01:24.930 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431666,ok=431666,error=0, records=41
[INFO ] 2026-06-02 16:01:24.930 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21585/300s
[INFO ] 2026-06-02 16:01:25.389 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:01:38.005 [15238] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:01:39.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 16:01:39.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431667,ok=431667,error=0, records=41
[INFO ] 2026-06-02 16:01:40.389 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:01:53.011 [15180] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:01:54.942 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 16:01:54.942 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431668,ok=431668,error=0, records=41
[INFO ] 2026-06-02 16:01:55.390 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:01:55.680 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21594/300s
[WARN ] 2026-06-02 16:02:08.016 [15385] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:02:09.947 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 16:02:09.947 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431669,ok=431669,error=0, records=41
[INFO ] 2026-06-02 16:02:10.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:02:10.391 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21597/300s
[INFO ] 2026-06-02 16:02:15.672 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21585/300s
[WARN ] 2026-06-02 16:02:23.021 [15371] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:02:24.952 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 16:02:24.952 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431670,ok=431670,error=0, records=41
[INFO ] 2026-06-02 16:02:25.391 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:02:35.419 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838848},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:02:35.609 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:02:35.609 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 16:02:35.609 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:02:35.609 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:02:35.609 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:02:35.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:02:35.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:02:35.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:02:35.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:02:35.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:02:35.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:02:38.026 [15357] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:02:39.960 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 16:02:39.960 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431671,ok=431671,error=0, records=41
[INFO ] 2026-06-02 16:02:40.392 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:02:53.031 [15428] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:02:54.964 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 16:02:54.964 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431672,ok=431672,error=0, records=41
[INFO ] 2026-06-02 16:02:55.393 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:03:03.349 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21595/300s
[INFO ] 2026-06-02 16:03:05.151 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21595/300s
[WARN ] 2026-06-02 16:03:08.036 [15385] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:03:09.970 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 16:03:09.970 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431673,ok=431673,error=0, records=41
[INFO ] 2026-06-02 16:03:10.393 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:03:11.956 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21595/300s
[WARN ] 2026-06-02 16:03:23.041 [15454] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:03:24.976 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 16:03:24.976 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431674,ok=431674,error=0, records=41
[INFO ] 2026-06-02 16:03:25.394 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:03:38.047 [15476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:03:39.982 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 16:03:39.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431675,ok=431675,error=0, records=41
[INFO ] 2026-06-02 16:03:40.395 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 16:03:40.395 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 16:03:53.052 [15493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:03:54.988 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 16:03:54.988 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431676,ok=431676,error=0, records=41
[INFO ] 2026-06-02 16:03:55.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:04:07.558 [15514] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:04:09.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-02 16:04:09.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431677,ok=431677,error=0, records=41
[INFO ] 2026-06-02 16:04:10.396 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:04:22.563 [15534] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:04:24.999 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 16:04:24.999 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431678,ok=431678,error=0, records=41
[INFO ] 2026-06-02 16:04:25.397 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:04:37.568 [15553] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:04:40.005 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 16:04:40.005 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431679,ok=431679,error=0, records=41
[INFO ] 2026-06-02 16:04:40.398 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:04:52.574 [15559] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:04:55.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 16:04:55.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431680,ok=431680,error=0, records=41
[INFO ] 2026-06-02 16:04:55.398 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:05:02.211 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21599/300s
[WARN ] 2026-06-02 16:05:07.580 [15591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:05:10.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10336, records=41
[INFO ] 2026-06-02 16:05:10.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431681,ok=431681,error=0, records=41
[INFO ] 2026-06-02 16:05:10.399 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:05:22.585 [15546] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:05:25.020 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-02 16:05:25.020 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431682,ok=431682,error=0, records=41
[INFO ] 2026-06-02 16:05:25.400 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:05:35.609 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17982/300s
[INFO ] 2026-06-02 16:05:35.611 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838756},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:05:35.782 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:05:35.782 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 16:05:35.782 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:05:35.782 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:05:35.782 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:05:35.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:05:35.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:05:35.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:05:35.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:05:35.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:05:35.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:05:37.591 [15621] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:05:38.590 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21590/300s
[INFO ] 2026-06-02 16:05:40.025 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-02 16:05:40.025 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431683,ok=431683,error=0, records=41
[INFO ] 2026-06-02 16:05:40.400 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:05:43.623 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21599/300s
[WARN ] 2026-06-02 16:05:52.596 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:05:55.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 16:05:55.032 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431684,ok=431684,error=0, records=41
[INFO ] 2026-06-02 16:05:55.401 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:06:07.602 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:06:10.038 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 16:06:10.038 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431685,ok=431685,error=0, records=41
[INFO ] 2026-06-02 16:06:10.401 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:06:22.607 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:06:25.043 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 16:06:25.043 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431686,ok=431686,error=0, records=41
[INFO ] 2026-06-02 16:06:25.043 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21586/300s
[INFO ] 2026-06-02 16:06:25.402 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:06:37.613 [15625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:06:40.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 16:06:40.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431687,ok=431687,error=0, records=41
[INFO ] 2026-06-02 16:06:40.403 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:06:52.619 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:06:55.054 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 16:06:55.054 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431688,ok=431688,error=0, records=41
[INFO ] 2026-06-02 16:06:55.403 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:06:55.738 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21595/300s
[WARN ] 2026-06-02 16:07:07.624 [15625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:07:10.059 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10292, records=41
[INFO ] 2026-06-02 16:07:10.059 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431689,ok=431689,error=0, records=41
[INFO ] 2026-06-02 16:07:10.404 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:07:10.404 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21598/300s
[INFO ] 2026-06-02 16:07:15.857 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21586/300s
[WARN ] 2026-06-02 16:07:22.629 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:07:25.066 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 16:07:25.067 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431690,ok=431690,error=0, records=41
[INFO ] 2026-06-02 16:07:25.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:07:37.635 [15625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:07:40.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-02 16:07:40.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431691,ok=431691,error=0, records=41
[INFO ] 2026-06-02 16:07:40.405 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:07:52.641 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:07:55.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-02 16:07:55.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431692,ok=431692,error=0, records=41
[INFO ] 2026-06-02 16:07:55.406 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:08:03.422 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21596/300s
[INFO ] 2026-06-02 16:08:05.224 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21596/300s
[WARN ] 2026-06-02 16:08:07.646 [15617] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:08:10.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 16:08:10.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431693,ok=431693,error=0, records=41
[INFO ] 2026-06-02 16:08:10.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:08:12.030 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21596/300s
[WARN ] 2026-06-02 16:08:22.651 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:08:25.198 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 16:08:25.198 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431694,ok=431694,error=0, records=41
[INFO ] 2026-06-02 16:08:25.407 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:08:35.784 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838684},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:08:35.935 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:08:35.935 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 16:08:35.935 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:08:35.935 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:08:35.935 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:08:35.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:08:35.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:08:35.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:08:35.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:08:35.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:08:35.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:08:37.656 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:08:40.203 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 16:08:40.203 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431695,ok=431695,error=0, records=41
[INFO ] 2026-06-02 16:08:40.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:08:52.663 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:08:55.208 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 16:08:55.208 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431696,ok=431696,error=0, records=41
[INFO ] 2026-06-02 16:08:55.408 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=32.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:08:55.408 [908  ] core/self_monitor.cpp:195: will malloc_trim
[WARN ] 2026-06-02 16:09:07.669 [15625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:09:10.212 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-02 16:09:10.212 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431697,ok=431697,error=0, records=41
[INFO ] 2026-06-02 16:09:10.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:09:22.675 [15625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:09:25.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-02 16:09:25.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431698,ok=431698,error=0, records=41
[INFO ] 2026-06-02 16:09:25.410 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=25.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:09:37.681 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:09:40.222 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 16:09:40.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431699,ok=431699,error=0, records=41
[INFO ] 2026-06-02 16:09:40.411 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:09:52.687 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:09:55.228 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-02 16:09:55.228 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431700,ok=431700,error=0, records=41
[INFO ] 2026-06-02 16:09:55.411 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=26.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:10:02.214 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21600/300s
[WARN ] 2026-06-02 16:10:07.693 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:10:10.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 16:10:10.234 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431701,ok=431701,error=0, records=41
[INFO ] 2026-06-02 16:10:10.412 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:10:22.698 [15617] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:10:25.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 16:10:25.238 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431702,ok=431702,error=0, records=41
[INFO ] 2026-06-02 16:10:25.413 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:10:37.704 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:10:38.704 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21591/300s
[INFO ] 2026-06-02 16:10:40.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 16:10:40.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431703,ok=431703,error=0, records=41
[INFO ] 2026-06-02 16:10:40.413 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:10:43.630 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21600/300s
[WARN ] 2026-06-02 16:10:52.709 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:10:55.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 16:10:55.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431704,ok=431704,error=0, records=41
[INFO ] 2026-06-02 16:10:55.414 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:11:07.716 [15625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:11:10.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 16:11:10.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431705,ok=431705,error=0, records=41
[INFO ] 2026-06-02 16:11:10.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:11:22.722 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:11:25.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 16:11:25.266 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431706,ok=431706,error=0, records=41
[INFO ] 2026-06-02 16:11:25.266 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21587/300s
[INFO ] 2026-06-02 16:11:25.415 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:11:35.935 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17983/300s
[INFO ] 2026-06-02 16:11:35.937 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838616},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:11:36.093 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:11:36.093 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 16:11:36.093 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:11:36.094 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:11:36.094 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:11:36.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:11:36.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:11:36.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:11:36.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:11:36.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:11:36.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:11:37.727 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:11:40.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 16:11:40.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431707,ok=431707,error=0, records=41
[INFO ] 2026-06-02 16:11:40.416 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:11:50.245 [15533] sic/src/linux_system_information_collector.cpp:1324: /bin/udevadm info --query=property --name=/dev/vda1
DEVLINKS=/dev/disk/by-id/virtio-j6c1gqesu0zk5kutcqel-part1 /dev/disk/by-path/pci-0000:00:04.0-part1 /dev/disk/by-path/virtio-pci-0000:00:04.0-part1 /dev/disk/by-uuid/87ba1103-a0d7-49ef-a8ae-6ce1d3fd2453
DEVNAME=/dev/vda1
DEVPATH=/devices/pci0000:00/0000:00:04.0/virtio1/block/vda/vda1
DEVTYPE=partition
ID_FS_TYPE=ext4
ID_FS_USAGE=filesystem
ID_FS_UUID=87ba1103-a0d7-49ef-a8ae-6ce1d3fd2453
ID_FS_UUID_ENC=87ba1103-a0d7-49ef-a8ae-6ce1d3fd2453
ID_FS_VERSION=1.0
ID_PART_ENTRY_DISK=253:0
ID_PART_ENTRY_FLAGS=0x80
ID_PART_ENTRY_NUMBER=1
ID_PART_ENTRY_OFFSET=2048
ID_PART_ENTRY_SCHEME=dos
ID_PART_ENTRY_SIZE=209713119
ID_PART_ENTRY_TYPE=0x83
ID_PART_TABLE_TYPE=dos
ID_PATH=pci-0000:00:04.0
ID_PATH_TAG=pci-0000_00_04_0
ID_SERIAL=j6c1gqesu0zk5kutcqel
MAJOR=253
MINOR=1
SUBSYSTEM=block
TAGS=:systemd:
USEC_INITIALIZED=25124
[INFO ] 2026-06-02 16:11:50.245 [15533] sic/src/linux_system_information_collector.cpp:1335: queryDevSerialId: {"/dev/vda1":"j6c1gqesu0zk5kutcqel"}
[WARN ] 2026-06-02 16:11:52.732 [15625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:11:55.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 16:11:55.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431708,ok=431708,error=0, records=41
[INFO ] 2026-06-02 16:11:55.416 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:11:55.792 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21596/300s
[WARN ] 2026-06-02 16:12:07.737 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:12:10.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 16:12:10.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431709,ok=431709,error=0, records=41
[INFO ] 2026-06-02 16:12:10.417 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:12:10.417 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21599/300s
[INFO ] 2026-06-02 16:12:16.039 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21587/300s
[WARN ] 2026-06-02 16:12:22.742 [15617] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:12:25.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 16:12:25.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431710,ok=431710,error=0, records=41
[INFO ] 2026-06-02 16:12:25.418 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:12:37.747 [15602] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:12:40.300 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 16:12:40.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431711,ok=431711,error=0, records=41
[INFO ] 2026-06-02 16:12:40.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:12:52.752 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:12:55.305 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 16:12:55.305 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431712,ok=431712,error=0, records=41
[INFO ] 2026-06-02 16:12:55.419 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:13:03.472 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21597/300s
[INFO ] 2026-06-02 16:13:05.274 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21597/300s
[WARN ] 2026-06-02 16:13:07.758 [15625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:13:10.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 16:13:10.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431713,ok=431713,error=0, records=41
[INFO ] 2026-06-02 16:13:10.420 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:13:12.079 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21597/300s
[WARN ] 2026-06-02 16:13:22.763 [15617] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:13:25.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 16:13:25.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431714,ok=431714,error=0, records=41
[INFO ] 2026-06-02 16:13:25.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:13:37.769 [15602] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:13:40.326 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 16:13:40.326 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431715,ok=431715,error=0, records=41
[INFO ] 2026-06-02 16:13:40.421 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 16:13:40.421 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[WARN ] 2026-06-02 16:13:52.773 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:13:55.331 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 16:13:55.331 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431716,ok=431716,error=0, records=41
[INFO ] 2026-06-02 16:13:55.422 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:14:07.778 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:14:10.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 16:14:10.337 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431717,ok=431717,error=0, records=41
[INFO ] 2026-06-02 16:14:10.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:14:22.784 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:14:25.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 16:14:25.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431718,ok=431718,error=0, records=41
[INFO ] 2026-06-02 16:14:25.423 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:14:36.095 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838544},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:14:36.270 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:14:36.270 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 16:14:36.270 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:14:36.270 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:14:36.270 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:14:36.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:14:36.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:14:36.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:14:36.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:14:36.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:14:36.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:14:37.789 [15625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:14:40.347 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 16:14:40.347 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431719,ok=431719,error=0, records=41
[INFO ] 2026-06-02 16:14:40.424 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:14:52.795 [15602] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:14:55.351 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 16:14:55.351 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431720,ok=431720,error=0, records=41
[INFO ] 2026-06-02 16:14:55.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:15:02.218 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21601/300s
[WARN ] 2026-06-02 16:15:07.801 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:15:10.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-02 16:15:10.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431721,ok=431721,error=0, records=41
[INFO ] 2026-06-02 16:15:10.425 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:15:22.807 [15625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:15:25.370 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 16:15:25.370 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431722,ok=431722,error=0, records=41
[INFO ] 2026-06-02 16:15:25.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 16:15:32.812 [15533] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10449/stat), No such file or directory
[WARN ] 2026-06-02 16:15:32.813 [15533] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10450/stat), No such file or directory
[WARN ] 2026-06-02 16:15:32.814 [15533] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10460/stat), No such file or directory
[WARN ] 2026-06-02 16:15:37.813 [16183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:15:38.813 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21592/300s
[INFO ] 2026-06-02 16:15:40.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 16:15:40.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431723,ok=431723,error=0, records=41
[INFO ] 2026-06-02 16:15:40.426 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:15:43.638 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21601/300s
[WARN ] 2026-06-02 16:15:47.818 [16208] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10449/stat), No such file or directory
[WARN ] 2026-06-02 16:15:47.820 [16208] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10450/stat), No such file or directory
[WARN ] 2026-06-02 16:15:47.820 [16208] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10460/stat), No such file or directory
[WARN ] 2026-06-02 16:15:52.819 [16213] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:15:55.427 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.85MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-02 16:15:55.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 16:15:55.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431724,ok=431724,error=0, records=41
[WARN ] 2026-06-02 16:16:07.824 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:16:10.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:16:10.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 16:16:10.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431725,ok=431725,error=0, records=41
[WARN ] 2026-06-02 16:16:22.829 [16193] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:16:25.428 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:16:25.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 16:16:25.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431726,ok=431726,error=0, records=41
[INFO ] 2026-06-02 16:16:25.458 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21588/300s
[WARN ] 2026-06-02 16:16:37.835 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:16:40.429 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:16:40.501 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 16:16:40.501 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431727,ok=431727,error=0, records=41
[WARN ] 2026-06-02 16:16:52.840 [16208] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:16:55.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:16:55.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 16:16:55.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431728,ok=431728,error=0, records=41
[INFO ] 2026-06-02 16:16:55.850 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21597/300s
[WARN ] 2026-06-02 16:17:07.844 [16208] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:17:10.430 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:17:10.430 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21600/300s
[INFO ] 2026-06-02 16:17:10.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 16:17:10.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431729,ok=431729,error=0, records=41
[INFO ] 2026-06-02 16:17:16.222 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21588/300s
[WARN ] 2026-06-02 16:17:22.849 [16291] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:17:25.431 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:17:25.517 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 16:17:25.517 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431730,ok=431730,error=0, records=41
[INFO ] 2026-06-02 16:17:36.271 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17984/300s
[INFO ] 2026-06-02 16:17:36.272 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838460},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:17:36.447 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:17:36.447 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 16:17:36.447 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:17:36.447 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:17:36.447 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:17:36.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:17:36.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:17:36.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:17:36.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:17:36.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:17:36.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:17:37.855 [16291] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:17:40.432 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=28.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:17:40.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 16:17:40.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431731,ok=431731,error=0, records=41
[WARN ] 2026-06-02 16:17:52.859 [15635] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:17:55.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:17:55.528 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 16:17:55.528 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431732,ok=431732,error=0, records=41
[INFO ] 2026-06-02 16:18:03.537 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21598/300s
[INFO ] 2026-06-02 16:18:05.340 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21598/300s
[WARN ] 2026-06-02 16:18:07.864 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:18:10.433 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:18:10.533 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 16:18:10.533 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431733,ok=431733,error=0, records=41
[INFO ] 2026-06-02 16:18:12.147 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21598/300s
[WARN ] 2026-06-02 16:18:22.868 [16334] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:18:25.434 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:18:25.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 16:18:25.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431734,ok=431734,error=0, records=41
[WARN ] 2026-06-02 16:18:37.873 [16291] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:18:40.434 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:18:40.547 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 16:18:40.547 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431735,ok=431735,error=0, records=41
[WARN ] 2026-06-02 16:18:52.878 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:18:55.435 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:18:55.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 16:18:55.552 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431736,ok=431736,error=0, records=41
[WARN ] 2026-06-02 16:19:07.882 [15533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:19:10.436 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:19:10.558 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 16:19:10.558 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431737,ok=431737,error=0, records=41
[WARN ] 2026-06-02 16:19:22.887 [16409] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:19:25.436 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:19:25.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 16:19:25.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431738,ok=431738,error=0, records=41
[WARN ] 2026-06-02 16:19:37.893 [16382] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:19:40.437 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:19:40.571 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 16:19:40.571 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431739,ok=431739,error=0, records=41
[WARN ] 2026-06-02 16:19:52.898 [16436] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:19:55.438 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:19:55.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 16:19:55.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431740,ok=431740,error=0, records=41
[INFO ] 2026-06-02 16:20:02.222 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21602/300s
[WARN ] 2026-06-02 16:20:07.903 [16469] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:20:10.438 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:20:10.586 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 16:20:10.586 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431741,ok=431741,error=0, records=41
[WARN ] 2026-06-02 16:20:22.909 [16463] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:20:25.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:20:25.591 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 16:20:25.592 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431742,ok=431742,error=0, records=41
[INFO ] 2026-06-02 16:20:36.448 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838392},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:20:36.614 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:20:36.614 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 16:20:36.615 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:20:36.615 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:20:36.615 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:20:36.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:20:36.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:20:36.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:20:36.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:20:36.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:20:36.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:20:37.915 [16436] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:20:38.915 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21593/300s
[INFO ] 2026-06-02 16:20:40.439 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:20:40.596 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 16:20:40.596 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431743,ok=431743,error=0, records=41
[INFO ] 2026-06-02 16:20:43.645 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21602/300s
[WARN ] 2026-06-02 16:20:52.921 [16486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:20:55.440 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:20:55.601 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 16:20:55.601 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431744,ok=431744,error=0, records=41
[WARN ] 2026-06-02 16:21:07.927 [16532] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:21:10.441 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:21:10.607 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 16:21:10.607 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431745,ok=431745,error=0, records=41
[WARN ] 2026-06-02 16:21:22.933 [16543] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:21:25.441 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:21:25.614 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 16:21:25.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431746,ok=431746,error=0, records=41
[INFO ] 2026-06-02 16:21:25.614 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21589/300s
[WARN ] 2026-06-02 16:21:37.939 [16565] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:21:40.442 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:21:40.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 16:21:40.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431747,ok=431747,error=0, records=41
[WARN ] 2026-06-02 16:21:52.944 [16554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:21:55.443 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:21:55.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 16:21:55.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431748,ok=431748,error=0, records=41
[INFO ] 2026-06-02 16:21:55.916 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21598/300s
[WARN ] 2026-06-02 16:22:07.949 [16565] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:22:10.443 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:22:10.443 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21601/300s
[INFO ] 2026-06-02 16:22:10.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 16:22:10.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431749,ok=431749,error=0, records=41
[INFO ] 2026-06-02 16:22:16.411 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21589/300s
[WARN ] 2026-06-02 16:22:22.955 [16593] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:22:25.444 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:22:25.640 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 16:22:25.640 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431750,ok=431750,error=0, records=41
[WARN ] 2026-06-02 16:22:37.960 [16565] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:22:40.444 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:22:40.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 16:22:40.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431751,ok=431751,error=0, records=41
[WARN ] 2026-06-02 16:22:52.964 [16636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:22:55.445 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:22:55.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 16:22:55.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431752,ok=431752,error=0, records=41
[INFO ] 2026-06-02 16:23:03.616 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21599/300s
[INFO ] 2026-06-02 16:23:05.417 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21599/300s
[WARN ] 2026-06-02 16:23:07.969 [16594] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:23:10.446 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:23:10.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10274, records=41
[INFO ] 2026-06-02 16:23:10.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431753,ok=431753,error=0, records=41
[INFO ] 2026-06-02 16:23:12.224 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21599/300s
[WARN ] 2026-06-02 16:23:22.974 [16664] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:23:25.446 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:23:25.732 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 16:23:25.732 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431754,ok=431754,error=0, records=41
[INFO ] 2026-06-02 16:23:36.615 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17985/300s
[INFO ] 2026-06-02 16:23:36.617 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838320},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:23:36.783 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:23:36.783 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 16:23:36.784 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:23:36.784 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:23:36.784 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:23:36.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:23:36.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:23:36.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:23:36.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:23:36.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:23:36.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:23:37.979 [16678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:23:40.447 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 16:23:40.447 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 16:23:40.739 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 16:23:40.739 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431755,ok=431755,error=0, records=41
[WARN ] 2026-06-02 16:23:52.984 [16664] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:23:55.448 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:23:55.448 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 16:23:55.744 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 16:23:55.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431756,ok=431756,error=0, records=41
[WARN ] 2026-06-02 16:24:07.989 [16636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:24:10.449 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:24:10.751 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 16:24:10.751 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431757,ok=431757,error=0, records=41
[WARN ] 2026-06-02 16:24:22.994 [16636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:24:25.450 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:24:25.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 16:24:25.759 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431758,ok=431758,error=0, records=41
[WARN ] 2026-06-02 16:24:38.001 [16693] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:24:40.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:24:40.764 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 16:24:40.764 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431759,ok=431759,error=0, records=41
[WARN ] 2026-06-02 16:24:53.007 [16678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:24:55.451 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:24:55.769 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 16:24:55.769 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431760,ok=431760,error=0, records=41
[INFO ] 2026-06-02 16:25:02.225 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21603/300s
[WARN ] 2026-06-02 16:25:08.012 [16636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:25:10.452 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:25:10.775 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 16:25:10.775 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431761,ok=431761,error=0, records=41
[WARN ] 2026-06-02 16:25:23.016 [16678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:25:25.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:25:25.782 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 16:25:25.782 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431762,ok=431762,error=0, records=41
[WARN ] 2026-06-02 16:25:38.021 [16678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:25:39.021 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21594/300s
[INFO ] 2026-06-02 16:25:40.453 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:25:40.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 16:25:40.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431763,ok=431763,error=0, records=41
[INFO ] 2026-06-02 16:25:43.652 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21603/300s
[WARN ] 2026-06-02 16:25:53.026 [16707] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:25:55.454 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.18MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:25:55.792 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 16:25:55.792 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431764,ok=431764,error=0, records=41
[WARN ] 2026-06-02 16:26:08.032 [16761] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:26:10.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:26:10.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 16:26:10.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431765,ok=431765,error=0, records=41
[WARN ] 2026-06-02 16:26:23.037 [16636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:26:25.455 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:26:25.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 16:26:25.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431766,ok=431766,error=0, records=41
[INFO ] 2026-06-02 16:26:25.802 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21590/300s
[INFO ] 2026-06-02 16:26:36.785 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838256},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:26:36.964 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:26:36.964 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 16:26:36.964 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:26:36.964 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:26:36.964 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:26:37.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:26:37.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:26:37.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:26:37.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:26:37.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:26:37.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:26:38.043 [16636] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:26:40.456 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:26:40.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 16:26:40.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431767,ok=431767,error=0, records=41
[WARN ] 2026-06-02 16:26:53.048 [16870] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:26:55.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:26:55.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 16:26:55.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431768,ok=431768,error=0, records=41
[INFO ] 2026-06-02 16:26:55.974 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21599/300s
[WARN ] 2026-06-02 16:27:08.053 [16881] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:27:10.457 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:27:10.457 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21602/300s
[INFO ] 2026-06-02 16:27:10.820 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 16:27:10.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431769,ok=431769,error=0, records=41
[INFO ] 2026-06-02 16:27:16.591 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21590/300s
[WARN ] 2026-06-02 16:27:22.558 [16898] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:27:25.458 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:27:25.825 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 16:27:25.825 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431770,ok=431770,error=0, records=41
[WARN ] 2026-06-02 16:27:37.563 [16870] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:27:40.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:27:40.832 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 16:27:40.832 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431771,ok=431771,error=0, records=41
[WARN ] 2026-06-02 16:27:52.570 [16933] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:27:55.459 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:27:55.839 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 16:27:55.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431772,ok=431772,error=0, records=41
[INFO ] 2026-06-02 16:28:03.691 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21600/300s
[INFO ] 2026-06-02 16:28:05.492 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21600/300s
[WARN ] 2026-06-02 16:28:07.575 [16901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:28:10.460 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:28:10.844 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 16:28:10.844 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431773,ok=431773,error=0, records=41
[INFO ] 2026-06-02 16:28:12.298 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21600/300s
[WARN ] 2026-06-02 16:28:22.580 [16969] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:28:25.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:28:25.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 16:28:25.850 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431774,ok=431774,error=0, records=41
[WARN ] 2026-06-02 16:28:37.585 [16944] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:28:40.461 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:28:40.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 16:28:40.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431775,ok=431775,error=0, records=41
[WARN ] 2026-06-02 16:28:52.592 [16981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:28:55.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:28:55.868 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 16:28:55.868 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431776,ok=431776,error=0, records=41
[WARN ] 2026-06-02 16:29:07.597 [16980] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:29:10.462 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:29:10.873 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 16:29:10.873 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431777,ok=431777,error=0, records=41
[WARN ] 2026-06-02 16:29:22.601 [17018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:29:25.463 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:29:25.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 16:29:25.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431778,ok=431778,error=0, records=41
[INFO ] 2026-06-02 16:29:36.964 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17986/300s
[INFO ] 2026-06-02 16:29:36.966 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838184},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:29:37.128 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:29:37.129 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 16:29:37.129 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:29:37.129 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:29:37.129 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:29:37.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:29:37.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:29:37.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:29:37.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:29:37.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:29:37.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:29:37.607 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:29:40.464 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:29:40.893 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 16:29:40.893 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431779,ok=431779,error=0, records=41
[WARN ] 2026-06-02 16:29:52.612 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:29:55.464 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:29:55.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 16:29:55.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431780,ok=431780,error=0, records=41
[INFO ] 2026-06-02 16:30:02.229 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21604/300s
[WARN ] 2026-06-02 16:30:07.617 [16980] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:30:10.465 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:30:10.904 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 16:30:10.904 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431781,ok=431781,error=0, records=41
[WARN ] 2026-06-02 16:30:22.622 [16981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:30:25.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:30:25.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 16:30:25.910 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431782,ok=431782,error=0, records=41
[WARN ] 2026-06-02 16:30:37.627 [17018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:30:39.127 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21595/300s
[INFO ] 2026-06-02 16:30:40.466 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:30:40.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 16:30:40.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431783,ok=431783,error=0, records=41
[INFO ] 2026-06-02 16:30:43.659 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21604/300s
[WARN ] 2026-06-02 16:30:52.632 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:30:55.467 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:30:55.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 16:30:55.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431784,ok=431784,error=0, records=41
[WARN ] 2026-06-02 16:31:07.638 [16981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:31:10.467 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:31:10.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 16:31:10.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431785,ok=431785,error=0, records=41
[WARN ] 2026-06-02 16:31:22.642 [16981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:31:25.468 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:31:25.931 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 16:31:25.931 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431786,ok=431786,error=0, records=41
[INFO ] 2026-06-02 16:31:25.931 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21591/300s
[WARN ] 2026-06-02 16:31:37.649 [17018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:31:40.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:31:40.936 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 16:31:40.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431787,ok=431787,error=0, records=41
[WARN ] 2026-06-02 16:31:52.655 [17013] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:31:55.469 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:31:55.945 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 16:31:55.945 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431788,ok=431788,error=0, records=41
[INFO ] 2026-06-02 16:31:56.033 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21600/300s
[WARN ] 2026-06-02 16:32:07.661 [17018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:32:10.470 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:32:10.470 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21603/300s
[INFO ] 2026-06-02 16:32:10.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 16:32:10.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431789,ok=431789,error=0, records=41
[INFO ] 2026-06-02 16:32:16.778 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21591/300s
[WARN ] 2026-06-02 16:32:22.666 [17018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:32:25.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:32:25.958 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 16:32:25.958 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431790,ok=431790,error=0, records=41
[INFO ] 2026-06-02 16:32:37.130 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838108},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:32:37.299 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:32:37.299 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 16:32:37.299 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:32:37.299 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:32:37.299 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:32:37.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:32:37.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:32:37.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:32:37.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:32:37.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:32:37.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:32:37.670 [17013] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:32:40.471 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:32:40.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 16:32:40.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431791,ok=431791,error=0, records=41
[WARN ] 2026-06-02 16:32:52.676 [16980] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:32:55.472 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:32:55.970 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 16:32:55.970 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431792,ok=431792,error=0, records=41
[INFO ] 2026-06-02 16:33:03.766 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21601/300s
[INFO ] 2026-06-02 16:33:05.568 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21601/300s
[WARN ] 2026-06-02 16:33:07.681 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:33:10.472 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:33:10.982 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 16:33:10.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431793,ok=431793,error=0, records=41
[INFO ] 2026-06-02 16:33:12.374 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21601/300s
[WARN ] 2026-06-02 16:33:22.688 [16981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:33:25.473 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:33:26.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 16:33:26.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431794,ok=431794,error=0, records=41
[WARN ] 2026-06-02 16:33:37.692 [16981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:33:40.474 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 16:33:40.474 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 16:33:41.071 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 16:33:41.071 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431795,ok=431795,error=0, records=41
[WARN ] 2026-06-02 16:33:52.698 [16980] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:33:55.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:33:56.077 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 16:33:56.077 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431796,ok=431796,error=0, records=41
[WARN ] 2026-06-02 16:34:07.703 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:34:10.475 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:34:11.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 16:34:11.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431797,ok=431797,error=0, records=41
[WARN ] 2026-06-02 16:34:22.710 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:34:25.476 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:34:26.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 16:34:26.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431798,ok=431798,error=0, records=41
[WARN ] 2026-06-02 16:34:37.716 [17013] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:34:40.476 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:34:41.097 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 16:34:41.097 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431799,ok=431799,error=0, records=41
[WARN ] 2026-06-02 16:34:52.721 [17018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:34:55.477 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:34:56.103 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 16:34:56.103 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431800,ok=431800,error=0, records=41
[INFO ] 2026-06-02 16:35:02.233 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21605/300s
[WARN ] 2026-06-02 16:35:07.725 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:35:10.478 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:35:11.108 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 16:35:11.108 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431801,ok=431801,error=0, records=41
[WARN ] 2026-06-02 16:35:22.730 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:35:25.478 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:35:26.113 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 16:35:26.113 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431802,ok=431802,error=0, records=41
[INFO ] 2026-06-02 16:35:37.300 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17987/300s
[INFO ] 2026-06-02 16:35:37.301 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20838036},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:35:37.474 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:35:37.474 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 16:35:37.474 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:35:37.474 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:35:37.474 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:35:37.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:35:37.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:35:37.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:35:37.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:35:37.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:35:37.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:35:37.735 [17013] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:35:39.235 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21596/300s
[INFO ] 2026-06-02 16:35:40.479 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:35:41.120 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 16:35:41.120 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431803,ok=431803,error=0, records=41
[INFO ] 2026-06-02 16:35:43.665 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21605/300s
[WARN ] 2026-06-02 16:35:52.739 [16981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:35:55.480 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:35:56.127 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 16:35:56.127 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431804,ok=431804,error=0, records=41
[WARN ] 2026-06-02 16:36:07.746 [17013] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:36:10.480 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:36:11.132 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 16:36:11.132 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431805,ok=431805,error=0, records=41
[WARN ] 2026-06-02 16:36:22.750 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:36:25.481 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:36:26.139 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 16:36:26.139 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431806,ok=431806,error=0, records=41
[INFO ] 2026-06-02 16:36:26.139 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21592/300s
[WARN ] 2026-06-02 16:36:37.757 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:36:40.482 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:36:41.145 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10279, records=41
[INFO ] 2026-06-02 16:36:41.145 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431807,ok=431807,error=0, records=41
[WARN ] 2026-06-02 16:36:52.770 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:36:55.483 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:36:56.090 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21601/300s
[INFO ] 2026-06-02 16:36:56.150 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 16:36:56.150 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431808,ok=431808,error=0, records=41
[WARN ] 2026-06-02 16:37:07.775 [17018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:37:10.483 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:37:10.484 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21604/300s
[INFO ] 2026-06-02 16:37:11.156 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-02 16:37:11.156 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431809,ok=431809,error=0, records=41
[INFO ] 2026-06-02 16:37:16.964 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21592/300s
[WARN ] 2026-06-02 16:37:17.780 [16980] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/16131/stat), No such file or directory
[WARN ] 2026-06-02 16:37:17.780 [16980] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10477/stat), No such file or directory
[WARN ] 2026-06-02 16:37:22.781 [17013] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:37:25.484 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:37:26.163 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 16:37:26.163 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431810,ok=431810,error=0, records=41
[WARN ] 2026-06-02 16:37:32.784 [16980] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/16131/stat), No such file or directory
[WARN ] 2026-06-02 16:37:32.784 [16980] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10477/stat), No such file or directory
[WARN ] 2026-06-02 16:37:32.785 [16980] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/16155/stat), No such file or directory
[WARN ] 2026-06-02 16:37:32.785 [16980] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/16130/stat), No such file or directory
[WARN ] 2026-06-02 16:37:37.786 [17018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:37:40.485 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:37:41.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 16:37:41.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431811,ok=431811,error=0, records=41
[WARN ] 2026-06-02 16:37:47.791 [16981] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/16131/stat), No such file or directory
[WARN ] 2026-06-02 16:37:47.791 [16981] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/10477/stat), No such file or directory
[WARN ] 2026-06-02 16:37:47.791 [16981] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/16155/stat), No such file or directory
[WARN ] 2026-06-02 16:37:47.791 [16981] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/16130/stat), No such file or directory
[WARN ] 2026-06-02 16:37:52.792 [16996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:37:55.486 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:37:56.174 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 16:37:56.174 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431812,ok=431812,error=0, records=41
[INFO ] 2026-06-02 16:38:03.770 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21602/300s
[INFO ] 2026-06-02 16:38:05.571 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21602/300s
[WARN ] 2026-06-02 16:38:07.797 [17018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:38:10.486 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:38:11.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 16:38:11.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431813,ok=431813,error=0, records=41
[INFO ] 2026-06-02 16:38:12.377 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21602/300s
[WARN ] 2026-06-02 16:38:22.803 [17018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:38:25.487 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:38:26.186 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 16:38:26.186 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431814,ok=431814,error=0, records=41
[INFO ] 2026-06-02 16:38:37.476 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837940},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:38:37.644 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:38:37.644 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 16:38:37.645 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:38:37.645 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:38:37.645 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:38:37.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:38:37.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:38:37.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:38:37.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:38:37.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:38:37.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:38:37.809 [17620] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:38:40.487 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:38:41.194 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 16:38:41.194 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431815,ok=431815,error=0, records=41
[WARN ] 2026-06-02 16:38:52.814 [17641] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:38:55.488 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:38:55.488 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 16:38:56.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 16:38:56.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431816,ok=431816,error=0, records=41
[WARN ] 2026-06-02 16:39:07.819 [17018] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:39:10.490 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:39:11.205 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 16:39:11.205 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431817,ok=431817,error=0, records=41
[WARN ] 2026-06-02 16:39:22.825 [17620] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:39:25.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:39:26.211 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 16:39:26.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431818,ok=431818,error=0, records=41
[WARN ] 2026-06-02 16:39:37.830 [17683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:39:40.491 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:39:41.216 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 16:39:41.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431819,ok=431819,error=0, records=41
[WARN ] 2026-06-02 16:39:52.836 [17698] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:39:55.492 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:39:56.221 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 16:39:56.222 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431820,ok=431820,error=0, records=41
[INFO ] 2026-06-02 16:40:02.236 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21606/300s
[WARN ] 2026-06-02 16:40:07.841 [17683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:40:10.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:40:11.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 16:40:11.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431821,ok=431821,error=0, records=41
[WARN ] 2026-06-02 16:40:22.846 [17727] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:40:25.493 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.52MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:40:26.232 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 16:40:26.232 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431822,ok=431822,error=0, records=41
[WARN ] 2026-06-02 16:40:37.852 [17741] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:40:39.352 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21597/300s
[INFO ] 2026-06-02 16:40:40.494 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:40:41.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-02 16:40:41.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431823,ok=431823,error=0, records=41
[INFO ] 2026-06-02 16:40:43.671 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21606/300s
[WARN ] 2026-06-02 16:40:52.857 [17683] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:40:55.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:40:56.243 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10266, records=41
[INFO ] 2026-06-02 16:40:56.243 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431824,ok=431824,error=0, records=41
[WARN ] 2026-06-02 16:41:07.864 [17755] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:41:10.495 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:41:11.249 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10272, records=41
[INFO ] 2026-06-02 16:41:11.249 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431825,ok=431825,error=0, records=41
[WARN ] 2026-06-02 16:41:22.867 [17713] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:41:25.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:41:26.254 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 16:41:26.254 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431826,ok=431826,error=0, records=41
[INFO ] 2026-06-02 16:41:26.254 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21593/300s
[INFO ] 2026-06-02 16:41:37.645 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17988/300s
[INFO ] 2026-06-02 16:41:37.646 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837868},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:41:37.827 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:41:37.827 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 16:41:37.827 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:41:37.827 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:41:37.827 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:41:37.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:41:37.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:41:37.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:41:37.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:41:37.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:41:37.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 16:41:37.872 [17713] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:41:40.496 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.40MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:41:41.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 16:41:41.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431827,ok=431827,error=0, records=41
[WARN ] 2026-06-02 16:41:52.877 [17818] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:41:55.497 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:41:56.152 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21602/300s
[INFO ] 2026-06-02 16:41:56.266 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 16:41:56.267 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431828,ok=431828,error=0, records=41
[WARN ] 2026-06-02 16:42:07.883 [17824] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:42:10.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:42:10.498 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21605/300s
[INFO ] 2026-06-02 16:42:11.272 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 16:42:11.272 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431829,ok=431829,error=0, records=41
[INFO ] 2026-06-02 16:42:17.153 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21593/300s
[WARN ] 2026-06-02 16:42:22.888 [17851] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:42:25.498 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.60MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:42:26.277 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 16:42:26.278 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431830,ok=431830,error=0, records=41
[WARN ] 2026-06-02 16:42:37.894 [17845] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:42:40.499 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:42:41.282 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 16:42:41.283 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431831,ok=431831,error=0, records=41
[WARN ] 2026-06-02 16:42:52.898 [17862] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:42:55.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:42:56.289 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 16:42:56.289 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431832,ok=431832,error=0, records=41
[INFO ] 2026-06-02 16:43:03.843 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21603/300s
[INFO ] 2026-06-02 16:43:05.645 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21603/300s
[WARN ] 2026-06-02 16:43:07.903 [17895] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:43:10.500 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:43:11.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 16:43:11.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431833,ok=431833,error=0, records=41
[INFO ] 2026-06-02 16:43:12.451 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21603/300s
[WARN ] 2026-06-02 16:43:22.909 [17907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:43:25.501 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:43:26.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 16:43:26.302 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431834,ok=431834,error=0, records=41
[WARN ] 2026-06-02 16:43:37.915 [17907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:43:40.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 16:43:40.502 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 16:43:41.308 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 16:43:41.308 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431835,ok=431835,error=0, records=41
[WARN ] 2026-06-02 16:43:52.921 [17948] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:43:55.502 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:43:56.313 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 16:43:56.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431836,ok=431836,error=0, records=41
[WARN ] 2026-06-02 16:44:07.932 [17948] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:44:10.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:44:11.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 16:44:11.319 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431837,ok=431837,error=0, records=41
[WARN ] 2026-06-02 16:44:22.938 [17978] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:44:25.503 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:44:26.327 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 16:44:26.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431838,ok=431838,error=0, records=41
[INFO ] 2026-06-02 16:44:37.829 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837800},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-02 16:44:37.944 [17989] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:44:37.982 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:44:37.982 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 16:44:37.983 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:44:37.983 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:44:37.983 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:44:38.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:44:38.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:44:38.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:44:38.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:44:38.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:44:38.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:44:40.504 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:44:41.332 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 16:44:41.332 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431839,ok=431839,error=0, records=41
[WARN ] 2026-06-02 16:44:52.949 [17994] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:44:55.505 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:44:56.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 16:44:56.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431840,ok=431840,error=0, records=41
[INFO ] 2026-06-02 16:45:02.240 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21607/300s
[WARN ] 2026-06-02 16:45:07.955 [17973] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:45:10.505 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:45:11.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-02 16:45:11.344 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431841,ok=431841,error=0, records=41
[WARN ] 2026-06-02 16:45:22.960 [18035] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:45:25.506 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:45:26.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-02 16:45:26.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431842,ok=431842,error=0, records=41
[WARN ] 2026-06-02 16:45:37.966 [18006] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:45:39.467 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21598/300s
[INFO ] 2026-06-02 16:45:40.507 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:45:41.399 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-02 16:45:41.399 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431843,ok=431843,error=0, records=41
[INFO ] 2026-06-02 16:45:43.678 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21607/300s
[WARN ] 2026-06-02 16:45:52.972 [18063] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:45:55.507 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:45:56.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-02 16:45:56.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431844,ok=431844,error=0, records=41
[WARN ] 2026-06-02 16:46:07.977 [18063] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:46:10.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:46:11.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 16:46:11.417 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431845,ok=431845,error=0, records=41
[WARN ] 2026-06-02 16:46:22.982 [18063] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:46:25.508 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:46:26.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 16:46:26.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431846,ok=431846,error=0, records=41
[INFO ] 2026-06-02 16:46:26.423 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21594/300s
[WARN ] 2026-06-02 16:46:37.987 [18105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:46:40.509 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:46:41.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 16:46:41.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431847,ok=431847,error=0, records=41
[WARN ] 2026-06-02 16:46:52.994 [17994] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:46:55.510 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:46:56.208 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21603/300s
[INFO ] 2026-06-02 16:46:56.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 16:46:56.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431848,ok=431848,error=0, records=41
[WARN ] 2026-06-02 16:47:07.998 [18063] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:47:10.510 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:47:10.510 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21606/300s
[INFO ] 2026-06-02 16:47:11.440 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 16:47:11.440 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431849,ok=431849,error=0, records=41
[INFO ] 2026-06-02 16:47:17.336 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21594/300s
[WARN ] 2026-06-02 16:47:23.004 [18035] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:47:25.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:47:26.502 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 16:47:26.502 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431850,ok=431850,error=0, records=41
[INFO ] 2026-06-02 16:47:37.983 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17989/300s
[INFO ] 2026-06-02 16:47:37.984 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837732},"versionInfo":{"version":"3.5.10"}}
[WARN ] 2026-06-02 16:47:38.009 [18132] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:47:38.150 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:47:38.150 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 16:47:38.150 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:47:38.150 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:47:38.150 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:47:38.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:47:38.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:47:38.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:47:38.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:47:38.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:47:38.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:47:40.511 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:47:41.507 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 16:47:41.507 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431851,ok=431851,error=0, records=41
[WARN ] 2026-06-02 16:47:53.014 [18035] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:47:55.512 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:47:56.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 16:47:56.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431852,ok=431852,error=0, records=41
[INFO ] 2026-06-02 16:48:03.909 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21604/300s
[INFO ] 2026-06-02 16:48:05.710 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21604/300s
[WARN ] 2026-06-02 16:48:08.019 [18105] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:48:10.513 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:48:11.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 16:48:11.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431853,ok=431853,error=0, records=41
[INFO ] 2026-06-02 16:48:12.516 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21604/300s
[WARN ] 2026-06-02 16:48:23.024 [18063] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:48:25.513 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:48:26.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 16:48:26.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431854,ok=431854,error=0, records=41
[WARN ] 2026-06-02 16:48:38.029 [18063] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:48:40.514 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:48:41.533 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 16:48:41.534 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431855,ok=431855,error=0, records=41
[WARN ] 2026-06-02 16:48:53.035 [18159] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:48:55.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:48:56.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 16:48:56.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431856,ok=431856,error=0, records=41
[WARN ] 2026-06-02 16:49:08.042 [18241] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:49:10.515 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:49:11.548 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-02 16:49:11.548 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431857,ok=431857,error=0, records=41
[WARN ] 2026-06-02 16:49:23.047 [18262] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:49:25.516 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:49:26.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-02 16:49:26.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431858,ok=431858,error=0, records=41
[WARN ] 2026-06-02 16:49:38.052 [18241] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:49:40.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:49:41.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-02 16:49:41.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431859,ok=431859,error=0, records=41
[WARN ] 2026-06-02 16:49:52.558 [18294] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:49:55.517 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:49:56.565 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-02 16:49:56.565 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431860,ok=431860,error=0, records=41
[INFO ] 2026-06-02 16:50:02.243 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21608/300s
[WARN ] 2026-06-02 16:50:07.563 [18321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:50:10.518 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:50:11.571 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 16:50:11.571 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431861,ok=431861,error=0, records=41
[WARN ] 2026-06-02 16:50:22.568 [18328] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:50:25.519 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:50:26.585 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 16:50:26.585 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431862,ok=431862,error=0, records=41
[WARN ] 2026-06-02 16:50:37.573 [18299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:50:38.152 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837660},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:50:38.325 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:50:38.325 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 16:50:38.325 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:50:38.326 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:50:38.326 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:50:38.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:50:38.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:50:38.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:50:38.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:50:38.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:50:38.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:50:39.574 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21599/300s
[INFO ] 2026-06-02 16:50:40.519 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:50:41.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 16:50:41.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431863,ok=431863,error=0, records=41
[INFO ] 2026-06-02 16:50:43.685 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21608/300s
[WARN ] 2026-06-02 16:50:52.579 [18372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:50:55.520 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.76MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:50:56.597 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 16:50:56.597 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431864,ok=431864,error=0, records=41
[WARN ] 2026-06-02 16:51:07.584 [18386] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:51:10.520 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:51:11.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 16:51:11.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431865,ok=431865,error=0, records=41
[WARN ] 2026-06-02 16:51:22.589 [18410] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:51:25.521 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:51:26.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 16:51:26.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431866,ok=431866,error=0, records=41
[INFO ] 2026-06-02 16:51:26.611 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21595/300s
[WARN ] 2026-06-02 16:51:37.594 [18410] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:51:40.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:51:41.618 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 16:51:41.618 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431867,ok=431867,error=0, records=41
[WARN ] 2026-06-02 16:51:52.600 [18407] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:51:55.522 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:51:56.266 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21604/300s
[INFO ] 2026-06-02 16:51:56.622 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 16:51:56.623 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431868,ok=431868,error=0, records=41
[WARN ] 2026-06-02 16:52:07.607 [18438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:52:10.523 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:52:10.523 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21607/300s
[INFO ] 2026-06-02 16:52:11.628 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 16:52:11.628 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431869,ok=431869,error=0, records=41
[INFO ] 2026-06-02 16:52:17.515 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21595/300s
[WARN ] 2026-06-02 16:52:22.613 [18299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:52:25.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:52:26.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10271, records=41
[INFO ] 2026-06-02 16:52:26.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431870,ok=431870,error=0, records=41
[WARN ] 2026-06-02 16:52:37.617 [18407] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:52:40.524 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:52:41.640 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 16:52:41.640 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431871,ok=431871,error=0, records=41
[WARN ] 2026-06-02 16:52:52.622 [18407] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:52:55.525 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:52:56.646 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 16:52:56.646 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431872,ok=431872,error=0, records=41
[INFO ] 2026-06-02 16:53:03.975 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21605/300s
[INFO ] 2026-06-02 16:53:05.776 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21605/300s
[WARN ] 2026-06-02 16:53:07.628 [18404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:53:10.526 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:53:11.652 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 16:53:11.652 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431873,ok=431873,error=0, records=41
[INFO ] 2026-06-02 16:53:12.582 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21605/300s
[WARN ] 2026-06-02 16:53:22.634 [18407] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:53:25.526 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:53:26.657 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 16:53:26.657 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431874,ok=431874,error=0, records=41
[WARN ] 2026-06-02 16:53:37.639 [18299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:53:38.326 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17990/300s
[INFO ] 2026-06-02 16:53:38.327 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837584},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:53:38.507 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:53:38.507 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[]}
[INFO ] 2026-06-02 16:53:38.507 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:53:38.507 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:53:38.507 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:53:38.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:53:38.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:53:38.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:53:38.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:53:38.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:53:38.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:53:40.527 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 16:53:40.527 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 16:53:41.663 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 16:53:41.663 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431875,ok=431875,error=0, records=41
[WARN ] 2026-06-02 16:53:52.645 [18299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:53:55.528 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:53:55.528 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 16:53:56.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 16:53:56.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431876,ok=431876,error=0, records=41
[WARN ] 2026-06-02 16:54:07.651 [18438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:54:10.529 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.78MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:54:11.676 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-02 16:54:11.676 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431877,ok=431877,error=0, records=41
[WARN ] 2026-06-02 16:54:22.655 [18299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:54:25.530 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:54:26.682 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 16:54:26.682 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431878,ok=431878,error=0, records=41
[WARN ] 2026-06-02 16:54:37.661 [18407] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:54:40.530 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:54:41.689 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 16:54:41.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431879,ok=431879,error=0, records=41
[WARN ] 2026-06-02 16:54:52.666 [18407] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:54:55.531 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:54:56.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 16:54:56.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431880,ok=431880,error=0, records=41
[INFO ] 2026-06-02 16:55:02.247 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21609/300s
[WARN ] 2026-06-02 16:55:07.671 [18410] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:55:10.532 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:55:11.702 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 16:55:11.702 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431881,ok=431881,error=0, records=41
[WARN ] 2026-06-02 16:55:22.675 [18410] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:55:25.532 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:55:26.710 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 16:55:26.710 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431882,ok=431882,error=0, records=41
[WARN ] 2026-06-02 16:55:37.680 [18438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:55:39.681 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21600/300s
[INFO ] 2026-06-02 16:55:40.533 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:55:41.714 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10277, records=41
[INFO ] 2026-06-02 16:55:41.714 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431883,ok=431883,error=0, records=41
[INFO ] 2026-06-02 16:55:43.691 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21609/300s
[WARN ] 2026-06-02 16:55:52.685 [18404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:55:55.533 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:55:56.719 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 16:55:56.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431884,ok=431884,error=0, records=41
[WARN ] 2026-06-02 16:56:07.691 [18404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:56:10.534 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:56:11.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 16:56:11.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431885,ok=431885,error=0, records=41
[WARN ] 2026-06-02 16:56:22.697 [18410] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:56:25.535 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:56:26.730 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 16:56:26.730 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431886,ok=431886,error=0, records=41
[INFO ] 2026-06-02 16:56:26.730 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21596/300s
[WARN ] 2026-06-02 16:56:37.702 [18407] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:56:38.508 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837524},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:56:38.679 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:56:38.680 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[]}
[INFO ] 2026-06-02 16:56:38.680 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:56:38.680 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:56:38.680 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:56:38.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:56:38.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:56:38.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:56:38.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:56:38.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:56:38.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:56:40.535 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:56:41.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 16:56:41.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431887,ok=431887,error=0, records=41
[WARN ] 2026-06-02 16:56:52.707 [18404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:56:55.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:56:56.318 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21605/300s
[INFO ] 2026-06-02 16:56:56.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 16:56:56.765 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431888,ok=431888,error=0, records=41
[WARN ] 2026-06-02 16:57:07.713 [18404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:57:10.536 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:57:10.536 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21608/300s
[INFO ] 2026-06-02 16:57:11.771 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-02 16:57:11.771 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431889,ok=431889,error=0, records=41
[INFO ] 2026-06-02 16:57:17.696 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21596/300s
[WARN ] 2026-06-02 16:57:22.718 [18438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:57:25.537 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:57:26.843 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10343, records=41
[INFO ] 2026-06-02 16:57:26.843 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431890,ok=431890,error=0, records=41
[WARN ] 2026-06-02 16:57:37.724 [18407] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:57:40.538 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:57:41.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 16:57:41.902 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431891,ok=431891,error=0, records=41
[WARN ] 2026-06-02 16:57:52.729 [18410] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:57:55.538 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:57:56.993 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-02 16:57:56.993 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431892,ok=431892,error=0, records=41
[INFO ] 2026-06-02 16:58:04.023 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21606/300s
[INFO ] 2026-06-02 16:58:05.825 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21606/300s
[WARN ] 2026-06-02 16:58:07.734 [18407] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:58:10.539 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:58:11.998 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 16:58:11.998 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431893,ok=431893,error=0, records=41
[INFO ] 2026-06-02 16:58:12.631 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21606/300s
[WARN ] 2026-06-02 16:58:22.738 [18404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:58:25.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:58:27.006 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 16:58:27.006 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431894,ok=431894,error=0, records=41
[WARN ] 2026-06-02 16:58:37.745 [18410] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:58:40.540 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:58:42.013 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 16:58:42.013 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431895,ok=431895,error=0, records=41
[WARN ] 2026-06-02 16:58:52.750 [18299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:58:55.541 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:58:57.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 16:58:57.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431896,ok=431896,error=0, records=41
[WARN ] 2026-06-02 16:59:07.757 [18410] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:59:10.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:59:12.031 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 16:59:12.031 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431897,ok=431897,error=0, records=41
[WARN ] 2026-06-02 16:59:22.762 [18438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:59:25.542 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:59:27.036 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 16:59:27.036 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431898,ok=431898,error=0, records=41
[WARN ] 2026-06-02 16:59:37.766 [18299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:59:38.680 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17991/300s
[INFO ] 2026-06-02 16:59:38.681 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837456},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 16:59:38.850 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 16:59:38.850 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 16:59:38.850 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 16:59:38.850 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 16:59:38.850 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 16:59:38.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:59:38.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 16:59:38.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:59:38.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 16:59:38.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:59:38.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 16:59:40.543 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:59:42.043 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 16:59:42.043 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431899,ok=431899,error=0, records=41
[WARN ] 2026-06-02 16:59:52.771 [18404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 16:59:55.543 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 16:59:57.052 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 16:59:57.052 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431900,ok=431900,error=0, records=41
[INFO ] 2026-06-02 17:00:02.250 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21610/300s
[WARN ] 2026-06-02 17:00:07.776 [18299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:00:10.544 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:00:12.058 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 17:00:12.058 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431901,ok=431901,error=0, records=41
[WARN ] 2026-06-02 17:00:22.781 [18299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:00:25.545 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:00:27.065 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-02 17:00:27.065 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431902,ok=431902,error=0, records=41
[WARN ] 2026-06-02 17:00:37.786 [18410] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:00:39.786 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21601/300s
[INFO ] 2026-06-02 17:00:40.545 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:00:42.072 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 17:00:42.072 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431903,ok=431903,error=0, records=41
[INFO ] 2026-06-02 17:00:43.697 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21610/300s
[WARN ] 2026-06-02 17:00:52.791 [18410] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:00:55.546 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:00:57.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10263, records=41
[INFO ] 2026-06-02 17:00:57.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431904,ok=431904,error=0, records=41
[WARN ] 2026-06-02 17:01:07.795 [18404] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:01:10.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:01:12.087 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 17:01:12.087 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431905,ok=431905,error=0, records=41
[WARN ] 2026-06-02 17:01:22.801 [18438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:01:25.547 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.69MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:01:27.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 17:01:27.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431906,ok=431906,error=0, records=41
[INFO ] 2026-06-02 17:01:27.093 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21597/300s
[WARN ] 2026-06-02 17:01:37.806 [18438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:01:40.548 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:01:42.098 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 17:01:42.098 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431907,ok=431907,error=0, records=41
[WARN ] 2026-06-02 17:01:52.811 [18407] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:01:55.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:01:56.375 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21606/300s
[INFO ] 2026-06-02 17:01:57.107 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 17:01:57.107 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431908,ok=431908,error=0, records=41
[WARN ] 2026-06-02 17:02:07.817 [18438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:02:10.549 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:02:10.549 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21609/300s
[INFO ] 2026-06-02 17:02:12.112 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 17:02:12.113 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431909,ok=431909,error=0, records=41
[INFO ] 2026-06-02 17:02:17.876 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21597/300s
[WARN ] 2026-06-02 17:02:22.823 [18438] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:02:25.550 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:02:27.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 17:02:27.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431910,ok=431910,error=0, records=41
[WARN ] 2026-06-02 17:02:37.828 [19015] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:02:38.852 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837380},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:02:39.007 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:02:39.007 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 17:02:39.007 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:02:39.007 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:02:39.007 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:02:39.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:02:39.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:02:39.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:02:39.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:02:39.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:02:39.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:02:40.551 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:02:42.225 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 17:02:42.225 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431911,ok=431911,error=0, records=41
[WARN ] 2026-06-02 17:02:52.834 [18997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:02:55.551 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:02:57.233 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 17:02:57.233 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431912,ok=431912,error=0, records=41
[INFO ] 2026-06-02 17:03:04.088 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21607/300s
[INFO ] 2026-06-02 17:03:05.889 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21607/300s
[WARN ] 2026-06-02 17:03:07.840 [18973] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:03:10.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:03:12.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 17:03:12.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431913,ok=431913,error=0, records=41
[INFO ] 2026-06-02 17:03:12.695 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21607/300s
[WARN ] 2026-06-02 17:03:22.845 [19065] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:03:25.552 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:03:27.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 17:03:27.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431914,ok=431914,error=0, records=41
[WARN ] 2026-06-02 17:03:37.850 [18997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:03:40.553 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 17:03:40.553 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 17:03:42.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 17:03:42.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431915,ok=431915,error=0, records=41
[WARN ] 2026-06-02 17:03:52.855 [19015] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:03:55.554 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:03:57.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 17:03:57.258 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431916,ok=431916,error=0, records=41
[WARN ] 2026-06-02 17:04:07.861 [18997] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:04:10.554 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:04:12.262 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 17:04:12.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431917,ok=431917,error=0, records=41
[WARN ] 2026-06-02 17:04:22.867 [19123] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:04:25.555 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:04:27.270 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 17:04:27.270 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431918,ok=431918,error=0, records=41
[WARN ] 2026-06-02 17:04:37.871 [18973] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:04:40.555 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:04:42.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 17:04:42.276 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431919,ok=431919,error=0, records=41
[WARN ] 2026-06-02 17:04:52.875 [19123] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:04:55.556 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:04:57.281 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 17:04:57.281 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431920,ok=431920,error=0, records=41
[INFO ] 2026-06-02 17:05:02.253 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21611/300s
[WARN ] 2026-06-02 17:05:07.881 [19167] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:05:10.557 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:05:12.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-02 17:05:12.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431921,ok=431921,error=0, records=41
[WARN ] 2026-06-02 17:05:22.886 [19168] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:05:25.557 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:05:27.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 17:05:27.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431922,ok=431922,error=0, records=41
[WARN ] 2026-06-02 17:05:37.893 [19151] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:05:39.007 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17992/300s
[INFO ] 2026-06-02 17:05:39.009 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837312},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:05:39.168 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:05:39.168 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 17:05:39.168 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:05:39.168 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:05:39.168 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:05:39.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:05:39.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:05:39.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:05:39.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:05:39.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:05:39.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:05:39.893 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21602/300s
[INFO ] 2026-06-02 17:05:40.558 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:05:42.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 17:05:42.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431923,ok=431923,error=0, records=41
[INFO ] 2026-06-02 17:05:43.704 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21611/300s
[WARN ] 2026-06-02 17:05:52.898 [19195] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:05:55.559 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:05:57.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 17:05:57.314 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431924,ok=431924,error=0, records=41
[WARN ] 2026-06-02 17:06:07.906 [19151] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:06:10.559 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:06:12.319 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 17:06:12.319 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431925,ok=431925,error=0, records=41
[WARN ] 2026-06-02 17:06:22.911 [19252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:06:25.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:06:27.328 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 17:06:27.328 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431926,ok=431926,error=0, records=41
[INFO ] 2026-06-02 17:06:27.328 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21598/300s
[WARN ] 2026-06-02 17:06:37.917 [19151] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:06:40.560 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:06:42.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 17:06:42.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431927,ok=431927,error=0, records=41
[WARN ] 2026-06-02 17:06:52.922 [19282] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:06:55.561 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:06:56.428 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21607/300s
[INFO ] 2026-06-02 17:06:57.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 17:06:57.338 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431928,ok=431928,error=0, records=41
[WARN ] 2026-06-02 17:07:07.928 [19242] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:07:10.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:07:10.562 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21610/300s
[INFO ] 2026-06-02 17:07:12.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 17:07:12.344 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431929,ok=431929,error=0, records=41
[INFO ] 2026-06-02 17:07:18.058 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21598/300s
[WARN ] 2026-06-02 17:07:22.933 [19283] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:07:25.562 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:07:27.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 17:07:27.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431930,ok=431930,error=0, records=41
[WARN ] 2026-06-02 17:07:37.938 [19316] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:07:40.563 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:07:42.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-02 17:07:42.355 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431931,ok=431931,error=0, records=41
[WARN ] 2026-06-02 17:07:52.944 [19356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:07:55.563 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:07:57.362 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 17:07:57.362 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431932,ok=431932,error=0, records=41
[INFO ] 2026-06-02 17:08:04.142 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21608/300s
[INFO ] 2026-06-02 17:08:05.944 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21608/300s
[WARN ] 2026-06-02 17:08:07.948 [19373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:08:10.564 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:08:12.369 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 17:08:12.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431933,ok=431933,error=0, records=41
[INFO ] 2026-06-02 17:08:12.750 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21608/300s
[WARN ] 2026-06-02 17:08:22.952 [19339] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:08:25.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:08:27.374 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 17:08:27.374 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431934,ok=431934,error=0, records=41
[WARN ] 2026-06-02 17:08:37.958 [19373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:08:39.170 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837248},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:08:39.313 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:08:39.313 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"HTTP":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 17:08:39.313 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:08:39.313 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:08:39.313 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:08:39.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:08:39.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:08:39.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:08:39.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:08:39.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:08:39.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:08:40.565 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:08:42.380 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 17:08:42.380 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431935,ok=431935,error=0, records=41
[WARN ] 2026-06-02 17:08:52.963 [19350] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:08:55.566 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:08:55.566 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 17:08:57.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 17:08:57.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431936,ok=431936,error=0, records=41
[WARN ] 2026-06-02 17:09:07.969 [19397] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:09:10.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:09:12.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 17:09:12.390 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431937,ok=431937,error=0, records=41
[WARN ] 2026-06-02 17:09:22.974 [19426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:09:25.568 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:09:27.395 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 17:09:27.395 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431938,ok=431938,error=0, records=41
[WARN ] 2026-06-02 17:09:37.978 [19383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:09:40.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:09:42.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 17:09:42.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431939,ok=431939,error=0, records=41
[WARN ] 2026-06-02 17:09:52.984 [19373] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:09:55.569 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:09:57.442 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 17:09:57.442 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431940,ok=431940,error=0, records=41
[INFO ] 2026-06-02 17:10:02.257 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21612/300s
[WARN ] 2026-06-02 17:10:07.989 [19486] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:10:10.570 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:10:12.447 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 17:10:12.447 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431941,ok=431941,error=0, records=41
[WARN ] 2026-06-02 17:10:22.994 [19500] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:10:25.571 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:10:27.453 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 17:10:27.453 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431942,ok=431942,error=0, records=41
[WARN ] 2026-06-02 17:10:37.999 [19397] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:10:39.999 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21603/300s
[INFO ] 2026-06-02 17:10:40.571 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.20MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:10:42.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 17:10:42.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431943,ok=431943,error=0, records=41
[INFO ] 2026-06-02 17:10:43.711 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21612/300s
[WARN ] 2026-06-02 17:10:53.004 [19500] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:10:55.572 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:10:57.463 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 17:10:57.464 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431944,ok=431944,error=0, records=41
[WARN ] 2026-06-02 17:11:08.009 [19528] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:11:10.572 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:11:12.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 17:11:12.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431945,ok=431945,error=0, records=41
[WARN ] 2026-06-02 17:11:23.014 [19397] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:11:25.573 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.21MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:11:27.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 17:11:27.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431946,ok=431946,error=0, records=41
[INFO ] 2026-06-02 17:11:27.476 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21599/300s
[WARN ] 2026-06-02 17:11:38.019 [19453] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:11:39.313 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17993/300s
[INFO ] 2026-06-02 17:11:39.315 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837176},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:11:39.463 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:11:39.463 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 17:11:39.463 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:11:39.463 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:11:39.463 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:11:39.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:11:39.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:11:39.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:11:39.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:11:39.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:11:39.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:11:40.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:11:42.483 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 17:11:42.483 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431947,ok=431947,error=0, records=41
[WARN ] 2026-06-02 17:11:53.025 [19569] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:11:55.574 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:11:56.485 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21608/300s
[INFO ] 2026-06-02 17:11:57.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 17:11:57.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431948,ok=431948,error=0, records=41
[WARN ] 2026-06-02 17:12:08.030 [19598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:12:10.575 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:12:10.575 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21611/300s
[INFO ] 2026-06-02 17:12:12.494 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 17:12:12.495 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431949,ok=431949,error=0, records=41
[INFO ] 2026-06-02 17:12:18.241 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21599/300s
[WARN ] 2026-06-02 17:12:23.037 [19584] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:12:25.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:12:27.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 17:12:27.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431950,ok=431950,error=0, records=41
[WARN ] 2026-06-02 17:12:38.043 [19623] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:12:40.576 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:12:42.505 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 17:12:42.505 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431951,ok=431951,error=0, records=41
[WARN ] 2026-06-02 17:12:53.048 [19638] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:12:55.577 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:12:57.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 17:12:57.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431952,ok=431952,error=0, records=41
[INFO ] 2026-06-02 17:13:04.209 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21609/300s
[INFO ] 2026-06-02 17:13:06.010 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21609/300s
[WARN ] 2026-06-02 17:13:08.053 [19584] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:13:10.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:13:12.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 17:13:12.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431953,ok=431953,error=0, records=41
[INFO ] 2026-06-02 17:13:12.817 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21609/300s
[WARN ] 2026-06-02 17:13:22.557 [19665] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:13:25.578 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:13:27.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 17:13:27.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431954,ok=431954,error=0, records=41
[WARN ] 2026-06-02 17:13:37.562 [19675] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:13:40.579 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 17:13:40.579 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 17:13:42.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 17:13:42.527 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431955,ok=431955,error=0, records=41
[WARN ] 2026-06-02 17:13:52.569 [19643] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:13:55.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:13:57.532 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 17:13:57.532 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431956,ok=431956,error=0, records=41
[WARN ] 2026-06-02 17:14:07.575 [19729] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:14:10.580 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:14:12.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 17:14:12.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431957,ok=431957,error=0, records=41
[WARN ] 2026-06-02 17:14:22.581 [19715] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:14:25.581 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:14:27.546 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 17:14:27.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431958,ok=431958,error=0, records=41
[WARN ] 2026-06-02 17:14:37.586 [19732] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:14:39.465 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837108},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:14:39.647 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:14:39.647 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 17:14:39.647 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:14:39.647 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:14:39.647 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:14:39.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:14:39.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:14:39.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:14:39.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:14:39.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:14:39.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:14:40.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:14:42.552 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 17:14:42.552 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431959,ok=431959,error=0, records=41
[WARN ] 2026-06-02 17:14:52.592 [19760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:14:55.582 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:14:57.559 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 17:14:57.559 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431960,ok=431960,error=0, records=41
[INFO ] 2026-06-02 17:15:02.260 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21613/300s
[WARN ] 2026-06-02 17:15:07.597 [19778] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:15:10.583 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:15:12.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 17:15:12.565 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431961,ok=431961,error=0, records=41
[WARN ] 2026-06-02 17:15:22.603 [19783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:15:25.584 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:15:27.570 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 17:15:27.570 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431962,ok=431962,error=0, records=41
[WARN ] 2026-06-02 17:15:37.607 [19783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:15:40.108 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21604/300s
[INFO ] 2026-06-02 17:15:40.584 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:15:42.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 17:15:42.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431963,ok=431963,error=0, records=41
[INFO ] 2026-06-02 17:15:43.718 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21613/300s
[WARN ] 2026-06-02 17:15:52.613 [19793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:15:55.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:15:57.580 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 17:15:57.580 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431964,ok=431964,error=0, records=41
[WARN ] 2026-06-02 17:16:07.619 [19798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:16:10.585 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:16:12.585 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 17:16:12.585 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431965,ok=431965,error=0, records=41
[WARN ] 2026-06-02 17:16:22.623 [19793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:16:25.586 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:16:27.591 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 17:16:27.591 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431966,ok=431966,error=0, records=41
[INFO ] 2026-06-02 17:16:27.591 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21600/300s
[WARN ] 2026-06-02 17:16:37.629 [19777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:16:40.587 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:16:42.597 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 17:16:42.597 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431967,ok=431967,error=0, records=41
[WARN ] 2026-06-02 17:16:52.634 [19777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:16:55.587 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:16:56.544 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21609/300s
[INFO ] 2026-06-02 17:16:57.602 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 17:16:57.602 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431968,ok=431968,error=0, records=41
[WARN ] 2026-06-02 17:17:07.639 [19783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:17:10.588 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:17:10.588 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21612/300s
[INFO ] 2026-06-02 17:17:12.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 17:17:12.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431969,ok=431969,error=0, records=41
[INFO ] 2026-06-02 17:17:18.427 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21600/300s
[WARN ] 2026-06-02 17:17:22.644 [19777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:17:25.589 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:17:27.614 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 17:17:27.614 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431970,ok=431970,error=0, records=41
[WARN ] 2026-06-02 17:17:37.650 [19777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:17:39.648 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17994/300s
[INFO ] 2026-06-02 17:17:39.649 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20837036},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:17:39.820 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:17:39.820 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 17:17:39.821 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:17:39.821 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:17:39.821 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:17:39.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:17:39.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:17:39.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:17:39.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:17:39.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:17:39.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:17:40.589 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:17:42.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 17:17:42.622 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431971,ok=431971,error=0, records=41
[WARN ] 2026-06-02 17:17:52.654 [19798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:17:55.590 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:17:57.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 17:17:57.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431972,ok=431972,error=0, records=41
[INFO ] 2026-06-02 17:18:04.277 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21610/300s
[INFO ] 2026-06-02 17:18:06.079 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21610/300s
[WARN ] 2026-06-02 17:18:07.661 [19766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:18:10.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:18:12.633 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 17:18:12.633 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431973,ok=431973,error=0, records=41
[INFO ] 2026-06-02 17:18:12.886 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21610/300s
[WARN ] 2026-06-02 17:18:22.665 [19798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:18:25.591 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:18:27.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 17:18:27.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431974,ok=431974,error=0, records=41
[WARN ] 2026-06-02 17:18:37.671 [19783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:18:40.592 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:18:42.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 17:18:42.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431975,ok=431975,error=0, records=41
[WARN ] 2026-06-02 17:18:52.676 [19777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:18:55.593 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:18:57.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 17:18:57.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431976,ok=431976,error=0, records=41
[WARN ] 2026-06-02 17:19:07.681 [19766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:19:10.593 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:19:12.654 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 17:19:12.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431977,ok=431977,error=0, records=41
[WARN ] 2026-06-02 17:19:22.686 [19766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:19:25.594 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:19:27.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 17:19:27.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431978,ok=431978,error=0, records=41
[WARN ] 2026-06-02 17:19:37.691 [19798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:19:40.594 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:19:42.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 17:19:42.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431979,ok=431979,error=0, records=41
[WARN ] 2026-06-02 17:19:52.696 [19783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:19:55.595 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:19:57.675 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 17:19:57.675 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431980,ok=431980,error=0, records=41
[INFO ] 2026-06-02 17:20:02.263 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21614/300s
[WARN ] 2026-06-02 17:20:07.701 [19777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:20:10.596 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:20:12.681 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 17:20:12.681 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431981,ok=431981,error=0, records=41
[WARN ] 2026-06-02 17:20:22.705 [19798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:20:25.596 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:20:27.688 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-02 17:20:27.689 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431982,ok=431982,error=0, records=41
[WARN ] 2026-06-02 17:20:37.710 [19783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:20:39.822 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20836928},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:20:40.002 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:20:40.002 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 17:20:40.002 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:20:40.002 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:20:40.002 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:20:40.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:20:40.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:20:40.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:20:40.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:20:40.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:20:40.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:20:40.211 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21605/300s
[INFO ] 2026-06-02 17:20:40.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:20:42.697 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 17:20:42.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431983,ok=431983,error=0, records=41
[INFO ] 2026-06-02 17:20:43.725 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21614/300s
[WARN ] 2026-06-02 17:20:52.716 [19766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:20:55.597 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:20:57.705 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-02 17:20:57.705 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431984,ok=431984,error=0, records=41
[WARN ] 2026-06-02 17:21:07.721 [19783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:21:10.598 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:21:12.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 17:21:12.711 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431985,ok=431985,error=0, records=41
[WARN ] 2026-06-02 17:21:22.725 [19777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:21:25.599 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:21:27.717 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 17:21:27.717 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431986,ok=431986,error=0, records=41
[INFO ] 2026-06-02 17:21:27.718 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21601/300s
[WARN ] 2026-06-02 17:21:37.730 [19766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:21:40.599 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:21:42.724 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 17:21:42.724 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431987,ok=431987,error=0, records=41
[WARN ] 2026-06-02 17:21:52.734 [19798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:21:55.600 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:21:56.598 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21610/300s
[INFO ] 2026-06-02 17:21:57.729 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 17:21:57.729 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431988,ok=431988,error=0, records=41
[WARN ] 2026-06-02 17:22:07.740 [19783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:22:10.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:22:10.601 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21613/300s
[INFO ] 2026-06-02 17:22:12.735 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 17:22:12.735 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431989,ok=431989,error=0, records=41
[INFO ] 2026-06-02 17:22:18.610 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21601/300s
[WARN ] 2026-06-02 17:22:22.746 [19793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:22:25.601 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:22:27.741 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 17:22:27.741 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431990,ok=431990,error=0, records=41
[WARN ] 2026-06-02 17:22:37.751 [19798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:22:40.602 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:22:42.750 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 17:22:42.750 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431991,ok=431991,error=0, records=41
[WARN ] 2026-06-02 17:22:52.756 [19783] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:22:55.602 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:22:57.755 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 17:22:57.755 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431992,ok=431992,error=0, records=41
[INFO ] 2026-06-02 17:23:04.344 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21611/300s
[INFO ] 2026-06-02 17:23:06.146 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21611/300s
[WARN ] 2026-06-02 17:23:07.761 [19793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:23:10.603 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:23:12.760 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 17:23:12.760 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431993,ok=431993,error=0, records=41
[INFO ] 2026-06-02 17:23:12.950 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21611/300s
[WARN ] 2026-06-02 17:23:22.768 [19766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:23:25.603 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:23:27.765 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 17:23:27.766 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431994,ok=431994,error=0, records=41
[WARN ] 2026-06-02 17:23:37.773 [19793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:23:40.002 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17995/300s
[INFO ] 2026-06-02 17:23:40.004 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20836820},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:23:40.173 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:23:40.173 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 17:23:40.174 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:23:40.174 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:23:40.174 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:23:40.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:23:40.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:23:40.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:23:40.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:23:40.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:23:40.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:23:40.604 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 17:23:40.604 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 17:23:42.770 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 17:23:42.770 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431995,ok=431995,error=0, records=41
[WARN ] 2026-06-02 17:23:52.777 [19777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:23:55.604 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:23:55.605 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 17:23:57.776 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 17:23:57.776 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431996,ok=431996,error=0, records=41
[WARN ] 2026-06-02 17:24:07.783 [19793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:24:10.606 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:24:12.794 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10365, records=41
[INFO ] 2026-06-02 17:24:12.795 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431997,ok=431997,error=0, records=41
[WARN ] 2026-06-02 17:24:22.790 [19777] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:24:25.607 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:24:27.802 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10367, records=41
[INFO ] 2026-06-02 17:24:27.802 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431998,ok=431998,error=0, records=41
[WARN ] 2026-06-02 17:24:37.797 [19798] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:24:40.607 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:24:42.817 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10356, records=41
[INFO ] 2026-06-02 17:24:42.817 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=431999,ok=431999,error=0, records=41
[WARN ] 2026-06-02 17:24:52.801 [19766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:24:55.608 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:24:57.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-02 17:24:57.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432000,ok=432000,error=0, records=41
[INFO ] 2026-06-02 17:25:02.267 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21615/300s
[WARN ] 2026-06-02 17:25:07.807 [19793] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:25:10.608 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:25:12.830 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 17:25:12.830 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432001,ok=432001,error=0, records=41
[WARN ] 2026-06-02 17:25:22.811 [20341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:25:25.609 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:25:27.837 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 17:25:27.837 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432002,ok=432002,error=0, records=41
[WARN ] 2026-06-02 17:25:37.818 [20346] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:25:40.318 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21606/300s
[INFO ] 2026-06-02 17:25:40.610 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:25:42.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 17:25:42.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432003,ok=432003,error=0, records=41
[INFO ] 2026-06-02 17:25:43.731 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21615/300s
[WARN ] 2026-06-02 17:25:52.823 [20346] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:25:55.610 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:25:57.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 17:25:57.850 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432004,ok=432004,error=0, records=41
[WARN ] 2026-06-02 17:26:07.828 [19766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:26:10.611 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:26:12.859 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 17:26:12.859 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432005,ok=432005,error=0, records=41
[WARN ] 2026-06-02 17:26:22.834 [20356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:26:25.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.44MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:26:27.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 17:26:27.868 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432006,ok=432006,error=0, records=41
[INFO ] 2026-06-02 17:26:27.868 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21602/300s
[WARN ] 2026-06-02 17:26:37.841 [20331] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:26:40.175 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20836748},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:26:40.355 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:26:40.356 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 17:26:40.356 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:26:40.356 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:26:40.356 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:26:40.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:26:40.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:26:40.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:26:40.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:26:40.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:26:40.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:26:40.612 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:26:42.874 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 17:26:42.874 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432007,ok=432007,error=0, records=41
[WARN ] 2026-06-02 17:26:52.846 [20341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:26:55.613 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:26:56.652 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21611/300s
[INFO ] 2026-06-02 17:26:57.879 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 17:26:57.879 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432008,ok=432008,error=0, records=41
[WARN ] 2026-06-02 17:27:07.851 [20341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:27:10.613 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:27:10.614 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21614/300s
[INFO ] 2026-06-02 17:27:12.969 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 17:27:12.969 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432009,ok=432009,error=0, records=41
[INFO ] 2026-06-02 17:27:18.792 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21602/300s
[WARN ] 2026-06-02 17:27:22.856 [20425] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:27:25.614 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:27:27.975 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-02 17:27:27.975 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432010,ok=432010,error=0, records=41
[WARN ] 2026-06-02 17:27:37.862 [20425] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:27:40.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:27:42.981 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 17:27:42.981 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432011,ok=432011,error=0, records=41
[WARN ] 2026-06-02 17:27:52.868 [19766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:27:55.615 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.38MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:27:57.986 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-02 17:27:57.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432012,ok=432012,error=0, records=41
[INFO ] 2026-06-02 17:28:04.395 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21612/300s
[INFO ] 2026-06-02 17:28:06.197 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21612/300s
[WARN ] 2026-06-02 17:28:07.872 [20341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:28:10.616 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:28:12.990 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 17:28:12.990 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432013,ok=432013,error=0, records=41
[INFO ] 2026-06-02 17:28:13.003 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21612/300s
[WARN ] 2026-06-02 17:28:22.878 [20508] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:28:25.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:28:28.050 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 17:28:28.050 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432014,ok=432014,error=0, records=41
[WARN ] 2026-06-02 17:28:37.882 [20525] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:28:40.617 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:28:43.056 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 17:28:43.056 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432015,ok=432015,error=0, records=41
[WARN ] 2026-06-02 17:28:52.888 [20548] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:28:55.618 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:28:58.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 17:28:58.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432016,ok=432016,error=0, records=41
[WARN ] 2026-06-02 17:29:07.893 [20543] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:29:10.619 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:29:13.070 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 17:29:13.070 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432017,ok=432017,error=0, records=41
[WARN ] 2026-06-02 17:29:22.898 [20542] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:29:25.619 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:29:28.075 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 17:29:28.075 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432018,ok=432018,error=0, records=41
[WARN ] 2026-06-02 17:29:37.903 [20543] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:29:40.356 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17996/300s
[INFO ] 2026-06-02 17:29:40.357 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20836672},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:29:40.537 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:29:40.537 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 17:29:40.537 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:29:40.537 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:29:40.537 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:29:40.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:29:40.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:29:40.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:29:40.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:29:40.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:29:40.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:29:40.620 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:29:43.081 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 17:29:43.081 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432019,ok=432019,error=0, records=41
[WARN ] 2026-06-02 17:29:52.909 [20613] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:29:55.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:29:58.086 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 17:29:58.086 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432020,ok=432020,error=0, records=41
[INFO ] 2026-06-02 17:30:02.270 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21616/300s
[WARN ] 2026-06-02 17:30:07.914 [20613] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:30:10.621 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:30:13.091 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 17:30:13.091 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432021,ok=432021,error=0, records=41
[WARN ] 2026-06-02 17:30:22.929 [20613] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:30:25.622 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:30:28.096 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 17:30:28.096 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432022,ok=432022,error=0, records=41
[WARN ] 2026-06-02 17:30:37.935 [20665] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:30:40.436 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21607/300s
[INFO ] 2026-06-02 17:30:40.623 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:30:43.101 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10289, records=41
[INFO ] 2026-06-02 17:30:43.101 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432023,ok=432023,error=0, records=41
[INFO ] 2026-06-02 17:30:43.738 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21616/300s
[WARN ] 2026-06-02 17:30:52.941 [20601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:30:55.623 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:30:58.176 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 17:30:58.176 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432024,ok=432024,error=0, records=41
[WARN ] 2026-06-02 17:31:07.947 [20628] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:31:10.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:31:13.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 17:31:13.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432025,ok=432025,error=0, records=41
[WARN ] 2026-06-02 17:31:22.953 [20700] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:31:25.624 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:31:28.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 17:31:28.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432026,ok=432026,error=0, records=41
[INFO ] 2026-06-02 17:31:28.187 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21603/300s
[WARN ] 2026-06-02 17:31:37.959 [20601] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:31:40.625 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:31:43.195 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10147, records=41
[INFO ] 2026-06-02 17:31:43.195 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432027,ok=432027,error=0, records=41
[WARN ] 2026-06-02 17:31:52.964 [20716] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:31:55.626 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:31:56.704 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21612/300s
[INFO ] 2026-06-02 17:31:58.202 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-02 17:31:58.202 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432028,ok=432028,error=0, records=41
[WARN ] 2026-06-02 17:32:07.970 [20628] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:32:10.626 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:32:10.626 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21615/300s
[INFO ] 2026-06-02 17:32:13.207 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 17:32:13.207 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432029,ok=432029,error=0, records=41
[INFO ] 2026-06-02 17:32:18.972 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21603/300s
[WARN ] 2026-06-02 17:32:22.974 [20716] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:32:25.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:32:28.212 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 17:32:28.212 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432030,ok=432030,error=0, records=41
[WARN ] 2026-06-02 17:32:37.979 [20684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:32:40.539 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20836348},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:32:40.627 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.86MB[>=200.00MB 0/4], openFiles=13[>=300 0/4]
[INFO ] 2026-06-02 17:32:40.692 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:32:40.692 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 17:32:40.692 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:32:40.692 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:32:40.692 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:32:40.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:32:40.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:32:40.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:32:40.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:32:40.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:32:40.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:32:43.220 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 17:32:43.220 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432031,ok=432031,error=0, records=41
[WARN ] 2026-06-02 17:32:52.985 [20684] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:32:55.628 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:32:58.225 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 17:32:58.225 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432032,ok=432032,error=0, records=41
[INFO ] 2026-06-02 17:33:04.448 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21613/300s
[INFO ] 2026-06-02 17:33:06.250 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21613/300s
[WARN ] 2026-06-02 17:33:07.990 [20786] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:33:10.629 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:33:13.055 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21613/300s
[INFO ] 2026-06-02 17:33:13.230 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10337, records=41
[INFO ] 2026-06-02 17:33:13.230 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432033,ok=432033,error=0, records=41
[WARN ] 2026-06-02 17:33:22.994 [20786] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:33:25.629 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:33:28.234 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10331, records=41
[INFO ] 2026-06-02 17:33:28.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432034,ok=432034,error=0, records=41
[WARN ] 2026-06-02 17:33:38.000 [20801] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:33:40.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 17:33:40.630 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 17:33:43.242 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10332, records=41
[INFO ] 2026-06-02 17:33:43.242 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432035,ok=432035,error=0, records=41
[WARN ] 2026-06-02 17:33:53.005 [20842] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:33:55.630 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:33:58.248 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10328, records=41
[INFO ] 2026-06-02 17:33:58.248 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432036,ok=432036,error=0, records=41
[WARN ] 2026-06-02 17:34:08.009 [20801] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:34:10.631 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:34:13.253 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 17:34:13.253 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432037,ok=432037,error=0, records=41
[WARN ] 2026-06-02 17:34:23.014 [20856] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:34:25.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:34:28.259 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 17:34:28.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432038,ok=432038,error=0, records=41
[WARN ] 2026-06-02 17:34:38.019 [20889] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:34:40.632 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:34:43.275 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 17:34:43.275 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432039,ok=432039,error=0, records=41
[WARN ] 2026-06-02 17:34:53.023 [20889] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:34:55.633 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:34:58.280 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-02 17:34:58.280 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432040,ok=432040,error=0, records=41
[INFO ] 2026-06-02 17:35:02.273 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21617/300s
[WARN ] 2026-06-02 17:35:08.028 [20842] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:35:10.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:35:13.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10168, records=41
[INFO ] 2026-06-02 17:35:13.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432041,ok=432041,error=0, records=41
[WARN ] 2026-06-02 17:35:23.033 [20842] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:35:25.634 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=31.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:35:28.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-02 17:35:28.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432042,ok=432042,error=0, records=41
[WARN ] 2026-06-02 17:35:38.039 [20961] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:35:40.540 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21608/300s
[INFO ] 2026-06-02 17:35:40.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:35:40.693 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17997/300s
[INFO ] 2026-06-02 17:35:40.694 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20836268},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:35:40.851 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:35:40.852 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 17:35:40.852 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:35:40.852 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:35:40.852 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:35:40.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:35:40.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:35:40.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:35:40.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:35:40.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:35:40.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:35:43.297 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10166, records=41
[INFO ] 2026-06-02 17:35:43.297 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432043,ok=432043,error=0, records=41
[INFO ] 2026-06-02 17:35:43.744 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21617/300s
[WARN ] 2026-06-02 17:35:53.046 [20962] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:35:55.635 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:35:58.302 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 17:35:58.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432044,ok=432044,error=0, records=41
[WARN ] 2026-06-02 17:36:08.050 [20955] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:36:10.636 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:36:13.308 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 17:36:13.308 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432045,ok=432045,error=0, records=41
[WARN ] 2026-06-02 17:36:22.557 [20995] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:36:25.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:36:28.314 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 17:36:28.314 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432046,ok=432046,error=0, records=41
[INFO ] 2026-06-02 17:36:28.314 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21604/300s
[WARN ] 2026-06-02 17:36:37.561 [21029] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:36:40.637 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:36:43.320 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 17:36:43.320 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432047,ok=432047,error=0, records=41
[WARN ] 2026-06-02 17:36:52.565 [21061] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:36:55.638 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:36:56.761 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21613/300s
[INFO ] 2026-06-02 17:36:58.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 17:36:58.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432048,ok=432048,error=0, records=41
[WARN ] 2026-06-02 17:37:07.570 [21074] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:37:10.638 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:37:10.638 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21616/300s
[INFO ] 2026-06-02 17:37:13.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 17:37:13.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432049,ok=432049,error=0, records=41
[INFO ] 2026-06-02 17:37:19.148 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21604/300s
[WARN ] 2026-06-02 17:37:22.575 [21097] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:37:25.639 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.11MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:37:28.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10141, records=41
[INFO ] 2026-06-02 17:37:28.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432050,ok=432050,error=0, records=41
[WARN ] 2026-06-02 17:37:37.581 [21114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:37:40.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:37:43.340 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10154, records=41
[INFO ] 2026-06-02 17:37:43.340 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432051,ok=432051,error=0, records=41
[WARN ] 2026-06-02 17:37:52.586 [21115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:37:55.640 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:37:58.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 17:37:58.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432052,ok=432052,error=0, records=41
[INFO ] 2026-06-02 17:38:04.496 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21614/300s
[INFO ] 2026-06-02 17:38:06.297 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21614/300s
[WARN ] 2026-06-02 17:38:07.591 [21127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:38:10.641 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:38:13.103 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21614/300s
[INFO ] 2026-06-02 17:38:13.355 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 17:38:13.356 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432053,ok=432053,error=0, records=41
[WARN ] 2026-06-02 17:38:22.596 [21143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:38:25.642 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:38:28.361 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 17:38:28.361 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432054,ok=432054,error=0, records=41
[WARN ] 2026-06-02 17:38:37.601 [21158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:38:40.642 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:38:40.854 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20836196},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:38:41.047 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:38:41.047 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 17:38:41.048 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:38:41.048 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:38:41.048 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:38:41.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:38:41.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:38:41.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:38:41.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:38:41.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:38:41.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:38:43.367 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 17:38:43.367 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432055,ok=432055,error=0, records=41
[WARN ] 2026-06-02 17:38:52.607 [21127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:38:55.643 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=31.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:38:55.643 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 17:38:58.373 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 17:38:58.373 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432056,ok=432056,error=0, records=41
[WARN ] 2026-06-02 17:39:07.611 [21127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:39:10.644 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:39:13.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 17:39:13.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432057,ok=432057,error=0, records=41
[WARN ] 2026-06-02 17:39:22.618 [21102] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:39:25.645 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:39:28.383 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 17:39:28.384 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432058,ok=432058,error=0, records=41
[WARN ] 2026-06-02 17:39:37.623 [21143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:39:40.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:39:43.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 17:39:43.390 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432059,ok=432059,error=0, records=41
[WARN ] 2026-06-02 17:39:52.628 [21163] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:39:55.646 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:39:58.396 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 17:39:58.396 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432060,ok=432060,error=0, records=41
[INFO ] 2026-06-02 17:40:02.276 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21618/300s
[WARN ] 2026-06-02 17:40:07.632 [21163] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:40:10.647 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:40:13.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 17:40:13.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432061,ok=432061,error=0, records=41
[WARN ] 2026-06-02 17:40:22.638 [21158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:40:25.647 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:40:28.410 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 17:40:28.410 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432062,ok=432062,error=0, records=41
[WARN ] 2026-06-02 17:40:37.643 [21158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:40:40.643 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21609/300s
[INFO ] 2026-06-02 17:40:40.648 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:40:43.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 17:40:43.417 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432063,ok=432063,error=0, records=41
[INFO ] 2026-06-02 17:40:43.751 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21618/300s
[WARN ] 2026-06-02 17:40:52.647 [21102] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:40:55.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:40:58.422 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 17:40:58.422 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432064,ok=432064,error=0, records=41
[WARN ] 2026-06-02 17:41:07.652 [21102] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:41:10.649 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:41:13.427 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 17:41:13.427 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432065,ok=432065,error=0, records=41
[WARN ] 2026-06-02 17:41:22.657 [21102] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:41:25.650 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:41:28.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 17:41:28.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432066,ok=432066,error=0, records=41
[INFO ] 2026-06-02 17:41:28.432 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21605/300s
[WARN ] 2026-06-02 17:41:37.662 [21163] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:41:40.650 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:41:41.048 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17998/300s
[INFO ] 2026-06-02 17:41:41.049 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20836124},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:41:41.227 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:41:41.227 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 17:41:41.227 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:41:41.227 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:41:41.227 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:41:41.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:41:41.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:41:41.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:41:41.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:41:41.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:41:41.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:41:43.439 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 17:41:43.439 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432067,ok=432067,error=0, records=41
[WARN ] 2026-06-02 17:41:52.666 [21163] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:41:55.651 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:41:56.817 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21614/300s
[INFO ] 2026-06-02 17:41:58.445 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 17:41:58.445 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432068,ok=432068,error=0, records=41
[WARN ] 2026-06-02 17:42:07.672 [21102] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:42:10.652 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:42:10.652 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21617/300s
[INFO ] 2026-06-02 17:42:13.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10380, records=41
[INFO ] 2026-06-02 17:42:13.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432069,ok=432069,error=0, records=41
[INFO ] 2026-06-02 17:42:19.333 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21605/300s
[WARN ] 2026-06-02 17:42:22.677 [21143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:42:25.652 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:42:28.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-02 17:42:28.456 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432070,ok=432070,error=0, records=41
[WARN ] 2026-06-02 17:42:37.683 [21143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:42:40.653 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:42:43.461 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-02 17:42:43.462 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432071,ok=432071,error=0, records=41
[WARN ] 2026-06-02 17:42:52.689 [21163] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:42:55.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:42:58.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-02 17:42:58.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432072,ok=432072,error=0, records=41
[INFO ] 2026-06-02 17:43:04.564 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21615/300s
[INFO ] 2026-06-02 17:43:06.365 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21615/300s
[WARN ] 2026-06-02 17:43:07.694 [21158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:43:10.654 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:43:13.170 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21615/300s
[INFO ] 2026-06-02 17:43:13.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 17:43:13.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432073,ok=432073,error=0, records=41
[WARN ] 2026-06-02 17:43:22.700 [21127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:43:25.655 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:43:28.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 17:43:28.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432074,ok=432074,error=0, records=41
[WARN ] 2026-06-02 17:43:37.706 [21143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:43:40.656 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 17:43:40.656 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 17:43:43.489 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 17:43:43.489 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432075,ok=432075,error=0, records=41
[WARN ] 2026-06-02 17:43:52.712 [21163] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:43:55.657 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:43:58.494 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 17:43:58.494 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432076,ok=432076,error=0, records=41
[WARN ] 2026-06-02 17:44:07.717 [21143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:44:10.657 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:44:13.498 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 17:44:13.498 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432077,ok=432077,error=0, records=41
[WARN ] 2026-06-02 17:44:22.722 [21127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:44:25.658 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:44:28.504 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 17:44:28.504 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432078,ok=432078,error=0, records=41
[WARN ] 2026-06-02 17:44:37.728 [21143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:44:40.659 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:44:41.229 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20836048},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:44:41.389 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:44:41.389 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 17:44:41.389 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:44:41.389 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:44:41.389 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:44:41.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:44:41.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:44:41.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:44:41.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:44:41.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:44:41.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:44:43.509 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 17:44:43.509 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432079,ok=432079,error=0, records=41
[WARN ] 2026-06-02 17:44:52.733 [21127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:44:55.659 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:44:58.514 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 17:44:58.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432080,ok=432080,error=0, records=41
[INFO ] 2026-06-02 17:45:02.279 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21619/300s
[WARN ] 2026-06-02 17:45:07.738 [21158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:45:10.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:45:13.519 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-02 17:45:13.519 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432081,ok=432081,error=0, records=41
[WARN ] 2026-06-02 17:45:22.743 [21158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:45:25.660 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:45:28.525 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-02 17:45:28.525 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432082,ok=432082,error=0, records=41
[WARN ] 2026-06-02 17:45:37.749 [21143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:45:40.661 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:45:40.749 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21610/300s
[INFO ] 2026-06-02 17:45:43.530 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-06-02 17:45:43.530 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432083,ok=432083,error=0, records=41
[INFO ] 2026-06-02 17:45:43.757 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21619/300s
[WARN ] 2026-06-02 17:45:52.754 [21143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:45:55.662 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:45:58.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10285, records=41
[INFO ] 2026-06-02 17:45:58.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432084,ok=432084,error=0, records=41
[WARN ] 2026-06-02 17:46:07.760 [21102] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:46:10.662 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:46:13.541 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 17:46:13.541 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432085,ok=432085,error=0, records=41
[WARN ] 2026-06-02 17:46:22.764 [21127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:46:25.663 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:46:28.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 17:46:28.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432086,ok=432086,error=0, records=41
[INFO ] 2026-06-02 17:46:28.549 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21606/300s
[WARN ] 2026-06-02 17:46:37.770 [21158] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:46:40.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:46:43.554 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 17:46:43.554 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432087,ok=432087,error=0, records=41
[WARN ] 2026-06-02 17:46:52.774 [21127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:46:55.664 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:46:56.873 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21615/300s
[INFO ] 2026-06-02 17:46:58.558 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 17:46:58.558 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432088,ok=432088,error=0, records=41
[WARN ] 2026-06-02 17:47:07.779 [21163] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:47:10.665 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:47:10.665 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21618/300s
[INFO ] 2026-06-02 17:47:13.564 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10280, records=41
[INFO ] 2026-06-02 17:47:13.564 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432089,ok=432089,error=0, records=41
[INFO ] 2026-06-02 17:47:19.518 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21606/300s
[WARN ] 2026-06-02 17:47:22.785 [21163] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:47:25.665 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:47:28.572 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 17:47:28.572 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432090,ok=432090,error=0, records=41
[WARN ] 2026-06-02 17:47:37.790 [21163] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:47:40.666 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:47:41.389 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 17999/300s
[INFO ] 2026-06-02 17:47:41.391 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835968},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:47:41.567 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:47:41.567 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 17:47:41.568 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:47:41.568 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:47:41.568 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:47:41.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:47:41.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:47:41.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:47:41.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:47:41.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:47:41.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:47:43.578 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 17:47:43.578 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432091,ok=432091,error=0, records=41
[WARN ] 2026-06-02 17:47:52.795 [21163] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:47:55.667 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:47:58.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 17:47:58.584 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432092,ok=432092,error=0, records=41
[INFO ] 2026-06-02 17:48:04.629 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21616/300s
[INFO ] 2026-06-02 17:48:06.431 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21616/300s
[WARN ] 2026-06-02 17:48:07.801 [21127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:48:10.667 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:48:13.237 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21616/300s
[INFO ] 2026-06-02 17:48:13.591 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10360, records=41
[INFO ] 2026-06-02 17:48:13.591 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432093,ok=432093,error=0, records=41
[WARN ] 2026-06-02 17:48:22.806 [21143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:48:25.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.46MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:48:28.597 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-02 17:48:28.597 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432094,ok=432094,error=0, records=41
[WARN ] 2026-06-02 17:48:37.811 [21694] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:48:40.668 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:48:43.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-02 17:48:43.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432095,ok=432095,error=0, records=41
[WARN ] 2026-06-02 17:48:52.816 [21709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:48:55.669 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:48:58.609 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10348, records=41
[INFO ] 2026-06-02 17:48:58.609 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432096,ok=432096,error=0, records=41
[WARN ] 2026-06-02 17:49:07.821 [21709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:49:10.670 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:49:13.619 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 17:49:13.619 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432097,ok=432097,error=0, records=41
[WARN ] 2026-06-02 17:49:22.827 [21694] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:49:25.670 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:49:28.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10149, records=41
[INFO ] 2026-06-02 17:49:28.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432098,ok=432098,error=0, records=41
[WARN ] 2026-06-02 17:49:37.831 [21766] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:49:40.671 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:49:43.631 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 17:49:43.631 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432099,ok=432099,error=0, records=41
[WARN ] 2026-06-02 17:49:52.837 [21694] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:49:55.672 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:49:58.636 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-02 17:49:58.636 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432100,ok=432100,error=0, records=41
[INFO ] 2026-06-02 17:50:02.283 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21620/300s
[WARN ] 2026-06-02 17:50:07.842 [21752] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:50:10.672 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.03MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:50:13.642 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 17:50:13.642 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432101,ok=432101,error=0, records=41
[WARN ] 2026-06-02 17:50:22.847 [21794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:50:25.673 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:50:28.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 17:50:28.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432102,ok=432102,error=0, records=41
[WARN ] 2026-06-02 17:50:37.851 [21808] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:50:40.674 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:50:40.852 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21611/300s
[INFO ] 2026-06-02 17:50:41.569 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835896},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:50:41.732 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:50:41.732 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 17:50:41.732 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:50:41.732 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:50:41.732 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:50:41.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:50:41.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:50:41.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:50:41.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:50:41.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:50:41.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:50:43.655 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 17:50:43.655 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432103,ok=432103,error=0, records=41
[INFO ] 2026-06-02 17:50:43.764 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21620/300s
[WARN ] 2026-06-02 17:50:52.857 [21822] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:50:55.674 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.04MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:50:58.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 17:50:58.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432104,ok=432104,error=0, records=41
[WARN ] 2026-06-02 17:51:07.862 [21794] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:51:10.675 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:51:13.671 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 17:51:13.671 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432105,ok=432105,error=0, records=41
[WARN ] 2026-06-02 17:51:22.866 [21822] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:51:25.675 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:51:28.680 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 17:51:28.680 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432106,ok=432106,error=0, records=41
[INFO ] 2026-06-02 17:51:28.680 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21607/300s
[WARN ] 2026-06-02 17:51:37.871 [21850] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:51:40.676 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:51:43.684 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 17:51:43.684 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432107,ok=432107,error=0, records=41
[WARN ] 2026-06-02 17:51:52.876 [21892] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:51:55.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:51:56.931 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21616/300s
[INFO ] 2026-06-02 17:51:58.691 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 17:51:58.691 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432108,ok=432108,error=0, records=41
[WARN ] 2026-06-02 17:52:07.882 [21892] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:52:10.677 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:52:10.677 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21619/300s
[INFO ] 2026-06-02 17:52:13.697 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 17:52:13.697 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432109,ok=432109,error=0, records=41
[INFO ] 2026-06-02 17:52:19.699 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21607/300s
[WARN ] 2026-06-02 17:52:22.888 [21927] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:52:25.678 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:52:28.704 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 17:52:28.704 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432110,ok=432110,error=0, records=41
[WARN ] 2026-06-02 17:52:37.894 [21927] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:52:40.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:52:43.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 17:52:43.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432111,ok=432111,error=0, records=41
[WARN ] 2026-06-02 17:52:52.899 [21927] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:52:55.679 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:52:58.761 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 17:52:58.761 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432112,ok=432112,error=0, records=41
[INFO ] 2026-06-02 17:53:04.693 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21617/300s
[INFO ] 2026-06-02 17:53:06.494 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21617/300s
[WARN ] 2026-06-02 17:53:07.905 [21893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:53:10.680 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:53:13.300 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21617/300s
[INFO ] 2026-06-02 17:53:13.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10315, records=41
[INFO ] 2026-06-02 17:53:13.768 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432113,ok=432113,error=0, records=41
[WARN ] 2026-06-02 17:53:22.909 [21893] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:53:25.680 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:53:28.775 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 17:53:28.775 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432114,ok=432114,error=0, records=41
[WARN ] 2026-06-02 17:53:37.914 [22009] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:53:40.681 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 17:53:40.681 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 17:53:41.732 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18000/300s
[INFO ] 2026-06-02 17:53:41.734 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835820},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:53:41.901 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:53:41.901 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 17:53:41.901 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:53:41.901 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:53:41.901 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:53:41.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:53:41.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:53:41.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:53:41.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:53:41.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:53:41.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:53:43.780 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 17:53:43.780 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432115,ok=432115,error=0, records=41
[WARN ] 2026-06-02 17:53:52.920 [22009] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:53:55.682 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:53:55.682 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 17:53:58.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 17:53:58.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432116,ok=432116,error=0, records=41
[WARN ] 2026-06-02 17:54:07.925 [22040] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:54:10.683 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:54:13.791 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 17:54:13.791 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432117,ok=432117,error=0, records=41
[WARN ] 2026-06-02 17:54:22.929 [21951] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:54:25.684 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:54:28.798 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 17:54:28.798 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432118,ok=432118,error=0, records=41
[WARN ] 2026-06-02 17:54:37.936 [22045] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:54:40.685 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:54:43.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 17:54:43.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432119,ok=432119,error=0, records=41
[WARN ] 2026-06-02 17:54:52.942 [22083] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:54:55.685 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:54:58.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 17:54:58.814 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432120,ok=432120,error=0, records=41
[INFO ] 2026-06-02 17:55:02.286 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21621/300s
[WARN ] 2026-06-02 17:55:07.946 [22103] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:55:10.686 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:55:13.819 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 17:55:13.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432121,ok=432121,error=0, records=41
[WARN ] 2026-06-02 17:55:22.952 [22045] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:55:25.687 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:55:28.824 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 17:55:28.824 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432122,ok=432122,error=0, records=41
[WARN ] 2026-06-02 17:55:37.957 [22098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:55:40.687 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:55:40.958 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21612/300s
[INFO ] 2026-06-02 17:55:43.770 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21621/300s
[INFO ] 2026-06-02 17:55:43.902 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 17:55:43.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432123,ok=432123,error=0, records=41
[WARN ] 2026-06-02 17:55:52.962 [22141] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:55:55.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:55:58.908 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 17:55:58.908 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432124,ok=432124,error=0, records=41
[WARN ] 2026-06-02 17:56:07.967 [22098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:56:10.688 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:56:13.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 17:56:13.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432125,ok=432125,error=0, records=41
[WARN ] 2026-06-02 17:56:22.972 [22098] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:56:25.689 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.05MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:56:28.921 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 17:56:28.921 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432126,ok=432126,error=0, records=41
[INFO ] 2026-06-02 17:56:28.921 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21608/300s
[WARN ] 2026-06-02 17:56:37.978 [22169] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:56:40.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:56:41.903 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835752},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:56:42.066 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:56:42.066 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 17:56:42.067 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:56:42.067 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:56:42.067 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:56:42.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:56:42.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:56:42.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:56:42.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:56:42.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:56:42.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:56:43.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 17:56:43.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432127,ok=432127,error=0, records=41
[WARN ] 2026-06-02 17:56:52.984 [22097] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:56:55.690 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:56:56.985 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21617/300s
[INFO ] 2026-06-02 17:56:58.934 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 17:56:58.934 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432128,ok=432128,error=0, records=41
[WARN ] 2026-06-02 17:57:07.989 [22169] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:57:10.691 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:57:10.691 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21620/300s
[INFO ] 2026-06-02 17:57:13.942 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 17:57:13.942 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432129,ok=432129,error=0, records=41
[INFO ] 2026-06-02 17:57:19.883 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21608/300s
[WARN ] 2026-06-02 17:57:22.994 [22225] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:57:25.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:57:28.949 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 17:57:28.949 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432130,ok=432130,error=0, records=41
[WARN ] 2026-06-02 17:57:37.998 [22183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:57:40.692 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:57:43.955 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 17:57:43.955 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432131,ok=432131,error=0, records=41
[WARN ] 2026-06-02 17:57:53.003 [22183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:57:55.693 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:57:58.961 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 17:57:58.961 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432132,ok=432132,error=0, records=41
[INFO ] 2026-06-02 17:58:04.758 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21618/300s
[INFO ] 2026-06-02 17:58:06.559 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21618/300s
[WARN ] 2026-06-02 17:58:08.008 [22183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:58:10.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:58:13.365 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21618/300s
[INFO ] 2026-06-02 17:58:13.965 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 17:58:13.965 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432133,ok=432133,error=0, records=41
[WARN ] 2026-06-02 17:58:23.013 [22183] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:58:25.694 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:58:28.972 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 17:58:28.972 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432134,ok=432134,error=0, records=41
[WARN ] 2026-06-02 17:58:38.019 [22279] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:58:40.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:58:43.982 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 17:58:43.982 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432135,ok=432135,error=0, records=41
[WARN ] 2026-06-02 17:58:53.024 [22307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:58:55.695 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.72MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:58:58.987 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 17:58:58.987 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432136,ok=432136,error=0, records=41
[WARN ] 2026-06-02 17:59:08.028 [22293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:59:10.696 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:59:13.997 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 17:59:13.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432137,ok=432137,error=0, records=41
[WARN ] 2026-06-02 17:59:23.033 [22321] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:59:25.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:59:29.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-02 17:59:29.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432138,ok=432138,error=0, records=41
[WARN ] 2026-06-02 17:59:38.038 [22293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:59:40.697 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:59:42.067 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18001/300s
[INFO ] 2026-06-02 17:59:42.068 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835680},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 17:59:42.224 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 17:59:42.224 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 17:59:42.224 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 17:59:42.224 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 17:59:42.224 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 17:59:42.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:59:42.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 17:59:42.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:59:42.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 17:59:42.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:59:42.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 17:59:44.008 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 17:59:44.008 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432139,ok=432139,error=0, records=41
[WARN ] 2026-06-02 17:59:53.043 [22369] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 17:59:55.698 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 17:59:59.019 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 17:59:59.019 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432140,ok=432140,error=0, records=41
[INFO ] 2026-06-02 18:00:02.290 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21622/300s
[WARN ] 2026-06-02 18:00:08.048 [22356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:00:10.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:00:14.027 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 18:00:14.027 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432141,ok=432141,error=0, records=41
[WARN ] 2026-06-02 18:00:23.053 [22356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:00:25.699 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:00:29.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 18:00:29.032 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432142,ok=432142,error=0, records=41
[WARN ] 2026-06-02 18:00:37.558 [22395] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:00:40.700 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:00:41.059 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21613/300s
[INFO ] 2026-06-02 18:00:43.776 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21622/300s
[INFO ] 2026-06-02 18:00:44.037 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 18:00:44.037 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432143,ok=432143,error=0, records=41
[WARN ] 2026-06-02 18:00:52.564 [22395] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:00:55.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:00:59.042 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 18:00:59.042 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432144,ok=432144,error=0, records=41
[WARN ] 2026-06-02 18:01:07.567 [22429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:01:10.701 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:01:14.048 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10290, records=41
[INFO ] 2026-06-02 18:01:14.048 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432145,ok=432145,error=0, records=41
[WARN ] 2026-06-02 18:01:22.573 [22469] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:01:25.702 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.85MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:01:29.054 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10276, records=41
[INFO ] 2026-06-02 18:01:29.054 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432146,ok=432146,error=0, records=41
[INFO ] 2026-06-02 18:01:29.054 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21609/300s
[WARN ] 2026-06-02 18:01:37.578 [22470] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:01:40.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:01:44.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-02 18:01:44.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432147,ok=432147,error=0, records=41
[WARN ] 2026-06-02 18:01:52.584 [22429] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:01:55.703 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:01:57.044 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21618/300s
[INFO ] 2026-06-02 18:01:59.067 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 18:01:59.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432148,ok=432148,error=0, records=41
[WARN ] 2026-06-02 18:02:07.589 [22517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:02:10.704 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:02:10.704 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21621/300s
[INFO ] 2026-06-02 18:02:14.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 18:02:14.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432149,ok=432149,error=0, records=41
[INFO ] 2026-06-02 18:02:20.068 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21609/300s
[WARN ] 2026-06-02 18:02:22.593 [22533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:02:25.705 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:02:29.081 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 18:02:29.081 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432150,ok=432150,error=0, records=41
[WARN ] 2026-06-02 18:02:37.598 [22555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:02:40.705 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:02:42.226 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835596},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:02:42.384 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:02:42.384 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 18:02:42.384 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:02:42.384 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:02:42.384 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:02:42.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:02:42.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:02:42.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:02:42.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:02:42.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:02:42.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:02:44.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 18:02:44.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432151,ok=432151,error=0, records=41
[WARN ] 2026-06-02 18:02:52.603 [22570] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:02:55.706 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:02:59.093 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 18:02:59.093 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432152,ok=432152,error=0, records=41
[INFO ] 2026-06-02 18:03:04.828 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21619/300s
[INFO ] 2026-06-02 18:03:06.630 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21619/300s
[WARN ] 2026-06-02 18:03:07.609 [22517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:03:10.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:03:13.437 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21619/300s
[INFO ] 2026-06-02 18:03:14.099 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 18:03:14.099 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432153,ok=432153,error=0, records=41
[WARN ] 2026-06-02 18:03:22.614 [22555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:03:25.707 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:03:29.105 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 18:03:29.105 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432154,ok=432154,error=0, records=41
[WARN ] 2026-06-02 18:03:37.619 [22570] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:03:40.708 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 18:03:40.708 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 18:03:44.111 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 18:03:44.111 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432155,ok=432155,error=0, records=41
[WARN ] 2026-06-02 18:03:52.624 [22517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:03:55.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:03:59.117 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 18:03:59.117 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432156,ok=432156,error=0, records=41
[WARN ] 2026-06-02 18:04:07.629 [22555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:04:10.709 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:04:14.124 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10173, records=41
[INFO ] 2026-06-02 18:04:14.124 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432157,ok=432157,error=0, records=41
[WARN ] 2026-06-02 18:04:22.634 [22533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:04:25.710 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:04:29.129 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10165, records=41
[INFO ] 2026-06-02 18:04:29.129 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432158,ok=432158,error=0, records=41
[WARN ] 2026-06-02 18:04:37.640 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:04:40.710 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.98MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:04:44.134 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-02 18:04:44.134 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432159,ok=432159,error=0, records=41
[WARN ] 2026-06-02 18:04:52.645 [22570] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:04:55.711 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:04:59.141 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-02 18:04:59.141 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432160,ok=432160,error=0, records=41
[INFO ] 2026-06-02 18:05:02.293 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21623/300s
[WARN ] 2026-06-02 18:05:07.649 [22555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:05:10.712 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:05:14.147 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 18:05:14.147 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432161,ok=432161,error=0, records=41
[WARN ] 2026-06-02 18:05:22.655 [22533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:05:25.712 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:05:29.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 18:05:29.154 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432162,ok=432162,error=0, records=41
[WARN ] 2026-06-02 18:05:37.661 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:05:40.713 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:05:41.162 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21614/300s
[INFO ] 2026-06-02 18:05:42.384 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18002/300s
[INFO ] 2026-06-02 18:05:42.386 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835520},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:05:42.552 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:05:42.552 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 18:05:42.552 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:05:42.552 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:05:42.552 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:05:42.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:05:42.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:05:42.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:05:42.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:05:42.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:05:42.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:05:43.783 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21623/300s
[INFO ] 2026-06-02 18:05:44.161 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 18:05:44.161 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432163,ok=432163,error=0, records=41
[WARN ] 2026-06-02 18:05:52.666 [22533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:05:55.714 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:05:59.166 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 18:05:59.166 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432164,ok=432164,error=0, records=41
[WARN ] 2026-06-02 18:06:07.671 [22555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:06:10.714 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:06:14.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-02 18:06:14.175 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432165,ok=432165,error=0, records=41
[WARN ] 2026-06-02 18:06:22.676 [22533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:06:25.715 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:06:29.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 18:06:29.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432166,ok=432166,error=0, records=41
[INFO ] 2026-06-02 18:06:29.181 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21610/300s
[WARN ] 2026-06-02 18:06:37.681 [22533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:06:40.716 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:06:44.187 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 18:06:44.187 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432167,ok=432167,error=0, records=41
[WARN ] 2026-06-02 18:06:52.687 [22555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:06:55.716 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:06:57.098 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21619/300s
[INFO ] 2026-06-02 18:06:59.193 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10298, records=41
[INFO ] 2026-06-02 18:06:59.193 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432168,ok=432168,error=0, records=41
[WARN ] 2026-06-02 18:07:07.692 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:07:10.717 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:07:10.717 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21622/300s
[INFO ] 2026-06-02 18:07:14.199 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 18:07:14.199 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432169,ok=432169,error=0, records=41
[INFO ] 2026-06-02 18:07:20.247 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21610/300s
[WARN ] 2026-06-02 18:07:22.697 [22555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:07:25.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:07:29.204 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 18:07:29.204 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432170,ok=432170,error=0, records=41
[WARN ] 2026-06-02 18:07:37.701 [22517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:07:40.718 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:07:44.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-02 18:07:44.211 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432171,ok=432171,error=0, records=41
[WARN ] 2026-06-02 18:07:52.707 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:07:55.719 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:07:59.217 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 18:07:59.217 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432172,ok=432172,error=0, records=41
[INFO ] 2026-06-02 18:08:04.910 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21620/300s
[INFO ] 2026-06-02 18:08:06.711 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21620/300s
[WARN ] 2026-06-02 18:08:07.712 [22533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:08:10.719 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:08:13.517 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21620/300s
[INFO ] 2026-06-02 18:08:14.225 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 18:08:14.225 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432173,ok=432173,error=0, records=41
[WARN ] 2026-06-02 18:08:22.718 [22555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:08:25.720 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:08:29.230 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 18:08:29.231 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432174,ok=432174,error=0, records=41
[WARN ] 2026-06-02 18:08:37.723 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:08:40.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:08:42.554 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835448},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:08:42.695 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:08:42.695 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 18:08:42.695 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:08:42.695 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:08:42.695 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:08:42.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:08:42.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:08:42.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:08:42.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:08:42.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:08:42.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:08:44.237 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 18:08:44.237 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432175,ok=432175,error=0, records=41
[WARN ] 2026-06-02 18:08:52.729 [22517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:08:55.721 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:08:55.721 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 18:08:59.258 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 18:08:59.259 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432176,ok=432176,error=0, records=41
[WARN ] 2026-06-02 18:09:07.734 [22517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:09:10.723 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:09:14.263 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10162, records=41
[INFO ] 2026-06-02 18:09:14.263 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432177,ok=432177,error=0, records=41
[WARN ] 2026-06-02 18:09:22.740 [22555] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:09:25.724 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:09:29.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10164, records=41
[INFO ] 2026-06-02 18:09:29.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432178,ok=432178,error=0, records=41
[WARN ] 2026-06-02 18:09:37.746 [22517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:09:40.724 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=24.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:09:44.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10178, records=41
[INFO ] 2026-06-02 18:09:44.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432179,ok=432179,error=0, records=41
[WARN ] 2026-06-02 18:09:52.752 [22517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:09:55.725 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.95MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:09:59.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-02 18:09:59.285 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432180,ok=432180,error=0, records=41
[INFO ] 2026-06-02 18:10:02.298 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21624/300s
[WARN ] 2026-06-02 18:10:07.757 [22570] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:10:10.725 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:10:14.289 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 18:10:14.289 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432181,ok=432181,error=0, records=41
[WARN ] 2026-06-02 18:10:22.762 [22570] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:10:25.726 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:10:29.295 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 18:10:29.295 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432182,ok=432182,error=0, records=41
[WARN ] 2026-06-02 18:10:37.767 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:10:40.727 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:10:41.268 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21615/300s
[INFO ] 2026-06-02 18:10:43.790 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21624/300s
[INFO ] 2026-06-02 18:10:44.301 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 18:10:44.301 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432183,ok=432183,error=0, records=41
[WARN ] 2026-06-02 18:10:52.771 [22570] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:10:55.727 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:10:59.307 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 18:10:59.307 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432184,ok=432184,error=0, records=41
[WARN ] 2026-06-02 18:11:07.777 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:11:10.728 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:11:14.313 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 18:11:14.313 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432185,ok=432185,error=0, records=41
[WARN ] 2026-06-02 18:11:22.782 [22533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:11:25.729 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:11:29.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 18:11:29.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432186,ok=432186,error=0, records=41
[INFO ] 2026-06-02 18:11:29.321 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21611/300s
[WARN ] 2026-06-02 18:11:37.789 [22517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:11:40.729 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:11:42.695 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18003/300s
[INFO ] 2026-06-02 18:11:42.697 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835372},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:11:42.853 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:11:42.853 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 18:11:42.853 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:11:42.853 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:11:42.853 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:11:42.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:11:42.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:11:42.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:11:42.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:11:42.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:11:42.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:11:44.326 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 18:11:44.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432187,ok=432187,error=0, records=41
[WARN ] 2026-06-02 18:11:52.795 [22570] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:11:55.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:11:57.156 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21620/300s
[INFO ] 2026-06-02 18:11:59.332 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 18:11:59.332 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432188,ok=432188,error=0, records=41
[WARN ] 2026-06-02 18:12:07.800 [22538] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:12:10.730 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:12:10.730 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21623/300s
[INFO ] 2026-06-02 18:12:14.337 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10181, records=41
[INFO ] 2026-06-02 18:12:14.337 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432189,ok=432189,error=0, records=41
[INFO ] 2026-06-02 18:12:20.430 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21611/300s
[WARN ] 2026-06-02 18:12:22.805 [22570] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:12:25.731 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:12:29.342 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-02 18:12:29.342 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432190,ok=432190,error=0, records=41
[WARN ] 2026-06-02 18:12:37.810 [23104] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:12:40.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:12:44.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10141, records=41
[INFO ] 2026-06-02 18:12:44.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432191,ok=432191,error=0, records=41
[WARN ] 2026-06-02 18:12:52.815 [23088] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:12:55.732 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:12:59.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-02 18:12:59.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432192,ok=432192,error=0, records=41
[INFO ] 2026-06-02 18:13:04.968 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21621/300s
[INFO ] 2026-06-02 18:13:06.769 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21621/300s
[WARN ] 2026-06-02 18:13:07.820 [23133] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:13:10.733 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.70MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:13:13.574 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21621/300s
[INFO ] 2026-06-02 18:13:14.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 18:13:14.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432193,ok=432193,error=0, records=41
[WARN ] 2026-06-02 18:13:22.825 [23114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:13:25.734 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:13:29.369 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 18:13:29.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432194,ok=432194,error=0, records=41
[WARN ] 2026-06-02 18:13:37.830 [22517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:13:40.734 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 18:13:40.734 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 18:13:44.374 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 18:13:44.374 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432195,ok=432195,error=0, records=41
[WARN ] 2026-06-02 18:13:52.836 [23114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:13:55.735 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:13:59.386 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 18:13:59.386 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432196,ok=432196,error=0, records=41
[WARN ] 2026-06-02 18:14:07.840 [23174] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:14:10.736 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:14:14.391 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 18:14:14.391 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432197,ok=432197,error=0, records=41
[WARN ] 2026-06-02 18:14:22.845 [23099] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:14:25.736 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:14:29.397 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 18:14:29.397 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432198,ok=432198,error=0, records=41
[WARN ] 2026-06-02 18:14:37.851 [22517] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:14:40.737 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:14:42.855 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835304},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:14:43.045 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:14:43.046 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 18:14:43.046 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:14:43.046 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:14:43.046 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:14:43.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:14:43.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:14:43.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:14:43.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:14:43.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:14:43.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:14:44.402 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 18:14:44.402 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432199,ok=432199,error=0, records=41
[WARN ] 2026-06-02 18:14:52.855 [23211] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:14:55.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:14:59.407 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 18:14:59.407 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432200,ok=432200,error=0, records=41
[INFO ] 2026-06-02 18:15:02.301 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21625/300s
[WARN ] 2026-06-02 18:15:07.860 [23211] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:15:10.738 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:15:14.414 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 18:15:14.414 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432201,ok=432201,error=0, records=41
[WARN ] 2026-06-02 18:15:22.864 [23211] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:15:25.739 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:15:29.436 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 18:15:29.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432202,ok=432202,error=0, records=41
[WARN ] 2026-06-02 18:15:37.869 [23268] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:15:40.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:15:41.370 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21616/300s
[INFO ] 2026-06-02 18:15:43.796 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21625/300s
[INFO ] 2026-06-02 18:15:44.443 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 18:15:44.443 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432203,ok=432203,error=0, records=41
[WARN ] 2026-06-02 18:15:52.874 [23114] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:15:55.740 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:15:59.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 18:15:59.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432204,ok=432204,error=0, records=41
[WARN ] 2026-06-02 18:16:07.879 [23299] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:16:10.741 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:16:14.474 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 18:16:14.474 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432205,ok=432205,error=0, records=41
[WARN ] 2026-06-02 18:16:22.884 [23298] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:16:25.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:16:29.480 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 18:16:29.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432206,ok=432206,error=0, records=41
[INFO ] 2026-06-02 18:16:29.480 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21612/300s
[WARN ] 2026-06-02 18:16:37.889 [23282] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:16:40.742 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:16:44.486 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 18:16:44.486 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432207,ok=432207,error=0, records=41
[WARN ] 2026-06-02 18:16:52.894 [23347] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:16:55.743 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:16:57.211 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21621/300s
[INFO ] 2026-06-02 18:16:59.491 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 18:16:59.491 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432208,ok=432208,error=0, records=41
[WARN ] 2026-06-02 18:17:07.901 [23363] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:17:10.744 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:17:10.744 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21624/300s
[INFO ] 2026-06-02 18:17:14.496 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 18:17:14.496 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432209,ok=432209,error=0, records=41
[INFO ] 2026-06-02 18:17:20.615 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21612/300s
[WARN ] 2026-06-02 18:17:22.907 [23372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:17:25.744 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:17:29.502 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 18:17:29.502 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432210,ok=432210,error=0, records=41
[WARN ] 2026-06-02 18:17:37.912 [23395] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:17:40.745 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.29MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:17:43.046 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18004/300s
[INFO ] 2026-06-02 18:17:43.047 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835220},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:17:43.208 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:17:43.208 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 18:17:43.208 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:17:43.208 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:17:43.208 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:17:43.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:17:43.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:17:43.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:17:43.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:17:43.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:17:43.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:17:44.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 18:17:44.510 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432211,ok=432211,error=0, records=41
[WARN ] 2026-06-02 18:17:52.917 [23418] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:17:55.746 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:17:59.515 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 18:17:59.515 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432212,ok=432212,error=0, records=41
[INFO ] 2026-06-02 18:18:05.036 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21622/300s
[INFO ] 2026-06-02 18:18:06.838 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21622/300s
[WARN ] 2026-06-02 18:18:07.923 [23395] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:18:10.746 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:18:13.644 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21622/300s
[INFO ] 2026-06-02 18:18:14.522 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 18:18:14.522 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432213,ok=432213,error=0, records=41
[WARN ] 2026-06-02 18:18:22.927 [23449] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:18:25.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:18:29.527 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 18:18:29.528 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432214,ok=432214,error=0, records=41
[WARN ] 2026-06-02 18:18:37.934 [23444] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:18:40.747 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:18:44.533 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 18:18:44.533 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432215,ok=432215,error=0, records=41
[WARN ] 2026-06-02 18:18:52.940 [23482] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:18:55.748 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.54MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:18:59.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 18:18:59.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432216,ok=432216,error=0, records=41
[WARN ] 2026-06-02 18:19:07.945 [23498] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:19:10.749 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:19:14.544 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10354, records=41
[INFO ] 2026-06-02 18:19:14.545 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432217,ok=432217,error=0, records=41
[WARN ] 2026-06-02 18:19:22.950 [23476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:19:25.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:19:29.549 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-02 18:19:29.549 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432218,ok=432218,error=0, records=41
[WARN ] 2026-06-02 18:19:37.956 [23509] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:19:40.750 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:19:44.555 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10341, records=41
[INFO ] 2026-06-02 18:19:44.555 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432219,ok=432219,error=0, records=41
[WARN ] 2026-06-02 18:19:52.961 [23510] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:19:55.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:19:59.561 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 18:19:59.561 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432220,ok=432220,error=0, records=41
[INFO ] 2026-06-02 18:20:02.305 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21626/300s
[WARN ] 2026-06-02 18:20:07.966 [23503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:20:10.751 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:20:14.572 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 18:20:14.572 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432221,ok=432221,error=0, records=41
[WARN ] 2026-06-02 18:20:22.972 [23476] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:20:25.752 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:20:29.577 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 18:20:29.577 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432222,ok=432222,error=0, records=41
[WARN ] 2026-06-02 18:20:37.977 [23510] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:20:40.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:20:41.478 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21617/300s
[INFO ] 2026-06-02 18:20:43.210 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835156},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:20:43.379 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:20:43.379 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 18:20:43.379 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:20:43.379 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:20:43.379 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:20:43.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:20:43.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:20:43.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:20:43.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:20:43.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:20:43.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:20:43.803 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21626/300s
[INFO ] 2026-06-02 18:20:44.584 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 18:20:44.584 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432223,ok=432223,error=0, records=41
[WARN ] 2026-06-02 18:20:52.982 [23510] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:20:55.753 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:20:59.589 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 18:20:59.589 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432224,ok=432224,error=0, records=41
[WARN ] 2026-06-02 18:21:07.986 [23612] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:21:10.754 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:21:14.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 18:21:14.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432225,ok=432225,error=0, records=41
[WARN ] 2026-06-02 18:21:22.992 [23556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:21:25.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:21:29.599 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 18:21:29.599 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432226,ok=432226,error=0, records=41
[INFO ] 2026-06-02 18:21:29.599 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21613/300s
[WARN ] 2026-06-02 18:21:37.996 [23598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:21:40.755 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:21:44.606 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 18:21:44.606 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432227,ok=432227,error=0, records=41
[WARN ] 2026-06-02 18:21:53.001 [23598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:21:55.756 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:21:57.266 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21622/300s
[INFO ] 2026-06-02 18:21:59.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-02 18:21:59.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432228,ok=432228,error=0, records=41
[WARN ] 2026-06-02 18:22:08.006 [23556] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:22:10.756 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:22:10.756 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21625/300s
[INFO ] 2026-06-02 18:22:14.617 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 18:22:14.617 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432229,ok=432229,error=0, records=41
[INFO ] 2026-06-02 18:22:20.798 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21613/300s
[WARN ] 2026-06-02 18:22:23.012 [23598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:22:25.757 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:22:29.622 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10150, records=41
[INFO ] 2026-06-02 18:22:29.622 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432230,ok=432230,error=0, records=41
[WARN ] 2026-06-02 18:22:38.017 [23503] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:22:40.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:22:44.628 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 18:22:44.628 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432231,ok=432231,error=0, records=41
[WARN ] 2026-06-02 18:22:53.022 [23598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:22:55.758 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:22:59.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10180, records=41
[INFO ] 2026-06-02 18:22:59.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432232,ok=432232,error=0, records=41
[INFO ] 2026-06-02 18:23:05.096 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21623/300s
[INFO ] 2026-06-02 18:23:06.897 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21623/300s
[WARN ] 2026-06-02 18:23:08.028 [23598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:23:10.759 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:23:13.703 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21623/300s
[INFO ] 2026-06-02 18:23:14.725 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10284, records=41
[INFO ] 2026-06-02 18:23:14.725 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432233,ok=432233,error=0, records=41
[WARN ] 2026-06-02 18:23:23.032 [23598] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:23:25.759 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:23:29.733 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 18:23:29.733 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432234,ok=432234,error=0, records=41
[WARN ] 2026-06-02 18:23:38.037 [23670] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:23:40.760 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 18:23:40.760 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 18:23:43.379 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18005/300s
[INFO ] 2026-06-02 18:23:43.381 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20835076},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:23:43.551 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:23:43.552 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 18:23:43.552 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:23:43.552 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:23:43.552 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:23:43.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:23:43.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:23:43.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:23:43.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:23:43.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:23:43.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:23:44.738 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 18:23:44.738 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432235,ok=432235,error=0, records=41
[WARN ] 2026-06-02 18:23:53.041 [23780] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:23:55.761 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:23:55.761 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 18:23:59.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10275, records=41
[INFO ] 2026-06-02 18:23:59.744 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432236,ok=432236,error=0, records=41
[WARN ] 2026-06-02 18:24:08.046 [23785] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:24:10.762 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:24:14.751 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 18:24:14.751 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432237,ok=432237,error=0, records=41
[WARN ] 2026-06-02 18:24:23.051 [23810] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:24:25.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:24:29.757 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 18:24:29.757 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432238,ok=432238,error=0, records=41
[WARN ] 2026-06-02 18:24:37.555 [23831] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:24:40.763 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.68MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:24:44.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 18:24:44.767 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432239,ok=432239,error=0, records=41
[WARN ] 2026-06-02 18:24:52.561 [23792] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:24:55.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=27.97MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:24:59.772 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 18:24:59.772 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432240,ok=432240,error=0, records=41
[INFO ] 2026-06-02 18:25:02.308 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21627/300s
[WARN ] 2026-06-02 18:25:07.566 [23792] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:25:10.764 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:25:14.777 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10295, records=41
[INFO ] 2026-06-02 18:25:14.777 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432241,ok=432241,error=0, records=41
[WARN ] 2026-06-02 18:25:22.572 [23881] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:25:25.765 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.74MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:25:29.782 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 18:25:29.782 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432242,ok=432242,error=0, records=41
[WARN ] 2026-06-02 18:25:37.577 [23904] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:25:40.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:25:41.578 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21618/300s
[INFO ] 2026-06-02 18:25:43.810 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21627/300s
[INFO ] 2026-06-02 18:25:44.789 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 18:25:44.789 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432243,ok=432243,error=0, records=41
[WARN ] 2026-06-02 18:25:52.582 [23921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:25:55.766 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:25:59.796 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 18:25:59.796 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432244,ok=432244,error=0, records=41
[WARN ] 2026-06-02 18:26:07.587 [23921] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:26:10.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:26:14.805 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 18:26:14.805 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432245,ok=432245,error=0, records=41
[WARN ] 2026-06-02 18:26:22.592 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:26:25.767 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:26:29.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 18:26:29.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432246,ok=432246,error=0, records=41
[INFO ] 2026-06-02 18:26:29.810 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21614/300s
[WARN ] 2026-06-02 18:26:37.597 [23969] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:26:40.768 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:26:43.553 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834996},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:26:43.712 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:26:43.712 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 18:26:43.712 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:26:43.712 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:26:43.712 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:26:43.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:26:43.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:26:43.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:26:43.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:26:43.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:26:43.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:26:44.826 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 18:26:44.826 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432247,ok=432247,error=0, records=41
[WARN ] 2026-06-02 18:26:52.602 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:26:55.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:26:57.320 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21623/300s
[INFO ] 2026-06-02 18:26:59.831 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 18:26:59.831 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432248,ok=432248,error=0, records=41
[WARN ] 2026-06-02 18:27:07.608 [23969] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:27:10.769 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:27:10.769 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21626/300s
[INFO ] 2026-06-02 18:27:14.860 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 18:27:14.860 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432249,ok=432249,error=0, records=41
[INFO ] 2026-06-02 18:27:20.978 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21614/300s
[WARN ] 2026-06-02 18:27:22.613 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:27:25.770 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:27:29.866 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 18:27:29.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432250,ok=432250,error=0, records=41
[WARN ] 2026-06-02 18:27:37.618 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:27:40.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:27:44.876 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 18:27:44.876 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432251,ok=432251,error=0, records=41
[WARN ] 2026-06-02 18:27:52.623 [23949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:27:55.771 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.89MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:27:59.882 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10191, records=41
[INFO ] 2026-06-02 18:27:59.882 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432252,ok=432252,error=0, records=41
[INFO ] 2026-06-02 18:28:05.146 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21624/300s
[INFO ] 2026-06-02 18:28:06.948 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21624/300s
[WARN ] 2026-06-02 18:28:07.628 [23969] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:28:10.772 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:28:13.754 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21624/300s
[INFO ] 2026-06-02 18:28:14.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10241, records=41
[INFO ] 2026-06-02 18:28:14.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432253,ok=432253,error=0, records=41
[WARN ] 2026-06-02 18:28:22.634 [23969] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:28:25.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:28:29.892 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 18:28:29.892 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432254,ok=432254,error=0, records=41
[WARN ] 2026-06-02 18:28:37.639 [23949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:28:40.773 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:28:44.899 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 18:28:44.899 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432255,ok=432255,error=0, records=41
[WARN ] 2026-06-02 18:28:52.645 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:28:55.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:28:59.903 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10265, records=41
[INFO ] 2026-06-02 18:28:59.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432256,ok=432256,error=0, records=41
[WARN ] 2026-06-02 18:29:07.651 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:29:10.774 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:29:14.916 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 18:29:14.916 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432257,ok=432257,error=0, records=41
[WARN ] 2026-06-02 18:29:22.656 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:29:25.775 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:29:29.920 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 18:29:29.920 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432258,ok=432258,error=0, records=41
[WARN ] 2026-06-02 18:29:37.660 [23969] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:29:40.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:29:43.713 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18006/300s
[INFO ] 2026-06-02 18:29:43.714 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834916},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:29:43.866 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:29:43.866 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 18:29:43.867 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:29:43.867 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:29:43.867 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:29:43.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:29:43.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:29:43.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:29:43.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:29:43.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:29:43.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:29:44.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 18:29:44.926 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432259,ok=432259,error=0, records=41
[WARN ] 2026-06-02 18:29:52.664 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:29:55.776 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:29:59.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 18:29:59.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432260,ok=432260,error=0, records=41
[INFO ] 2026-06-02 18:30:02.311 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21628/300s
[WARN ] 2026-06-02 18:30:07.668 [23949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:30:10.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.90MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:30:14.938 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 18:30:14.938 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432261,ok=432261,error=0, records=41
[WARN ] 2026-06-02 18:30:22.675 [23949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:30:25.777 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:30:29.944 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 18:30:29.944 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432262,ok=432262,error=0, records=41
[WARN ] 2026-06-02 18:30:37.679 [23937] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:30:40.778 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:30:41.680 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21619/300s
[INFO ] 2026-06-02 18:30:43.816 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21628/300s
[INFO ] 2026-06-02 18:30:44.951 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 18:30:44.951 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432263,ok=432263,error=0, records=41
[WARN ] 2026-06-02 18:30:52.685 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:30:55.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:30:59.956 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 18:30:59.956 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432264,ok=432264,error=0, records=41
[WARN ] 2026-06-02 18:31:07.690 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:31:10.779 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:31:14.963 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 18:31:14.963 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432265,ok=432265,error=0, records=41
[WARN ] 2026-06-02 18:31:22.696 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:31:25.780 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:31:29.968 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 18:31:29.968 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432266,ok=432266,error=0, records=41
[INFO ] 2026-06-02 18:31:29.968 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21615/300s
[WARN ] 2026-06-02 18:31:37.703 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:31:40.781 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:31:44.973 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 18:31:44.973 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432267,ok=432267,error=0, records=41
[WARN ] 2026-06-02 18:31:52.707 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:31:55.781 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:31:57.377 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21624/300s
[INFO ] 2026-06-02 18:31:59.978 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 18:31:59.978 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432268,ok=432268,error=0, records=41
[WARN ] 2026-06-02 18:32:07.711 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:32:10.782 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:32:10.782 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21627/300s
[INFO ] 2026-06-02 18:32:14.986 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 18:32:14.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432269,ok=432269,error=0, records=41
[INFO ] 2026-06-02 18:32:21.163 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21615/300s
[WARN ] 2026-06-02 18:32:22.715 [23937] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:32:25.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:32:29.992 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 18:32:29.992 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432270,ok=432270,error=0, records=41
[WARN ] 2026-06-02 18:32:37.719 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:32:40.783 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:32:43.868 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834832},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:32:44.032 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:32:44.032 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 18:32:44.032 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:32:44.032 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:32:44.032 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:32:44.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:32:44.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:32:44.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:32:44.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:32:44.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:32:44.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:32:44.997 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 18:32:44.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432271,ok=432271,error=0, records=41
[WARN ] 2026-06-02 18:32:52.724 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:32:55.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:33:00.002 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 18:33:00.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432272,ok=432272,error=0, records=41
[INFO ] 2026-06-02 18:33:05.212 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21625/300s
[INFO ] 2026-06-02 18:33:07.013 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21625/300s
[WARN ] 2026-06-02 18:33:07.729 [23949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:33:10.784 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:33:13.820 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21625/300s
[INFO ] 2026-06-02 18:33:15.010 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 18:33:15.010 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432273,ok=432273,error=0, records=41
[WARN ] 2026-06-02 18:33:22.733 [23949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:33:25.785 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:33:30.018 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 18:33:30.018 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432274,ok=432274,error=0, records=41
[WARN ] 2026-06-02 18:33:37.739 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:33:40.786 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 18:33:40.786 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 18:33:45.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 18:33:45.023 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432275,ok=432275,error=0, records=41
[WARN ] 2026-06-02 18:33:52.743 [23969] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:33:55.786 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:34:00.028 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 18:34:00.028 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432276,ok=432276,error=0, records=41
[WARN ] 2026-06-02 18:34:07.747 [23937] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:34:10.787 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:34:15.038 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 18:34:15.038 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432277,ok=432277,error=0, records=41
[WARN ] 2026-06-02 18:34:22.752 [23937] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:34:25.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:34:30.049 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 18:34:30.049 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432278,ok=432278,error=0, records=41
[WARN ] 2026-06-02 18:34:37.756 [23937] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:34:40.788 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:34:45.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 18:34:45.055 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432279,ok=432279,error=0, records=41
[WARN ] 2026-06-02 18:34:52.761 [23949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:34:55.789 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:35:00.060 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 18:35:00.060 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432280,ok=432280,error=0, records=41
[INFO ] 2026-06-02 18:35:02.315 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21629/300s
[WARN ] 2026-06-02 18:35:07.766 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:35:10.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:35:15.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10273, records=41
[INFO ] 2026-06-02 18:35:15.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432281,ok=432281,error=0, records=41
[WARN ] 2026-06-02 18:35:22.771 [23949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:35:25.790 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:35:30.073 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 18:35:30.073 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432282,ok=432282,error=0, records=41
[WARN ] 2026-06-02 18:35:37.776 [23969] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:35:40.791 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:35:41.777 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21620/300s
[INFO ] 2026-06-02 18:35:43.823 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21629/300s
[INFO ] 2026-06-02 18:35:44.033 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18007/300s
[INFO ] 2026-06-02 18:35:44.034 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834748},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:35:44.194 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:35:44.194 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 18:35:44.194 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:35:44.194 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:35:44.194 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:35:44.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:35:44.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:35:44.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:35:44.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:35:44.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:35:44.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:35:45.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 18:35:45.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432283,ok=432283,error=0, records=41
[WARN ] 2026-06-02 18:35:52.782 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:35:55.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:36:00.164 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10255, records=41
[INFO ] 2026-06-02 18:36:00.164 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432284,ok=432284,error=0, records=41
[WARN ] 2026-06-02 18:36:07.787 [23937] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:36:10.792 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:36:15.169 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10363, records=41
[INFO ] 2026-06-02 18:36:15.169 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432285,ok=432285,error=0, records=41
[WARN ] 2026-06-02 18:36:22.792 [23949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:36:25.793 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:36:30.175 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10353, records=41
[INFO ] 2026-06-02 18:36:30.175 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432286,ok=432286,error=0, records=41
[INFO ] 2026-06-02 18:36:30.175 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21616/300s
[WARN ] 2026-06-02 18:36:37.797 [23954] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:36:40.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:36:45.182 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10334, records=41
[INFO ] 2026-06-02 18:36:45.182 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432287,ok=432287,error=0, records=41
[WARN ] 2026-06-02 18:36:52.801 [23937] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:36:55.794 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:36:57.436 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21625/300s
[INFO ] 2026-06-02 18:37:00.190 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10349, records=41
[INFO ] 2026-06-02 18:37:00.190 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432288,ok=432288,error=0, records=41
[WARN ] 2026-06-02 18:37:07.806 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:37:10.795 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:37:10.795 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21628/300s
[INFO ] 2026-06-02 18:37:15.196 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 18:37:15.196 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432289,ok=432289,error=0, records=41
[INFO ] 2026-06-02 18:37:21.342 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21616/300s
[WARN ] 2026-06-02 18:37:22.811 [23964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:37:25.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:37:30.201 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 18:37:30.201 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432290,ok=432290,error=0, records=41
[WARN ] 2026-06-02 18:37:37.816 [24554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:37:40.796 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:37:45.206 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 18:37:45.206 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432291,ok=432291,error=0, records=41
[WARN ] 2026-06-02 18:37:52.822 [24559] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:37:55.797 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:38:00.213 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 18:38:00.213 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432292,ok=432292,error=0, records=41
[INFO ] 2026-06-02 18:38:05.274 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21626/300s
[INFO ] 2026-06-02 18:38:07.076 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21626/300s
[WARN ] 2026-06-02 18:38:07.827 [24554] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:38:10.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:38:13.881 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21626/300s
[INFO ] 2026-06-02 18:38:15.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 18:38:15.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432293,ok=432293,error=0, records=41
[WARN ] 2026-06-02 18:38:22.832 [23969] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:38:25.798 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:38:30.224 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 18:38:30.224 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432294,ok=432294,error=0, records=41
[WARN ] 2026-06-02 18:38:37.838 [23949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:38:40.799 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:38:44.196 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834672},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:38:44.356 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:38:44.356 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 18:38:44.356 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:38:44.356 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:38:44.356 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:38:44.361 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:38:44.361 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:38:44.361 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:38:44.361 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:38:44.361 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:38:44.361 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:38:45.230 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 18:38:45.230 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432295,ok=432295,error=0, records=41
[WARN ] 2026-06-02 18:38:52.843 [23949] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:38:55.800 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.92MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:38:55.800 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 18:39:00.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 18:39:00.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432296,ok=432296,error=0, records=41
[WARN ] 2026-06-02 18:39:07.849 [24625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:39:10.801 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=24.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:39:15.241 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 18:39:15.241 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432297,ok=432297,error=0, records=41
[WARN ] 2026-06-02 18:39:22.855 [24625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:39:25.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:39:30.285 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 18:39:30.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432298,ok=432298,error=0, records=41
[WARN ] 2026-06-02 18:39:37.859 [24625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:39:40.802 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:39:45.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 18:39:45.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432299,ok=432299,error=0, records=41
[WARN ] 2026-06-02 18:39:52.863 [24615] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:39:55.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.66MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:40:00.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 18:40:00.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432300,ok=432300,error=0, records=41
[INFO ] 2026-06-02 18:40:02.318 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21630/300s
[WARN ] 2026-06-02 18:40:07.868 [24625] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:40:10.803 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:40:15.304 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10262, records=41
[INFO ] 2026-06-02 18:40:15.304 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432301,ok=432301,error=0, records=41
[WARN ] 2026-06-02 18:40:22.873 [24615] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:40:25.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:40:30.311 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 18:40:30.311 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432302,ok=432302,error=0, records=41
[WARN ] 2026-06-02 18:40:37.879 [24698] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:40:40.804 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:40:41.880 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21621/300s
[INFO ] 2026-06-02 18:40:43.830 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21630/300s
[INFO ] 2026-06-02 18:40:45.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 18:40:45.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432303,ok=432303,error=0, records=41
[WARN ] 2026-06-02 18:40:52.886 [24775] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:40:55.805 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:41:00.325 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 18:41:00.325 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432304,ok=432304,error=0, records=41
[WARN ] 2026-06-02 18:41:07.891 [24615] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:41:10.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=27.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:41:15.330 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 18:41:15.330 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432305,ok=432305,error=0, records=41
[WARN ] 2026-06-02 18:41:22.897 [24792] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:41:25.806 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=28.14MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:41:30.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 18:41:30.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432306,ok=432306,error=0, records=41
[INFO ] 2026-06-02 18:41:30.335 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21617/300s
[WARN ] 2026-06-02 18:41:37.903 [24822] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:41:40.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:41:44.356 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18008/300s
[INFO ] 2026-06-02 18:41:44.357 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834576},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:41:44.495 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:41:44.495 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 18:41:44.495 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:41:44.495 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:41:44.495 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:41:44.561 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:41:44.561 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:41:44.561 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:41:44.561 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:41:44.561 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:41:44.561 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:41:45.341 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 18:41:45.341 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432307,ok=432307,error=0, records=41
[WARN ] 2026-06-02 18:41:52.909 [24792] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:41:55.807 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:41:57.486 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21626/300s
[INFO ] 2026-06-02 18:42:00.346 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 18:42:00.346 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432308,ok=432308,error=0, records=41
[WARN ] 2026-06-02 18:42:07.916 [24839] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:42:10.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:42:10.808 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21629/300s
[INFO ] 2026-06-02 18:42:15.362 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 18:42:15.362 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432309,ok=432309,error=0, records=41
[INFO ] 2026-06-02 18:42:21.521 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21617/300s
[WARN ] 2026-06-02 18:42:22.922 [24792] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:42:25.808 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=29.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:42:30.366 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 18:42:30.366 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432310,ok=432310,error=0, records=41
[WARN ] 2026-06-02 18:42:37.928 [24878] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:42:40.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:42:45.373 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 18:42:45.373 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432311,ok=432311,error=0, records=41
[WARN ] 2026-06-02 18:42:52.934 [24895] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:42:55.809 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:43:00.392 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 18:43:00.392 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432312,ok=432312,error=0, records=41
[INFO ] 2026-06-02 18:43:05.297 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21627/300s
[INFO ] 2026-06-02 18:43:07.101 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21627/300s
[WARN ] 2026-06-02 18:43:07.940 [24918] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:43:10.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:43:13.903 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21627/300s
[INFO ] 2026-06-02 18:43:15.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 18:43:15.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432313,ok=432313,error=0, records=41
[WARN ] 2026-06-02 18:43:22.945 [24934] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:43:25.810 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:43:30.408 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 18:43:30.408 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432314,ok=432314,error=0, records=41
[WARN ] 2026-06-02 18:43:37.951 [24902] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:43:40.811 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 18:43:40.811 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 18:43:45.415 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 18:43:45.415 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432315,ok=432315,error=0, records=41
[WARN ] 2026-06-02 18:43:52.956 [24964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:43:55.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:44:00.421 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 18:44:00.421 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432316,ok=432316,error=0, records=41
[WARN ] 2026-06-02 18:44:07.962 [24902] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:44:10.812 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.35MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:44:15.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10268, records=41
[INFO ] 2026-06-02 18:44:15.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432317,ok=432317,error=0, records=41
[WARN ] 2026-06-02 18:44:22.967 [24945] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:44:25.813 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:44:30.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 18:44:30.434 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432318,ok=432318,error=0, records=41
[WARN ] 2026-06-02 18:44:37.972 [24945] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:44:40.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:44:44.497 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834496},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:44:44.646 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:44:44.646 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 18:44:44.646 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:44:44.646 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:44:44.646 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:44:44.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:44:44.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:44:44.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:44:44.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:44:44.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:44:44.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:44:45.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10170, records=41
[INFO ] 2026-06-02 18:44:45.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432319,ok=432319,error=0, records=41
[WARN ] 2026-06-02 18:44:52.979 [24991] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:44:55.814 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:45:00.443 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10195, records=41
[INFO ] 2026-06-02 18:45:00.443 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432320,ok=432320,error=0, records=41
[INFO ] 2026-06-02 18:45:02.321 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21631/300s
[WARN ] 2026-06-02 18:45:07.984 [24934] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:45:10.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:45:15.448 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 18:45:15.448 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432321,ok=432321,error=0, records=41
[WARN ] 2026-06-02 18:45:22.988 [24934] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:45:25.815 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:45:30.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 18:45:30.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432322,ok=432322,error=0, records=41
[WARN ] 2026-06-02 18:45:37.994 [25062] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:45:40.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:45:41.994 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21622/300s
[INFO ] 2026-06-02 18:45:43.836 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21631/300s
[INFO ] 2026-06-02 18:45:45.457 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10190, records=41
[INFO ] 2026-06-02 18:45:45.457 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432323,ok=432323,error=0, records=41
[WARN ] 2026-06-02 18:45:52.998 [24934] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:45:55.816 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:46:00.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10183, records=41
[INFO ] 2026-06-02 18:46:00.462 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432324,ok=432324,error=0, records=41
[WARN ] 2026-06-02 18:46:08.003 [25062] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:46:10.817 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:46:15.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 18:46:15.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432325,ok=432325,error=0, records=41
[WARN ] 2026-06-02 18:46:23.009 [24901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:46:25.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:46:30.475 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 18:46:30.475 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432326,ok=432326,error=0, records=41
[INFO ] 2026-06-02 18:46:30.475 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21618/300s
[WARN ] 2026-06-02 18:46:38.013 [25062] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:46:40.818 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:46:45.480 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 18:46:45.480 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432327,ok=432327,error=0, records=41
[WARN ] 2026-06-02 18:46:53.020 [24901] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:46:55.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:46:57.536 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21627/300s
[INFO ] 2026-06-02 18:47:00.490 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 18:47:00.490 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432328,ok=432328,error=0, records=41
[WARN ] 2026-06-02 18:47:08.025 [24964] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:47:10.819 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:47:10.819 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21630/300s
[INFO ] 2026-06-02 18:47:15.499 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 18:47:15.499 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432329,ok=432329,error=0, records=41
[INFO ] 2026-06-02 18:47:21.697 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21618/300s
[WARN ] 2026-06-02 18:47:23.030 [25103] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:47:25.820 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:47:30.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10179, records=41
[INFO ] 2026-06-02 18:47:30.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432330,ok=432330,error=0, records=41
[WARN ] 2026-06-02 18:47:38.035 [25172] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:47:40.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:47:44.646 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18009/300s
[INFO ] 2026-06-02 18:47:44.648 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834424},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:47:44.798 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:47:44.798 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 18:47:44.799 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:47:44.799 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:47:44.799 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:47:44.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:47:44.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:47:44.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:47:44.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:47:44.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:47:44.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:47:45.512 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-02 18:47:45.512 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432331,ok=432331,error=0, records=41
[WARN ] 2026-06-02 18:47:53.041 [25189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:47:55.821 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:48:00.517 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-02 18:48:00.517 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432332,ok=432332,error=0, records=41
[INFO ] 2026-06-02 18:48:05.339 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21628/300s
[INFO ] 2026-06-02 18:48:07.141 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21628/300s
[WARN ] 2026-06-02 18:48:08.046 [25177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:48:10.822 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:48:13.947 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21628/300s
[INFO ] 2026-06-02 18:48:15.524 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10270, records=41
[INFO ] 2026-06-02 18:48:15.524 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432333,ok=432333,error=0, records=41
[WARN ] 2026-06-02 18:48:23.051 [25177] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:48:25.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:48:30.529 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 18:48:30.529 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432334,ok=432334,error=0, records=41
[WARN ] 2026-06-02 18:48:37.557 [25239] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:48:40.823 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:48:45.535 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 18:48:45.535 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432335,ok=432335,error=0, records=41
[WARN ] 2026-06-02 18:48:52.562 [25239] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:48:55.824 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:49:00.540 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10233, records=41
[INFO ] 2026-06-02 18:49:00.540 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432336,ok=432336,error=0, records=41
[WARN ] 2026-06-02 18:49:07.567 [25272] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:49:10.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:49:15.546 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10377, records=41
[INFO ] 2026-06-02 18:49:15.546 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432337,ok=432337,error=0, records=41
[WARN ] 2026-06-02 18:49:22.573 [25292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:49:25.825 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:49:30.553 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10361, records=41
[INFO ] 2026-06-02 18:49:30.553 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432338,ok=432338,error=0, records=41
[WARN ] 2026-06-02 18:49:37.578 [25307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:49:40.826 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:49:45.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-02 18:49:45.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432339,ok=432339,error=0, records=41
[WARN ] 2026-06-02 18:49:52.582 [25313] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:49:55.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:50:00.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-02 18:50:00.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432340,ok=432340,error=0, records=41
[INFO ] 2026-06-02 18:50:02.324 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21632/300s
[WARN ] 2026-06-02 18:50:07.587 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:50:10.827 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:50:15.666 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 18:50:15.666 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432341,ok=432341,error=0, records=41
[WARN ] 2026-06-02 18:50:22.592 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:50:25.828 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:50:30.671 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 18:50:30.671 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432342,ok=432342,error=0, records=41
[WARN ] 2026-06-02 18:50:37.598 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:50:40.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:50:42.099 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21623/300s
[INFO ] 2026-06-02 18:50:43.842 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21632/300s
[INFO ] 2026-06-02 18:50:44.800 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834344},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:50:44.973 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:50:44.973 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"PING":[],"HTTP":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 18:50:44.973 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:50:44.973 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:50:44.973 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:50:45.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:50:45.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:50:45.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:50:45.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:50:45.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:50:45.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:50:45.677 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 18:50:45.677 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432343,ok=432343,error=0, records=41
[WARN ] 2026-06-02 18:50:52.602 [25362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:50:55.829 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:51:00.682 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 18:51:00.682 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432344,ok=432344,error=0, records=41
[WARN ] 2026-06-02 18:51:07.607 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:51:10.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:51:15.687 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 18:51:15.687 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432345,ok=432345,error=0, records=41
[WARN ] 2026-06-02 18:51:22.612 [25382] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:51:25.830 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:51:30.743 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 18:51:30.743 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432346,ok=432346,error=0, records=41
[INFO ] 2026-06-02 18:51:30.743 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21619/300s
[WARN ] 2026-06-02 18:51:37.617 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:51:40.831 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:51:45.751 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 18:51:45.751 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432347,ok=432347,error=0, records=41
[WARN ] 2026-06-02 18:51:52.622 [25362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:51:55.832 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:51:57.591 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21628/300s
[INFO ] 2026-06-02 18:52:00.756 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 18:52:00.756 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432348,ok=432348,error=0, records=41
[WARN ] 2026-06-02 18:52:07.627 [25307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:52:10.832 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:52:10.833 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21631/300s
[INFO ] 2026-06-02 18:52:15.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10246, records=41
[INFO ] 2026-06-02 18:52:15.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432349,ok=432349,error=0, records=41
[INFO ] 2026-06-02 18:52:21.882 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21619/300s
[WARN ] 2026-06-02 18:52:22.633 [25382] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:52:25.833 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:52:30.822 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 18:52:30.822 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432350,ok=432350,error=0, records=41
[WARN ] 2026-06-02 18:52:37.638 [25307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:52:40.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:52:45.829 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 18:52:45.829 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432351,ok=432351,error=0, records=41
[WARN ] 2026-06-02 18:52:52.643 [25356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:52:55.834 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:53:00.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 18:53:00.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432352,ok=432352,error=0, records=41
[INFO ] 2026-06-02 18:53:05.403 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21629/300s
[INFO ] 2026-06-02 18:53:07.205 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21629/300s
[WARN ] 2026-06-02 18:53:07.648 [25307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:53:10.835 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:53:14.011 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21629/300s
[INFO ] 2026-06-02 18:53:15.859 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 18:53:15.859 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432353,ok=432353,error=0, records=41
[WARN ] 2026-06-02 18:53:22.653 [25382] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:53:25.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:53:30.864 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 18:53:30.864 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432354,ok=432354,error=0, records=41
[WARN ] 2026-06-02 18:53:37.658 [25362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:53:40.836 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 18:53:40.836 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 18:53:44.974 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18010/300s
[INFO ] 2026-06-02 18:53:44.975 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834264},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:53:45.142 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:53:45.142 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 18:53:45.142 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:53:45.142 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:53:45.142 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:53:45.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:53:45.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:53:45.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:53:45.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:53:45.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:53:45.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:53:45.871 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 18:53:45.871 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432355,ok=432355,error=0, records=41
[WARN ] 2026-06-02 18:53:52.664 [25362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:53:55.837 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:53:55.837 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 18:54:00.877 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 18:54:00.877 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432356,ok=432356,error=0, records=41
[WARN ] 2026-06-02 18:54:07.670 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:54:10.838 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:54:15.883 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 18:54:15.883 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432357,ok=432357,error=0, records=41
[WARN ] 2026-06-02 18:54:22.674 [25307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:54:25.839 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:54:30.888 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 18:54:30.888 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432358,ok=432358,error=0, records=41
[WARN ] 2026-06-02 18:54:37.679 [25356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:54:40.840 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.82MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:54:45.894 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 18:54:45.894 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432359,ok=432359,error=0, records=41
[WARN ] 2026-06-02 18:54:52.683 [25307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:54:55.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:55:00.900 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 18:55:00.900 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432360,ok=432360,error=0, records=41
[INFO ] 2026-06-02 18:55:02.328 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21633/300s
[WARN ] 2026-06-02 18:55:07.688 [25382] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:55:10.841 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:55:15.906 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 18:55:15.906 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432361,ok=432361,error=0, records=41
[WARN ] 2026-06-02 18:55:22.692 [25356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:55:25.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:55:30.913 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 18:55:30.913 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432362,ok=432362,error=0, records=41
[WARN ] 2026-06-02 18:55:37.697 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:55:40.842 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:55:42.198 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21624/300s
[INFO ] 2026-06-02 18:55:43.849 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21633/300s
[INFO ] 2026-06-02 18:55:45.917 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 18:55:45.917 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432363,ok=432363,error=0, records=41
[WARN ] 2026-06-02 18:55:52.701 [25362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:55:55.843 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:56:00.923 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 18:56:00.923 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432364,ok=432364,error=0, records=41
[WARN ] 2026-06-02 18:56:07.706 [25362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:56:10.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:56:15.932 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 18:56:15.932 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432365,ok=432365,error=0, records=41
[WARN ] 2026-06-02 18:56:22.710 [25356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:56:25.844 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:56:30.937 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 18:56:30.937 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432366,ok=432366,error=0, records=41
[INFO ] 2026-06-02 18:56:30.937 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21620/300s
[WARN ] 2026-06-02 18:56:37.715 [25356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:56:40.845 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:56:45.144 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834192},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:56:45.441 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:56:45.441 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 18:56:45.441 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:56:45.441 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:56:45.441 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:56:45.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:56:45.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:56:45.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:56:45.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:56:45.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:56:45.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:56:45.942 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 18:56:45.943 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432367,ok=432367,error=0, records=41
[WARN ] 2026-06-02 18:56:52.720 [25362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:56:55.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:56:57.649 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21629/300s
[INFO ] 2026-06-02 18:57:00.948 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 18:57:00.948 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432368,ok=432368,error=0, records=41
[WARN ] 2026-06-02 18:57:07.724 [25356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:57:10.846 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:57:10.846 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21632/300s
[INFO ] 2026-06-02 18:57:15.954 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 18:57:15.954 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432369,ok=432369,error=0, records=41
[INFO ] 2026-06-02 18:57:22.066 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21620/300s
[WARN ] 2026-06-02 18:57:22.730 [25362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:57:25.847 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:57:30.961 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 18:57:30.961 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432370,ok=432370,error=0, records=41
[WARN ] 2026-06-02 18:57:37.736 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:57:40.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:57:45.965 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 18:57:45.965 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432371,ok=432371,error=0, records=41
[WARN ] 2026-06-02 18:57:52.740 [25356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:57:55.848 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:58:00.970 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 18:58:00.970 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432372,ok=432372,error=0, records=41
[INFO ] 2026-06-02 18:58:05.468 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21630/300s
[INFO ] 2026-06-02 18:58:07.270 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21630/300s
[WARN ] 2026-06-02 18:58:07.744 [25307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:58:10.849 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:58:14.075 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21630/300s
[INFO ] 2026-06-02 18:58:15.977 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 18:58:15.977 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432373,ok=432373,error=0, records=41
[WARN ] 2026-06-02 18:58:22.749 [25356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:58:25.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:58:30.986 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 18:58:30.986 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432374,ok=432374,error=0, records=41
[WARN ] 2026-06-02 18:58:37.754 [25362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:58:40.850 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:58:45.991 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 18:58:45.992 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432375,ok=432375,error=0, records=41
[WARN ] 2026-06-02 18:58:52.758 [25382] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:58:55.851 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:59:00.997 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 18:59:00.997 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432376,ok=432376,error=0, records=41
[WARN ] 2026-06-02 18:59:07.763 [25362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:59:10.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:59:16.001 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 18:59:16.002 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432377,ok=432377,error=0, records=41
[WARN ] 2026-06-02 18:59:22.768 [25356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:59:25.852 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:59:31.006 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 18:59:31.006 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432378,ok=432378,error=0, records=41
[WARN ] 2026-06-02 18:59:37.772 [25307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:59:40.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 18:59:45.441 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18011/300s
[INFO ] 2026-06-02 18:59:45.443 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834116},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 18:59:45.606 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 18:59:45.606 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 18:59:45.606 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 18:59:45.606 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 18:59:45.606 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 18:59:45.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:59:45.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 18:59:45.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:59:45.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 18:59:45.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:59:45.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 18:59:46.012 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 18:59:46.012 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432379,ok=432379,error=0, records=41
[WARN ] 2026-06-02 18:59:52.778 [25362] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 18:59:55.853 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:00:01.142 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10196, records=41
[INFO ] 2026-06-02 19:00:01.143 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432380,ok=432380,error=0, records=41
[INFO ] 2026-06-02 19:00:02.332 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21634/300s
[WARN ] 2026-06-02 19:00:07.784 [25382] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:00:10.854 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:00:16.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 19:00:16.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432381,ok=432381,error=0, records=41
[WARN ] 2026-06-02 19:00:22.789 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:00:25.855 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:00:31.153 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 19:00:31.153 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432382,ok=432382,error=0, records=41
[WARN ] 2026-06-02 19:00:37.794 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:00:40.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:00:42.296 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21625/300s
[INFO ] 2026-06-02 19:00:43.856 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21634/300s
[INFO ] 2026-06-02 19:00:46.159 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 19:00:46.159 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432383,ok=432383,error=0, records=41
[WARN ] 2026-06-02 19:00:52.800 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:00:55.856 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:01:01.165 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 19:01:01.165 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432384,ok=432384,error=0, records=41
[WARN ] 2026-06-02 19:01:07.804 [25335] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:01:10.857 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:01:16.173 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 19:01:16.173 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432385,ok=432385,error=0, records=41
[WARN ] 2026-06-02 19:01:22.810 [25946] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:01:25.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:01:31.180 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 19:01:31.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432386,ok=432386,error=0, records=41
[INFO ] 2026-06-02 19:01:31.181 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21621/300s
[WARN ] 2026-06-02 19:01:37.815 [25946] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:01:40.858 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:01:46.238 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10182, records=41
[INFO ] 2026-06-02 19:01:46.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432387,ok=432387,error=0, records=41
[WARN ] 2026-06-02 19:01:52.820 [25981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:01:55.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:01:57.704 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21630/300s
[INFO ] 2026-06-02 19:02:01.244 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 19:02:01.244 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432388,ok=432388,error=0, records=41
[WARN ] 2026-06-02 19:02:07.825 [26009] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:02:10.859 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:02:10.860 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21633/300s
[INFO ] 2026-06-02 19:02:16.250 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-02 19:02:16.250 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432389,ok=432389,error=0, records=41
[INFO ] 2026-06-02 19:02:22.251 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21621/300s
[WARN ] 2026-06-02 19:02:22.832 [26023] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:02:25.860 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:02:31.257 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 19:02:31.257 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432390,ok=432390,error=0, records=41
[WARN ] 2026-06-02 19:02:37.837 [26009] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:02:40.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.62MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:02:45.608 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20834028},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:02:45.752 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:02:45.752 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 19:02:45.752 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:02:45.752 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:02:45.752 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:02:45.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:02:45.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:02:45.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:02:45.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:02:45.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:02:45.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:02:46.265 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-02 19:02:46.265 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432391,ok=432391,error=0, records=41
[WARN ] 2026-06-02 19:02:52.843 [25981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:02:55.861 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.79MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:03:01.274 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10358, records=41
[INFO ] 2026-06-02 19:03:01.274 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432392,ok=432392,error=0, records=41
[INFO ] 2026-06-02 19:03:05.533 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21631/300s
[INFO ] 2026-06-02 19:03:07.335 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21631/300s
[WARN ] 2026-06-02 19:03:07.848 [25981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:03:10.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:03:14.138 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21631/300s
[INFO ] 2026-06-02 19:03:16.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10296, records=41
[INFO ] 2026-06-02 19:03:16.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432393,ok=432393,error=0, records=41
[WARN ] 2026-06-02 19:03:22.853 [26009] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:03:25.862 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:03:31.287 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-02 19:03:31.287 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432394,ok=432394,error=0, records=41
[WARN ] 2026-06-02 19:03:37.858 [26009] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:03:40.863 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 19:03:40.863 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 19:03:46.293 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10301, records=41
[INFO ] 2026-06-02 19:03:46.293 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432395,ok=432395,error=0, records=41
[WARN ] 2026-06-02 19:03:52.862 [26087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:03:55.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:04:01.310 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 19:04:01.310 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432396,ok=432396,error=0, records=41
[WARN ] 2026-06-02 19:04:07.867 [26087] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:04:10.864 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:04:16.316 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 19:04:16.316 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432397,ok=432397,error=0, records=41
[WARN ] 2026-06-02 19:04:22.872 [25976] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:04:25.865 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:04:31.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 19:04:31.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432398,ok=432398,error=0, records=41
[WARN ] 2026-06-02 19:04:37.877 [26144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:04:40.866 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.81MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:04:46.329 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 19:04:46.329 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432399,ok=432399,error=0, records=41
[WARN ] 2026-06-02 19:04:52.883 [26165] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:04:55.866 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:05:01.334 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10232, records=41
[INFO ] 2026-06-02 19:05:01.334 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432400,ok=432400,error=0, records=41
[INFO ] 2026-06-02 19:05:02.338 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21635/300s
[WARN ] 2026-06-02 19:05:07.888 [26115] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:05:10.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:05:16.339 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 19:05:16.339 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432401,ok=432401,error=0, records=41
[WARN ] 2026-06-02 19:05:22.894 [26189] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:05:25.867 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.57MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:05:31.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10256, records=41
[INFO ] 2026-06-02 19:05:31.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432402,ok=432402,error=0, records=41
[WARN ] 2026-06-02 19:05:37.902 [26173] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:05:40.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.32MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:05:42.403 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21626/300s
[INFO ] 2026-06-02 19:05:43.862 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21635/300s
[INFO ] 2026-06-02 19:05:45.753 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18012/300s
[INFO ] 2026-06-02 19:05:45.754 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833944},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:05:45.952 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:05:45.952 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 19:05:45.953 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:05:45.953 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:05:45.953 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:05:45.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:05:45.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:05:45.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:05:45.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:05:45.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:05:45.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:05:46.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10258, records=41
[INFO ] 2026-06-02 19:05:46.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432403,ok=432403,error=0, records=41
[WARN ] 2026-06-02 19:05:52.907 [26129] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:05:55.868 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:06:01.359 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 19:06:01.359 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432404,ok=432404,error=0, records=41
[WARN ] 2026-06-02 19:06:07.914 [26215] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:06:10.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:06:16.363 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10267, records=41
[INFO ] 2026-06-02 19:06:16.363 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432405,ok=432405,error=0, records=41
[WARN ] 2026-06-02 19:06:22.919 [26259] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:06:25.869 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:06:31.369 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 19:06:31.369 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432406,ok=432406,error=0, records=41
[INFO ] 2026-06-02 19:06:31.369 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21622/300s
[WARN ] 2026-06-02 19:06:37.925 [26260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:06:40.870 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:06:46.375 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 19:06:46.375 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432407,ok=432407,error=0, records=41
[WARN ] 2026-06-02 19:06:52.930 [26279] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:06:55.871 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:06:57.759 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21631/300s
[INFO ] 2026-06-02 19:07:01.381 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 19:07:01.382 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432408,ok=432408,error=0, records=41
[WARN ] 2026-06-02 19:07:07.935 [26307] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:07:10.871 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:07:10.871 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21634/300s
[INFO ] 2026-06-02 19:07:16.387 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 19:07:16.387 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432409,ok=432409,error=0, records=41
[INFO ] 2026-06-02 19:07:22.432 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21622/300s
[WARN ] 2026-06-02 19:07:22.940 [26260] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:07:25.872 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:07:31.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 19:07:31.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432410,ok=432410,error=0, records=41
[WARN ] 2026-06-02 19:07:37.945 [26346] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:07:40.873 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:07:46.401 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 19:07:46.401 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432411,ok=432411,error=0, records=41
[WARN ] 2026-06-02 19:07:52.952 [26341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:07:55.873 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:08:01.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 19:08:01.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432412,ok=432412,error=0, records=41
[INFO ] 2026-06-02 19:08:05.582 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21632/300s
[INFO ] 2026-06-02 19:08:07.383 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21632/300s
[WARN ] 2026-06-02 19:08:07.956 [26341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:08:10.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:08:14.188 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21632/300s
[INFO ] 2026-06-02 19:08:16.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 19:08:16.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432413,ok=432413,error=0, records=41
[WARN ] 2026-06-02 19:08:22.961 [26384] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:08:25.874 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:08:31.420 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 19:08:31.420 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432414,ok=432414,error=0, records=41
[WARN ] 2026-06-02 19:08:37.965 [26398] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:08:40.875 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:08:45.954 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833868},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:08:46.107 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:08:46.107 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 19:08:46.108 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:08:46.108 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:08:46.108 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:08:46.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:08:46.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:08:46.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:08:46.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:08:46.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:08:46.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:08:46.426 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10214, records=41
[INFO ] 2026-06-02 19:08:46.426 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432415,ok=432415,error=0, records=41
[WARN ] 2026-06-02 19:08:52.971 [26370] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:08:55.876 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:08:55.876 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 19:09:01.432 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 19:09:01.432 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432416,ok=432416,error=0, records=41
[WARN ] 2026-06-02 19:09:07.975 [26308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:09:10.877 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.31MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:09:16.437 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 19:09:16.437 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432417,ok=432417,error=0, records=41
[WARN ] 2026-06-02 19:09:22.981 [26356] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:09:25.878 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.84MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:09:31.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10184, records=41
[INFO ] 2026-06-02 19:09:31.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432418,ok=432418,error=0, records=41
[WARN ] 2026-06-02 19:09:37.986 [26308] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:09:40.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:09:46.449 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 19:09:46.449 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432419,ok=432419,error=0, records=41
[WARN ] 2026-06-02 19:09:52.991 [26426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:09:55.879 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:10:01.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 19:10:01.456 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432420,ok=432420,error=0, records=41
[INFO ] 2026-06-02 19:10:02.342 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21636/300s
[WARN ] 2026-06-02 19:10:07.995 [26455] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:10:10.880 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:10:16.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10351, records=41
[INFO ] 2026-06-02 19:10:16.462 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432421,ok=432421,error=0, records=41
[WARN ] 2026-06-02 19:10:22.999 [26502] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:10:25.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.61MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:10:31.470 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 19:10:31.470 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432422,ok=432422,error=0, records=41
[WARN ] 2026-06-02 19:10:38.004 [26488] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:10:40.881 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.86MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:10:42.506 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21627/300s
[INFO ] 2026-06-02 19:10:43.868 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21636/300s
[INFO ] 2026-06-02 19:10:46.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10335, records=41
[INFO ] 2026-06-02 19:10:46.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432423,ok=432423,error=0, records=41
[WARN ] 2026-06-02 19:10:53.011 [26502] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:10:55.882 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:11:01.482 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10330, records=41
[INFO ] 2026-06-02 19:11:01.483 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432424,ok=432424,error=0, records=41
[WARN ] 2026-06-02 19:11:08.018 [26426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:11:10.882 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.12MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:11:16.489 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 19:11:16.489 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432425,ok=432425,error=0, records=41
[WARN ] 2026-06-02 19:11:23.023 [26426] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:11:25.883 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.37MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:11:31.494 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10240, records=41
[INFO ] 2026-06-02 19:11:31.494 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432426,ok=432426,error=0, records=41
[INFO ] 2026-06-02 19:11:31.494 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21623/300s
[WARN ] 2026-06-02 19:11:38.030 [26571] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:11:40.884 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:11:46.108 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18013/300s
[INFO ] 2026-06-02 19:11:46.109 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833792},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:11:46.258 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:11:46.258 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 19:11:46.259 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:11:46.259 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:11:46.259 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:11:46.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:11:46.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:11:46.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:11:46.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:11:46.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:11:46.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:11:46.503 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 19:11:46.503 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432427,ok=432427,error=0, records=41
[WARN ] 2026-06-02 19:11:53.034 [26502] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:11:55.884 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.27MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:11:57.815 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21632/300s
[INFO ] 2026-06-02 19:12:01.508 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 19:12:01.508 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432428,ok=432428,error=0, records=41
[WARN ] 2026-06-02 19:12:08.040 [26591] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:12:10.885 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.02MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:12:10.885 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21635/300s
[INFO ] 2026-06-02 19:12:16.513 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 19:12:16.514 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432429,ok=432429,error=0, records=41
[INFO ] 2026-06-02 19:12:22.613 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21623/300s
[WARN ] 2026-06-02 19:12:23.046 [26603] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:12:25.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:12:31.520 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 19:12:31.520 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432430,ok=432430,error=0, records=41
[WARN ] 2026-06-02 19:12:38.051 [26642] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:12:40.886 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:12:46.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 19:12:46.526 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432431,ok=432431,error=0, records=41
[WARN ] 2026-06-02 19:12:52.556 [26619] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:12:55.887 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.33MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:13:01.539 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 19:13:01.539 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432432,ok=432432,error=0, records=41
[INFO ] 2026-06-02 19:13:05.647 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21633/300s
[INFO ] 2026-06-02 19:13:07.449 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21633/300s
[WARN ] 2026-06-02 19:13:07.561 [26676] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:13:10.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:13:14.255 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21633/300s
[INFO ] 2026-06-02 19:13:16.544 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10177, records=41
[INFO ] 2026-06-02 19:13:16.544 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432433,ok=432433,error=0, records=41
[WARN ] 2026-06-02 19:13:22.566 [26676] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:13:25.888 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:13:31.551 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10158, records=41
[INFO ] 2026-06-02 19:13:31.551 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432434,ok=432434,error=0, records=41
[WARN ] 2026-06-02 19:13:37.571 [26676] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:13:40.889 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 19:13:40.889 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 19:13:46.558 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 19:13:46.558 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432435,ok=432435,error=0, records=41
[WARN ] 2026-06-02 19:13:52.577 [26719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:13:55.890 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:14:01.569 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10157, records=41
[INFO ] 2026-06-02 19:14:01.569 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432436,ok=432436,error=0, records=41
[WARN ] 2026-06-02 19:14:07.582 [26658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:14:10.890 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:14:16.575 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 19:14:16.575 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432437,ok=432437,error=0, records=41
[WARN ] 2026-06-02 19:14:22.588 [26658] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:14:25.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:14:31.581 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10140, records=41
[INFO ] 2026-06-02 19:14:31.582 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432438,ok=432438,error=0, records=41
[WARN ] 2026-06-02 19:14:37.594 [26747] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:14:40.891 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:14:46.260 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833720},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:14:46.420 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:14:46.420 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"HTTP":[],"TELNET":[]}
[INFO ] 2026-06-02 19:14:46.420 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:14:46.420 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:14:46.420 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:14:46.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:14:46.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:14:46.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:14:46.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:14:46.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:14:46.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:14:46.590 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10176, records=41
[INFO ] 2026-06-02 19:14:46.590 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432439,ok=432439,error=0, records=41
[WARN ] 2026-06-02 19:14:52.599 [26787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:14:55.892 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:15:01.595 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10169, records=41
[INFO ] 2026-06-02 19:15:01.595 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432440,ok=432440,error=0, records=41
[INFO ] 2026-06-02 19:15:02.345 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21637/300s
[WARN ] 2026-06-02 19:15:07.604 [26787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:15:10.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:15:16.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 19:15:16.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432441,ok=432441,error=0, records=41
[WARN ] 2026-06-02 19:15:22.610 [26760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:15:25.893 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:15:31.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10200, records=41
[INFO ] 2026-06-02 19:15:31.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432442,ok=432442,error=0, records=41
[WARN ] 2026-06-02 19:15:37.615 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:15:40.894 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:15:42.617 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21628/300s
[INFO ] 2026-06-02 19:15:43.874 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21637/300s
[INFO ] 2026-06-02 19:15:46.613 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 19:15:46.613 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432443,ok=432443,error=0, records=41
[WARN ] 2026-06-02 19:15:52.621 [26771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:15:55.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:16:01.621 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 19:16:01.621 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432444,ok=432444,error=0, records=41
[WARN ] 2026-06-02 19:16:07.625 [26771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:16:10.895 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:16:16.627 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10249, records=41
[INFO ] 2026-06-02 19:16:16.627 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432445,ok=432445,error=0, records=41
[WARN ] 2026-06-02 19:16:22.629 [26787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:16:25.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:16:31.634 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 19:16:31.634 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432446,ok=432446,error=0, records=41
[INFO ] 2026-06-02 19:16:31.634 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21624/300s
[WARN ] 2026-06-02 19:16:37.635 [26787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:16:40.896 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:16:46.643 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 19:16:46.643 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432447,ok=432447,error=0, records=41
[WARN ] 2026-06-02 19:16:52.640 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:16:55.897 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:16:57.872 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21633/300s
[INFO ] 2026-06-02 19:17:01.649 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10221, records=41
[INFO ] 2026-06-02 19:17:01.649 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432448,ok=432448,error=0, records=41
[WARN ] 2026-06-02 19:17:07.645 [26771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:17:10.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:17:10.898 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21636/300s
[INFO ] 2026-06-02 19:17:16.656 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10294, records=41
[INFO ] 2026-06-02 19:17:16.656 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432449,ok=432449,error=0, records=41
[WARN ] 2026-06-02 19:17:22.650 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:17:22.798 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21624/300s
[INFO ] 2026-06-02 19:17:25.898 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:17:31.661 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 19:17:31.661 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432450,ok=432450,error=0, records=41
[WARN ] 2026-06-02 19:17:37.654 [26787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:17:40.899 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:17:46.420 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18014/300s
[INFO ] 2026-06-02 19:17:46.421 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833628},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:17:46.593 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:17:46.593 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 19:17:46.593 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:17:46.593 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:17:46.593 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:17:46.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:17:46.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:17:46.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:17:46.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:17:46.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:17:46.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:17:46.668 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 19:17:46.668 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432451,ok=432451,error=0, records=41
[WARN ] 2026-06-02 19:17:52.660 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:17:55.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:18:01.676 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10283, records=41
[INFO ] 2026-06-02 19:18:01.676 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432452,ok=432452,error=0, records=41
[INFO ] 2026-06-02 19:18:05.703 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21634/300s
[INFO ] 2026-06-02 19:18:07.504 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21634/300s
[WARN ] 2026-06-02 19:18:07.666 [26760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:18:10.900 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:18:14.309 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21634/300s
[INFO ] 2026-06-02 19:18:16.681 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10355, records=41
[INFO ] 2026-06-02 19:18:16.682 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432453,ok=432453,error=0, records=41
[WARN ] 2026-06-02 19:18:22.672 [26787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:18:25.901 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:18:31.686 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-02 19:18:31.686 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432454,ok=432454,error=0, records=41
[WARN ] 2026-06-02 19:18:37.678 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:18:40.901 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:18:46.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10322, records=41
[INFO ] 2026-06-02 19:18:46.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432455,ok=432455,error=0, records=41
[WARN ] 2026-06-02 19:18:52.684 [26719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:18:55.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:19:01.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 19:19:01.695 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432456,ok=432456,error=0, records=41
[WARN ] 2026-06-02 19:19:07.690 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:19:10.902 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:19:16.700 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10159, records=41
[INFO ] 2026-06-02 19:19:16.700 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432457,ok=432457,error=0, records=41
[WARN ] 2026-06-02 19:19:22.696 [26787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:19:25.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:19:31.705 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10133, records=41
[INFO ] 2026-06-02 19:19:31.705 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432458,ok=432458,error=0, records=41
[WARN ] 2026-06-02 19:19:37.703 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:19:40.903 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:19:46.711 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10127, records=41
[INFO ] 2026-06-02 19:19:46.711 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432459,ok=432459,error=0, records=41
[WARN ] 2026-06-02 19:19:52.708 [26760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:19:55.904 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:20:01.715 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10141, records=41
[INFO ] 2026-06-02 19:20:01.715 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432460,ok=432460,error=0, records=41
[INFO ] 2026-06-02 19:20:02.348 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21638/300s
[WARN ] 2026-06-02 19:20:07.714 [26719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:20:10.904 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:20:16.720 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-02 19:20:16.720 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432461,ok=432461,error=0, records=41
[WARN ] 2026-06-02 19:20:22.719 [26719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:20:25.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:20:31.747 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10288, records=41
[INFO ] 2026-06-02 19:20:31.747 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432462,ok=432462,error=0, records=41
[WARN ] 2026-06-02 19:20:37.725 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:20:40.905 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:20:42.726 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21629/300s
[INFO ] 2026-06-02 19:20:43.880 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21638/300s
[INFO ] 2026-06-02 19:20:46.594 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833492},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:20:46.753 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10259, records=41
[INFO ] 2026-06-02 19:20:46.753 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432463,ok=432463,error=0, records=41
[INFO ] 2026-06-02 19:20:46.765 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:20:46.765 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 19:20:46.765 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:20:46.765 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:20:46.765 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:20:46.861 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:20:46.861 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:20:46.861 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:20:46.861 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:20:46.861 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:20:46.861 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:20:52.731 [26760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:20:55.906 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:21:01.758 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 19:21:01.758 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432464,ok=432464,error=0, records=41
[WARN ] 2026-06-02 19:21:07.736 [26760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:21:10.906 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:21:16.763 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10145, records=41
[INFO ] 2026-06-02 19:21:16.763 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432465,ok=432465,error=0, records=41
[WARN ] 2026-06-02 19:21:22.741 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:21:25.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:21:31.767 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10118, records=41
[INFO ] 2026-06-02 19:21:31.767 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432466,ok=432466,error=0, records=41
[INFO ] 2026-06-02 19:21:31.767 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21625/300s
[WARN ] 2026-06-02 19:21:37.747 [26719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:21:40.907 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:21:46.773 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10128, records=41
[INFO ] 2026-06-02 19:21:46.773 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432467,ok=432467,error=0, records=41
[WARN ] 2026-06-02 19:21:52.752 [26719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:21:55.908 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:21:57.915 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21634/300s
[INFO ] 2026-06-02 19:22:01.778 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10099, records=41
[INFO ] 2026-06-02 19:22:01.778 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432468,ok=432468,error=0, records=41
[WARN ] 2026-06-02 19:22:07.758 [26787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:22:10.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:22:10.909 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21637/300s
[INFO ] 2026-06-02 19:22:16.784 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 19:22:16.784 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432469,ok=432469,error=0, records=41
[WARN ] 2026-06-02 19:22:22.763 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:22:22.965 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21625/300s
[INFO ] 2026-06-02 19:22:25.909 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:22:31.790 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-02 19:22:31.790 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432470,ok=432470,error=0, records=41
[WARN ] 2026-06-02 19:22:37.769 [26760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:22:40.910 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:22:46.797 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10161, records=41
[INFO ] 2026-06-02 19:22:46.797 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432471,ok=432471,error=0, records=41
[WARN ] 2026-06-02 19:22:52.775 [26771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:22:55.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:23:01.803 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10174, records=41
[INFO ] 2026-06-02 19:23:01.803 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432472,ok=432472,error=0, records=41
[INFO ] 2026-06-02 19:23:05.708 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21635/300s
[INFO ] 2026-06-02 19:23:07.512 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21635/300s
[WARN ] 2026-06-02 19:23:07.780 [26771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:23:10.911 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:23:14.316 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21635/300s
[INFO ] 2026-06-02 19:23:16.808 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10254, records=41
[INFO ] 2026-06-02 19:23:16.808 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432473,ok=432473,error=0, records=41
[WARN ] 2026-06-02 19:23:22.785 [26771] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:23:25.912 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:23:31.814 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 19:23:31.814 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432474,ok=432474,error=0, records=41
[WARN ] 2026-06-02 19:23:37.790 [26719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:23:40.912 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 19:23:40.913 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 19:23:46.765 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18015/300s
[INFO ] 2026-06-02 19:23:46.767 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833400},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:23:46.819 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 19:23:46.819 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432475,ok=432475,error=0, records=41
[INFO ] 2026-06-02 19:23:46.920 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:23:46.920 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 19:23:46.920 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:23:46.920 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:23:46.920 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:23:46.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:23:46.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:23:46.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:23:46.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:23:46.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:23:46.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:23:52.795 [26719] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:23:55.913 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.34MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:23:55.913 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 19:24:01.830 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 19:24:01.830 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432476,ok=432476,error=0, records=41
[WARN ] 2026-06-02 19:24:07.800 [26787] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:24:10.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:24:16.835 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 19:24:16.835 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432477,ok=432477,error=0, records=41
[WARN ] 2026-06-02 19:24:22.806 [27292] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:24:25.915 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.30MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:24:31.841 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10261, records=41
[INFO ] 2026-06-02 19:24:31.841 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432478,ok=432478,error=0, records=41
[WARN ] 2026-06-02 19:24:37.812 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:24:40.916 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:24:46.847 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 19:24:46.847 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432479,ok=432479,error=0, records=41
[WARN ] 2026-06-02 19:24:52.817 [26759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:24:55.916 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=26.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:25:01.853 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 19:25:01.853 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432480,ok=432480,error=0, records=41
[INFO ] 2026-06-02 19:25:02.351 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21639/300s
[WARN ] 2026-06-02 19:25:07.823 [27341] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:25:10.917 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.77MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:25:16.859 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10313, records=41
[INFO ] 2026-06-02 19:25:16.859 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432481,ok=432481,error=0, records=41
[WARN ] 2026-06-02 19:25:22.828 [26760] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:25:25.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.28MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:25:31.864 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10281, records=41
[INFO ] 2026-06-02 19:25:31.864 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432482,ok=432482,error=0, records=41
[WARN ] 2026-06-02 19:25:37.832 [27355] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:25:40.918 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.53MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:25:42.834 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21630/300s
[INFO ] 2026-06-02 19:25:43.886 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21639/300s
[INFO ] 2026-06-02 19:25:46.871 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-06-02 19:25:46.872 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432483,ok=432483,error=0, records=41
[WARN ] 2026-06-02 19:25:52.838 [27355] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:25:55.919 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:26:01.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10282, records=41
[INFO ] 2026-06-02 19:26:01.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432484,ok=432484,error=0, records=41
[WARN ] 2026-06-02 19:26:07.844 [27383] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:26:10.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.80MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:26:16.886 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 19:26:16.886 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432485,ok=432485,error=0, records=41
[WARN ] 2026-06-02 19:26:22.849 [27405] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:26:25.920 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:26:31.893 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 19:26:31.893 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432486,ok=432486,error=0, records=41
[INFO ] 2026-06-02 19:26:31.893 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21626/300s
[WARN ] 2026-06-02 19:26:37.853 [27355] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:26:40.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.06MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:26:46.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 19:26:46.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432487,ok=432487,error=0, records=41
[INFO ] 2026-06-02 19:26:46.922 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833320},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:26:47.085 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:26:47.085 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"PING":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 19:26:47.086 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:26:47.086 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:26:47.086 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:26:47.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:26:47.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:26:47.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:26:47.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:26:47.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:26:47.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:26:52.859 [27355] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:26:55.921 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:26:57.968 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21635/300s
[INFO ] 2026-06-02 19:27:01.903 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10231, records=41
[INFO ] 2026-06-02 19:27:01.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432488,ok=432488,error=0, records=41
[WARN ] 2026-06-02 19:27:07.864 [27355] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:27:10.922 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:27:10.922 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21638/300s
[INFO ] 2026-06-02 19:27:16.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-02 19:27:16.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432489,ok=432489,error=0, records=41
[WARN ] 2026-06-02 19:27:22.868 [27434] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:27:23.146 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21626/300s
[INFO ] 2026-06-02 19:27:25.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:27:31.915 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10188, records=41
[INFO ] 2026-06-02 19:27:31.915 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432490,ok=432490,error=0, records=41
[WARN ] 2026-06-02 19:27:37.873 [27434] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:27:40.923 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.47MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:27:46.921 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10175, records=41
[INFO ] 2026-06-02 19:27:46.921 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432491,ok=432491,error=0, records=41
[WARN ] 2026-06-02 19:27:47.378 [27495] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17443/stat), No such file or directory
[WARN ] 2026-06-02 19:27:52.879 [27501] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:27:55.924 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:28:01.926 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 19:28:01.927 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432492,ok=432492,error=0, records=41
[INFO ] 2026-06-02 19:28:05.759 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21636/300s
[INFO ] 2026-06-02 19:28:07.560 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21636/300s
[WARN ] 2026-06-02 19:28:07.884 [27518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:28:10.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:28:14.365 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21636/300s
[INFO ] 2026-06-02 19:28:17.012 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 19:28:17.012 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432493,ok=432493,error=0, records=41
[WARN ] 2026-06-02 19:28:22.889 [27533] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:28:25.925 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=29.48MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:28:32.020 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 19:28:32.020 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432494,ok=432494,error=0, records=41
[WARN ] 2026-06-02 19:28:37.894 [27518] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:28:40.926 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:28:47.024 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10152, records=41
[INFO ] 2026-06-02 19:28:47.025 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432495,ok=432495,error=0, records=41
[WARN ] 2026-06-02 19:28:52.900 [27566] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:28:55.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:29:02.032 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10163, records=41
[INFO ] 2026-06-02 19:29:02.033 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432496,ok=432496,error=0, records=41
[WARN ] 2026-06-02 19:29:07.906 [27577] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:29:10.927 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.73MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:29:17.039 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 19:29:17.039 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432497,ok=432497,error=0, records=41
[WARN ] 2026-06-02 19:29:22.913 [27583] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:29:25.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:29:32.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 19:29:32.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432498,ok=432498,error=0, records=41
[WARN ] 2026-06-02 19:29:37.918 [27495] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:29:40.928 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=29.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:29:47.050 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 19:29:47.050 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432499,ok=432499,error=0, records=41
[INFO ] 2026-06-02 19:29:47.086 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18016/300s
[INFO ] 2026-06-02 19:29:47.088 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833240},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:29:47.256 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:29:47.256 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 19:29:47.256 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:29:47.256 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:29:47.256 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:29:47.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:29:47.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:29:47.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:29:47.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:29:47.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:29:47.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:29:52.924 [27583] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:29:55.929 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:30:02.062 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 19:30:02.062 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432500,ok=432500,error=0, records=41
[INFO ] 2026-06-02 19:30:02.355 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21640/300s
[WARN ] 2026-06-02 19:30:07.929 [27495] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:30:10.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:30:17.068 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10291, records=41
[INFO ] 2026-06-02 19:30:17.068 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432501,ok=432501,error=0, records=41
[WARN ] 2026-06-02 19:30:17.435 [27626] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17459/stat), No such file or directory
[WARN ] 2026-06-02 19:30:22.935 [27657] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:30:25.930 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:30:32.074 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10133, records=41
[INFO ] 2026-06-02 19:30:32.074 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432502,ok=432502,error=0, records=41
[WARN ] 2026-06-02 19:30:32.440 [27637] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17459/stat), No such file or directory
[WARN ] 2026-06-02 19:30:37.940 [27668] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:30:40.931 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:30:42.942 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21631/300s
[INFO ] 2026-06-02 19:30:43.892 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21640/300s
[INFO ] 2026-06-02 19:30:47.079 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10167, records=41
[INFO ] 2026-06-02 19:30:47.079 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432503,ok=432503,error=0, records=41
[WARN ] 2026-06-02 19:30:47.446 [27694] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/17459/stat), No such file or directory
[WARN ] 2026-06-02 19:30:52.946 [27668] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:30:55.931 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.24MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:31:02.085 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10153, records=41
[INFO ] 2026-06-02 19:31:02.085 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432504,ok=432504,error=0, records=41
[WARN ] 2026-06-02 19:31:07.951 [27595] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:31:10.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:31:17.181 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10128, records=41
[INFO ] 2026-06-02 19:31:17.181 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432505,ok=432505,error=0, records=41
[WARN ] 2026-06-02 19:31:22.956 [27685] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:31:25.932 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:31:32.188 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10091, records=41
[INFO ] 2026-06-02 19:31:32.188 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432506,ok=432506,error=0, records=41
[INFO ] 2026-06-02 19:31:32.188 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21627/300s
[WARN ] 2026-06-02 19:31:37.960 [27678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:31:40.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:31:47.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10082, records=41
[INFO ] 2026-06-02 19:31:47.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432507,ok=432507,error=0, records=41
[WARN ] 2026-06-02 19:31:52.966 [27678] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:31:55.933 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:31:58.020 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21636/300s
[INFO ] 2026-06-02 19:32:02.216 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10083, records=41
[INFO ] 2026-06-02 19:32:02.216 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432508,ok=432508,error=0, records=41
[WARN ] 2026-06-02 19:32:07.970 [27685] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:32:10.934 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:32:10.934 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21639/300s
[INFO ] 2026-06-02 19:32:17.221 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 19:32:17.221 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432509,ok=432509,error=0, records=41
[WARN ] 2026-06-02 19:32:22.975 [27595] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:32:23.325 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21627/300s
[INFO ] 2026-06-02 19:32:25.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:32:32.227 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10319, records=41
[INFO ] 2026-06-02 19:32:32.227 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432510,ok=432510,error=0, records=41
[WARN ] 2026-06-02 19:32:37.981 [27797] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:32:40.935 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:32:47.232 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 19:32:47.232 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432511,ok=432511,error=0, records=41
[INFO ] 2026-06-02 19:32:47.258 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833152},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:32:47.408 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:32:47.408 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 19:32:47.408 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:32:47.408 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:32:47.408 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:32:47.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:32:47.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:32:47.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:32:47.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:32:47.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:32:47.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:32:52.985 [27797] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:32:55.936 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:33:02.239 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10329, records=41
[INFO ] 2026-06-02 19:33:02.239 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432512,ok=432512,error=0, records=41
[INFO ] 2026-06-02 19:33:05.790 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21637/300s
[INFO ] 2026-06-02 19:33:07.592 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21637/300s
[WARN ] 2026-06-02 19:33:07.989 [27743] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:33:10.937 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:33:14.398 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21637/300s
[INFO ] 2026-06-02 19:33:17.245 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 19:33:17.245 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432513,ok=432513,error=0, records=41
[WARN ] 2026-06-02 19:33:22.993 [27743] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:33:25.937 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:33:32.251 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 19:33:32.251 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432514,ok=432514,error=0, records=41
[WARN ] 2026-06-02 19:33:37.999 [27840] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:33:40.938 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 19:33:40.938 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 19:33:47.256 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10209, records=41
[INFO ] 2026-06-02 19:33:47.256 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432515,ok=432515,error=0, records=41
[WARN ] 2026-06-02 19:33:53.004 [27854] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:33:55.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:34:02.321 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 19:34:02.321 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432516,ok=432516,error=0, records=41
[WARN ] 2026-06-02 19:34:08.010 [27743] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:34:10.939 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:34:17.327 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 19:34:17.327 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432517,ok=432517,error=0, records=41
[WARN ] 2026-06-02 19:34:23.015 [27854] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:34:25.940 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:34:32.333 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 19:34:32.333 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432518,ok=432518,error=0, records=41
[WARN ] 2026-06-02 19:34:38.019 [27797] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:34:40.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:34:47.338 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10207, records=41
[INFO ] 2026-06-02 19:34:47.339 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432519,ok=432519,error=0, records=41
[WARN ] 2026-06-02 19:34:53.024 [27923] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:34:55.941 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:35:02.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 19:35:02.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432520,ok=432520,error=0, records=41
[INFO ] 2026-06-02 19:35:02.358 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21641/300s
[WARN ] 2026-06-02 19:35:08.029 [27812] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:35:10.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:35:17.348 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 19:35:17.348 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432521,ok=432521,error=0, records=41
[WARN ] 2026-06-02 19:35:23.035 [27950] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:35:25.942 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:35:32.353 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 19:35:32.353 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432522,ok=432522,error=0, records=41
[WARN ] 2026-06-02 19:35:38.040 [27955] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:35:40.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:35:43.041 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21632/300s
[INFO ] 2026-06-02 19:35:43.898 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21641/300s
[INFO ] 2026-06-02 19:35:47.364 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 19:35:47.364 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432523,ok=432523,error=0, records=41
[INFO ] 2026-06-02 19:35:47.408 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18017/300s
[INFO ] 2026-06-02 19:35:47.410 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833076},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:35:47.580 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:35:47.580 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 19:35:47.581 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:35:47.581 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:35:47.581 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:35:47.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:35:47.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:35:47.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:35:47.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:35:47.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:35:47.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:35:53.044 [27983] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:35:55.943 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:36:02.373 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 19:36:02.373 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432524,ok=432524,error=0, records=41
[WARN ] 2026-06-02 19:36:08.051 [27998] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:36:10.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:36:17.378 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10243, records=41
[INFO ] 2026-06-02 19:36:17.378 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432525,ok=432525,error=0, records=41
[WARN ] 2026-06-02 19:36:22.555 [28008] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:36:25.944 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:36:32.385 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 19:36:32.385 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432526,ok=432526,error=0, records=41
[INFO ] 2026-06-02 19:36:32.385 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21628/300s
[WARN ] 2026-06-02 19:36:37.561 [28026] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:36:40.945 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:36:47.390 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 19:36:47.390 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432527,ok=432527,error=0, records=41
[WARN ] 2026-06-02 19:36:52.565 [28033] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:36:55.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:36:58.074 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21637/300s
[INFO ] 2026-06-02 19:37:02.395 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 19:37:02.395 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432528,ok=432528,error=0, records=41
[WARN ] 2026-06-02 19:37:07.570 [28067] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:37:10.946 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:37:10.946 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21640/300s
[INFO ] 2026-06-02 19:37:17.400 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 19:37:17.400 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432529,ok=432529,error=0, records=41
[WARN ] 2026-06-02 19:37:22.576 [28079] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:37:23.509 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21628/300s
[INFO ] 2026-06-02 19:37:25.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:37:32.406 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 19:37:32.406 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432530,ok=432530,error=0, records=41
[WARN ] 2026-06-02 19:37:37.580 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:37:40.947 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:37:47.411 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 19:37:47.411 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432531,ok=432531,error=0, records=41
[WARN ] 2026-06-02 19:37:52.585 [28126] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:37:55.948 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:38:02.416 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10208, records=41
[INFO ] 2026-06-02 19:38:02.416 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432532,ok=432532,error=0, records=41
[INFO ] 2026-06-02 19:38:05.840 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21638/300s
[WARN ] 2026-06-02 19:38:07.589 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:38:07.641 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21638/300s
[INFO ] 2026-06-02 19:38:10.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:38:14.445 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21638/300s
[INFO ] 2026-06-02 19:38:17.423 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 19:38:17.423 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432533,ok=432533,error=0, records=41
[WARN ] 2026-06-02 19:38:22.593 [28102] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:38:25.949 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:38:32.429 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 19:38:32.429 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432534,ok=432534,error=0, records=41
[WARN ] 2026-06-02 19:38:37.598 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:38:40.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:38:47.434 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 19:38:47.434 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432535,ok=432535,error=0, records=41
[INFO ] 2026-06-02 19:38:47.582 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20833000},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:38:47.751 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:38:47.751 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 19:38:47.751 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:38:47.751 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:38:47.751 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:38:47.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:38:47.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:38:47.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:38:47.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:38:47.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:38:47.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:38:52.602 [28143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:38:55.950 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.26MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:38:55.950 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 19:39:02.439 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10189, records=41
[INFO ] 2026-06-02 19:39:02.439 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432536,ok=432536,error=0, records=41
[WARN ] 2026-06-02 19:39:07.608 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:39:10.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.25MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:39:17.444 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10247, records=41
[INFO ] 2026-06-02 19:39:17.444 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432537,ok=432537,error=0, records=41
[WARN ] 2026-06-02 19:39:22.612 [28144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:39:25.952 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.50MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:39:32.450 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10245, records=41
[INFO ] 2026-06-02 19:39:32.450 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432538,ok=432538,error=0, records=41
[WARN ] 2026-06-02 19:39:37.616 [28143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:39:40.953 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.75MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:39:47.456 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 19:39:47.456 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432539,ok=432539,error=0, records=41
[WARN ] 2026-06-02 19:39:52.620 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:39:55.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:40:02.361 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21642/300s
[INFO ] 2026-06-02 19:40:02.460 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 19:40:02.460 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432540,ok=432540,error=0, records=41
[WARN ] 2026-06-02 19:40:07.625 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:40:10.954 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=24.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:40:17.469 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10287, records=41
[INFO ] 2026-06-02 19:40:17.469 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432541,ok=432541,error=0, records=41
[WARN ] 2026-06-02 19:40:22.630 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:40:25.955 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=24.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:40:32.473 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10307, records=41
[INFO ] 2026-06-02 19:40:32.473 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432542,ok=432542,error=0, records=41
[WARN ] 2026-06-02 19:40:37.635 [28167] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:40:40.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:40:43.137 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21633/300s
[INFO ] 2026-06-02 19:40:43.905 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21642/300s
[INFO ] 2026-06-02 19:40:47.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10293, records=41
[INFO ] 2026-06-02 19:40:47.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432543,ok=432543,error=0, records=41
[WARN ] 2026-06-02 19:40:52.640 [28167] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:40:55.956 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:41:02.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10304, records=41
[INFO ] 2026-06-02 19:41:02.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432544,ok=432544,error=0, records=41
[WARN ] 2026-06-02 19:41:07.645 [28167] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:41:10.957 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=25.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:41:17.494 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 19:41:17.494 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432545,ok=432545,error=0, records=41
[WARN ] 2026-06-02 19:41:22.650 [28143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:41:25.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:41:32.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10230, records=41
[INFO ] 2026-06-02 19:41:32.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432546,ok=432546,error=0, records=41
[INFO ] 2026-06-02 19:41:32.500 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21629/300s
[WARN ] 2026-06-02 19:41:37.655 [28144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:41:40.958 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:41:47.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 19:41:47.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432547,ok=432547,error=0, records=41
[INFO ] 2026-06-02 19:41:47.751 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18018/300s
[INFO ] 2026-06-02 19:41:47.752 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20832920},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:41:47.913 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:41:47.913 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"TELNET":[],"PING":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 19:41:47.913 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:41:47.914 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:41:47.914 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:41:47.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:41:47.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:41:47.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:41:47.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:41:47.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:41:47.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:41:52.661 [28144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:41:55.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:41:58.127 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21638/300s
[INFO ] 2026-06-02 19:42:02.511 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10260, records=41
[INFO ] 2026-06-02 19:42:02.511 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432548,ok=432548,error=0, records=41
[WARN ] 2026-06-02 19:42:07.666 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:42:10.959 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:42:10.959 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21641/300s
[INFO ] 2026-06-02 19:42:17.516 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10193, records=41
[INFO ] 2026-06-02 19:42:17.516 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432549,ok=432549,error=0, records=41
[WARN ] 2026-06-02 19:42:22.671 [28144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:42:23.687 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21629/300s
[INFO ] 2026-06-02 19:42:25.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:42:32.521 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10185, records=41
[INFO ] 2026-06-02 19:42:32.521 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432550,ok=432550,error=0, records=41
[WARN ] 2026-06-02 19:42:37.677 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:42:40.960 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:42:47.526 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 19:42:47.526 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432551,ok=432551,error=0, records=41
[WARN ] 2026-06-02 19:42:52.682 [28144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:42:55.961 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.27%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:43:02.536 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 19:43:02.536 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432552,ok=432552,error=0, records=41
[INFO ] 2026-06-02 19:43:05.875 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21639/300s
[INFO ] 2026-06-02 19:43:07.677 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21639/300s
[WARN ] 2026-06-02 19:43:07.687 [28143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:43:10.961 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:43:14.480 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21639/300s
[INFO ] 2026-06-02 19:43:17.588 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10364, records=41
[INFO ] 2026-06-02 19:43:17.588 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432553,ok=432553,error=0, records=41
[WARN ] 2026-06-02 19:43:22.693 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:43:25.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:43:32.593 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10370, records=41
[INFO ] 2026-06-02 19:43:32.593 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432554,ok=432554,error=0, records=41
[WARN ] 2026-06-02 19:43:37.698 [28144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:43:40.962 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 19:43:40.962 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 19:43:47.598 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10371, records=41
[INFO ] 2026-06-02 19:43:47.598 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432555,ok=432555,error=0, records=41
[WARN ] 2026-06-02 19:43:52.703 [28144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:43:55.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:44:02.603 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10342, records=41
[INFO ] 2026-06-02 19:44:02.603 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432556,ok=432556,error=0, records=41
[WARN ] 2026-06-02 19:44:07.708 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:44:10.963 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:44:17.608 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10368, records=41
[INFO ] 2026-06-02 19:44:17.608 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432557,ok=432557,error=0, records=41
[WARN ] 2026-06-02 19:44:22.714 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:44:25.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:44:32.615 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10325, records=41
[INFO ] 2026-06-02 19:44:32.615 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432558,ok=432558,error=0, records=41
[WARN ] 2026-06-02 19:44:37.719 [28143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:44:40.964 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.33%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:44:47.620 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 19:44:47.620 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432559,ok=432559,error=0, records=41
[INFO ] 2026-06-02 19:44:47.915 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20832844},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:44:48.054 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:44:48.054 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 19:44:48.054 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:44:48.054 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:44:48.054 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:44:48.061 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:44:48.061 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:44:48.061 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:44:48.061 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:44:48.061 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:44:48.061 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:44:52.724 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:44:55.965 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:45:02.364 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21643/300s
[INFO ] 2026-06-02 19:45:02.625 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10359, records=41
[INFO ] 2026-06-02 19:45:02.625 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432560,ok=432560,error=0, records=41
[WARN ] 2026-06-02 19:45:07.730 [28144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:45:10.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:45:17.638 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10323, records=41
[INFO ] 2026-06-02 19:45:17.638 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432561,ok=432561,error=0, records=41
[WARN ] 2026-06-02 19:45:22.735 [28143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:45:25.966 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:45:32.648 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10326, records=41
[INFO ] 2026-06-02 19:45:32.648 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432562,ok=432562,error=0, records=41
[WARN ] 2026-06-02 19:45:37.741 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:45:40.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:45:43.242 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21634/300s
[INFO ] 2026-06-02 19:45:43.911 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21643/300s
[INFO ] 2026-06-02 19:45:47.653 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10314, records=41
[INFO ] 2026-06-02 19:45:47.653 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432563,ok=432563,error=0, records=41
[WARN ] 2026-06-02 19:45:52.746 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:45:55.967 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:46:02.660 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10300, records=41
[INFO ] 2026-06-02 19:46:02.660 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432564,ok=432564,error=0, records=41
[WARN ] 2026-06-02 19:46:07.751 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:46:10.968 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:46:17.665 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 19:46:17.665 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432565,ok=432565,error=0, records=41
[WARN ] 2026-06-02 19:46:22.756 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:46:25.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:46:32.670 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 19:46:32.670 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432566,ok=432566,error=0, records=41
[INFO ] 2026-06-02 19:46:32.670 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21630/300s
[WARN ] 2026-06-02 19:46:37.760 [28143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:46:40.969 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:46:47.679 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10187, records=41
[INFO ] 2026-06-02 19:46:47.679 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432567,ok=432567,error=0, records=41
[WARN ] 2026-06-02 19:46:52.765 [28144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:46:55.970 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:46:58.173 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21639/300s
[INFO ] 2026-06-02 19:47:02.685 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10198, records=41
[INFO ] 2026-06-02 19:47:02.685 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432568,ok=432568,error=0, records=41
[WARN ] 2026-06-02 19:47:07.769 [28167] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:47:10.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:47:10.971 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21642/300s
[INFO ] 2026-06-02 19:47:17.690 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 19:47:17.690 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432569,ok=432569,error=0, records=41
[WARN ] 2026-06-02 19:47:22.773 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:47:23.865 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21630/300s
[INFO ] 2026-06-02 19:47:25.971 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:47:32.695 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10225, records=41
[INFO ] 2026-06-02 19:47:32.696 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432570,ok=432570,error=0, records=41
[WARN ] 2026-06-02 19:47:37.777 [28108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:47:40.972 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:47:47.786 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 19:47:47.786 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432571,ok=432571,error=0, records=41
[INFO ] 2026-06-02 19:47:48.054 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18019/300s
[INFO ] 2026-06-02 19:47:48.056 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20832768},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:47:48.237 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:47:48.237 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 19:47:48.237 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:47:48.237 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:47:48.237 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:47:48.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:47:48.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:47:48.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:47:48.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:47:48.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:47:48.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:47:52.782 [28143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:47:55.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:48:02.792 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10223, records=41
[INFO ] 2026-06-02 19:48:02.792 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432572,ok=432572,error=0, records=41
[INFO ] 2026-06-02 19:48:05.906 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21640/300s
[INFO ] 2026-06-02 19:48:07.708 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21640/300s
[WARN ] 2026-06-02 19:48:07.788 [28143] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:48:10.973 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:48:14.514 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21640/300s
[INFO ] 2026-06-02 19:48:17.799 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=11255, records=44
[INFO ] 2026-06-02 19:48:17.799 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432573,ok=432573,error=0, records=44
[WARN ] 2026-06-02 19:48:22.793 [28144] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:48:25.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:48:32.810 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 19:48:32.810 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432574,ok=432574,error=0, records=41
[WARN ] 2026-06-02 19:48:37.799 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:48:40.974 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:48:47.815 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=13279, records=49
[INFO ] 2026-06-02 19:48:47.815 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432575,ok=432575,error=0, records=49
[WARN ] 2026-06-02 19:48:52.804 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:48:55.975 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:49:02.820 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=7174, records=33
[INFO ] 2026-06-02 19:49:02.820 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432576,ok=432576,error=0, records=33
[WARN ] 2026-06-02 19:49:07.809 [28724] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:49:10.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:49:17.828 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=13278, records=49
[INFO ] 2026-06-02 19:49:17.828 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432577,ok=432577,error=0, records=49
[WARN ] 2026-06-02 19:49:22.815 [28730] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:49:25.976 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:49:32.834 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 19:49:32.834 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432578,ok=432578,error=0, records=41
[WARN ] 2026-06-02 19:49:37.820 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:49:40.977 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=27.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:49:47.840 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 19:49:47.840 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432579,ok=432579,error=0, records=41
[WARN ] 2026-06-02 19:49:52.825 [28759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:49:55.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:50:02.367 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21644/300s
[INFO ] 2026-06-02 19:50:02.845 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 19:50:02.845 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432580,ok=432580,error=0, records=41
[WARN ] 2026-06-02 19:50:07.831 [28709] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:50:10.978 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=27.41MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:50:17.850 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 19:50:17.850 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432581,ok=432581,error=0, records=41
[WARN ] 2026-06-02 19:50:22.836 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:50:25.979 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.93MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:50:32.855 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 19:50:32.855 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432582,ok=432582,error=0, records=41
[WARN ] 2026-06-02 19:50:37.841 [28759] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:50:40.979 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:50:43.342 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21635/300s
[INFO ] 2026-06-02 19:50:43.918 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21644/300s
[INFO ] 2026-06-02 19:50:47.861 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 19:50:47.861 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432583,ok=432583,error=0, records=41
[INFO ] 2026-06-02 19:50:48.239 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20832680},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:50:48.400 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:50:48.400 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"TELNET":[],"HTTP":[],"PING":[]}
[INFO ] 2026-06-02 19:50:48.400 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:50:48.400 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:50:48.400 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:50:48.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:50:48.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:50:48.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:50:48.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:50:48.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:50:48.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:50:52.846 [28127] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:50:55.980 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=28.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:51:02.867 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 19:51:02.867 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432584,ok=432584,error=0, records=41
[WARN ] 2026-06-02 19:51:07.851 [28792] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:51:10.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:51:17.874 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10210, records=41
[INFO ] 2026-06-02 19:51:17.874 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432585,ok=432585,error=0, records=41
[WARN ] 2026-06-02 19:51:22.857 [28806] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:51:25.981 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.17MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:51:32.880 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 19:51:32.880 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432586,ok=432586,error=0, records=41
[INFO ] 2026-06-02 19:51:32.880 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21631/300s
[WARN ] 2026-06-02 19:51:37.862 [28806] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:51:40.982 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.45MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:51:47.887 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 19:51:47.887 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432587,ok=432587,error=0, records=41
[WARN ] 2026-06-02 19:51:52.867 [28806] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:51:55.983 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=28.71MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:51:58.232 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21640/300s
[INFO ] 2026-06-02 19:52:02.893 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 19:52:02.893 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432588,ok=432588,error=0, records=41
[WARN ] 2026-06-02 19:52:07.872 [28883] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:52:10.983 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.96MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:52:10.983 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21643/300s
[INFO ] 2026-06-02 19:52:17.898 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10197, records=41
[INFO ] 2026-06-02 19:52:17.898 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432589,ok=432589,error=0, records=41
[WARN ] 2026-06-02 19:52:22.877 [28907] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:52:24.050 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21631/300s
[INFO ] 2026-06-02 19:52:25.984 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=29.22MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:52:32.903 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 19:52:32.903 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432590,ok=432590,error=0, records=41
[WARN ] 2026-06-02 19:52:37.882 [28842] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:52:40.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.99MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:52:47.909 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 19:52:47.909 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432591,ok=432591,error=0, records=41
[WARN ] 2026-06-02 19:52:52.887 [28922] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:52:55.985 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:53:02.914 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 19:53:02.914 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432592,ok=432592,error=0, records=41
[INFO ] 2026-06-02 19:53:05.971 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21641/300s
[INFO ] 2026-06-02 19:53:07.773 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21641/300s
[WARN ] 2026-06-02 19:53:07.893 [28965] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:53:10.986 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:53:14.578 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21641/300s
[INFO ] 2026-06-02 19:53:17.919 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10202, records=41
[INFO ] 2026-06-02 19:53:17.919 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432593,ok=432593,error=0, records=41
[WARN ] 2026-06-02 19:53:22.898 [28981] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:53:25.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.43MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:53:32.924 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 19:53:32.924 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432594,ok=432594,error=0, records=41
[WARN ] 2026-06-02 19:53:37.903 [28996] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:53:40.987 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 19:53:40.987 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 19:53:47.989 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10212, records=41
[INFO ] 2026-06-02 19:53:47.989 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432595,ok=432595,error=0, records=41
[INFO ] 2026-06-02 19:53:48.400 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18020/300s
[INFO ] 2026-06-02 19:53:48.402 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20832604},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:53:48.580 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:53:48.580 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"HTTP":[],"TELNET":[],"PING":[],"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true}}
[INFO ] 2026-06-02 19:53:48.581 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:53:48.581 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:53:48.581 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:53:48.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:53:48.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:53:48.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:53:48.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:53:48.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:53:48.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:53:52.909 [29007] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:53:55.988 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:53:55.988 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 19:54:02.999 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10234, records=41
[INFO ] 2026-06-02 19:54:02.999 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432596,ok=432596,error=0, records=41
[WARN ] 2026-06-02 19:54:07.915 [29030] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:54:10.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.67%[>=50.00% 0/4], memory=26.13MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:54:18.005 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 19:54:18.005 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432597,ok=432597,error=0, records=41
[WARN ] 2026-06-02 19:54:22.922 [29047] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:54:25.990 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=27.63MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:54:33.009 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 19:54:33.009 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432598,ok=432598,error=0, records=41
[WARN ] 2026-06-02 19:54:37.927 [29064] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:54:40.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=28.64MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:54:48.015 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 19:54:48.015 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432599,ok=432599,error=0, records=41
[WARN ] 2026-06-02 19:54:52.932 [29081] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:54:55.991 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=29.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:55:02.370 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21645/300s
[INFO ] 2026-06-02 19:55:03.023 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 19:55:03.024 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432600,ok=432600,error=0, records=41
[WARN ] 2026-06-02 19:55:07.938 [29090] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:55:10.992 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:55:18.029 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10350, records=41
[INFO ] 2026-06-02 19:55:18.029 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432601,ok=432601,error=0, records=41
[WARN ] 2026-06-02 19:55:22.943 [29108] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:55:25.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.19MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:55:33.034 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10324, records=41
[INFO ] 2026-06-02 19:55:33.034 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432602,ok=432602,error=0, records=41
[WARN ] 2026-06-02 19:55:37.948 [29102] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:55:40.993 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:55:43.449 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21636/300s
[INFO ] 2026-06-02 19:55:43.924 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21645/300s
[INFO ] 2026-06-02 19:55:48.039 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-02 19:55:48.039 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432603,ok=432603,error=0, records=41
[WARN ] 2026-06-02 19:55:52.953 [29139] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:55:55.994 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:56:03.045 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10227, records=41
[INFO ] 2026-06-02 19:56:03.045 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432604,ok=432604,error=0, records=41
[WARN ] 2026-06-02 19:56:07.959 [29139] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:56:10.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:56:18.055 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10211, records=41
[INFO ] 2026-06-02 19:56:18.055 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432605,ok=432605,error=0, records=41
[WARN ] 2026-06-02 19:56:22.964 [29168] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:56:25.995 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:56:33.063 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 19:56:33.063 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432606,ok=432606,error=0, records=41
[INFO ] 2026-06-02 19:56:33.063 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21632/300s
[WARN ] 2026-06-02 19:56:37.970 [29139] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:56:40.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.23MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:56:48.069 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10206, records=41
[INFO ] 2026-06-02 19:56:48.069 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432607,ok=432607,error=0, records=41
[INFO ] 2026-06-02 19:56:48.582 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20832532},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:56:48.747 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:56:48.747 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"HTTP":[],"TELNET":[],"PING":[]}
[INFO ] 2026-06-02 19:56:48.747 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:56:48.747 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:56:48.747 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:56:48.761 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:56:48.761 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:56:48.761 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:56:48.761 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:56:48.761 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:56:48.761 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:56:52.973 [29102] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:56:55.996 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=30.87MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:56:58.289 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21641/300s
[INFO ] 2026-06-02 19:57:03.078 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 19:57:03.078 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432608,ok=432608,error=0, records=41
[WARN ] 2026-06-02 19:57:07.978 [29168] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:57:10.997 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.88MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:57:10.997 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21644/300s
[INFO ] 2026-06-02 19:57:18.083 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 19:57:18.083 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432609,ok=432609,error=0, records=41
[WARN ] 2026-06-02 19:57:22.982 [29124] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:57:24.235 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21632/300s
[INFO ] 2026-06-02 19:57:25.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:57:33.088 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 19:57:33.088 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432610,ok=432610,error=0, records=41
[WARN ] 2026-06-02 19:57:37.987 [29210] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:57:40.998 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:57:48.148 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 19:57:48.148 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432611,ok=432611,error=0, records=41
[WARN ] 2026-06-02 19:57:52.992 [29195] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:57:55.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:58:03.158 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 19:58:03.158 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432612,ok=432612,error=0, records=41
[INFO ] 2026-06-02 19:58:06.036 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21642/300s
[INFO ] 2026-06-02 19:58:07.838 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21642/300s
[WARN ] 2026-06-02 19:58:07.998 [29252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:58:10.999 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:58:14.644 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21642/300s
[INFO ] 2026-06-02 19:58:18.210 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10237, records=41
[INFO ] 2026-06-02 19:58:18.210 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432613,ok=432613,error=0, records=41
[WARN ] 2026-06-02 19:58:23.003 [29252] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:58:26.000 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:58:33.219 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 19:58:33.219 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432614,ok=432614,error=0, records=41
[WARN ] 2026-06-02 19:58:38.008 [29195] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:58:41.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:58:48.223 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10203, records=41
[INFO ] 2026-06-02 19:58:48.223 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432615,ok=432615,error=0, records=41
[WARN ] 2026-06-02 19:58:53.013 [29210] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:58:56.001 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.07MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:59:03.229 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10347, records=41
[INFO ] 2026-06-02 19:59:03.229 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432616,ok=432616,error=0, records=41
[WARN ] 2026-06-02 19:59:08.019 [29293] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:59:11.002 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.08MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:59:18.235 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10339, records=41
[INFO ] 2026-06-02 19:59:18.235 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432617,ok=432617,error=0, records=41
[WARN ] 2026-06-02 19:59:23.025 [29210] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:59:26.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:59:33.246 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10321, records=41
[INFO ] 2026-06-02 19:59:33.246 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432618,ok=432618,error=0, records=41
[WARN ] 2026-06-02 19:59:38.029 [29210] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:59:41.003 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.09MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 19:59:48.252 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10340, records=41
[INFO ] 2026-06-02 19:59:48.252 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432619,ok=432619,error=0, records=41
[INFO ] 2026-06-02 19:59:48.747 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18021/300s
[INFO ] 2026-06-02 19:59:48.749 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20832448},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 19:59:48.917 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 19:59:48.917 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"HTTP":[],"TELNET":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]}}
[INFO ] 2026-06-02 19:59:48.917 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 19:59:48.917 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 19:59:48.917 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 19:59:48.961 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:59:48.961 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 19:59:48.961 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:59:48.961 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 19:59:48.961 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 19:59:48.961 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 19:59:53.035 [29372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 19:59:56.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:00:02.373 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21646/300s
[INFO ] 2026-06-02 20:00:03.260 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10253, records=41
[INFO ] 2026-06-02 20:00:03.260 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432620,ok=432620,error=0, records=41
[WARN ] 2026-06-02 20:00:08.040 [29372] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:00:11.004 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 20:00:17.544 [29401] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/24748/stat), No such file or directory
[WARN ] 2026-06-02 20:00:17.544 [29401] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/27643/stat), No such file or directory
[INFO ] 2026-06-02 20:00:18.268 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10257, records=41
[INFO ] 2026-06-02 20:00:18.268 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432621,ok=432621,error=0, records=41
[WARN ] 2026-06-02 20:00:23.046 [29401] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:00:26.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.40%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[WARN ] 2026-06-02 20:00:32.550 [29426] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/24748/stat), No such file or directory
[WARN ] 2026-06-02 20:00:32.550 [29426] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/27643/stat), No such file or directory
[INFO ] 2026-06-02 20:00:33.273 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10194, records=41
[INFO ] 2026-06-02 20:00:33.273 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432622,ok=432622,error=0, records=41
[WARN ] 2026-06-02 20:00:38.050 [29401] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:00:41.005 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:00:43.552 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21637/300s
[INFO ] 2026-06-02 20:00:43.931 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21646/300s
[WARN ] 2026-06-02 20:00:47.555 [29438] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/24748/stat), No such file or directory
[WARN ] 2026-06-02 20:00:47.555 [29438] cloudMonitor/base_collect.cpp:253: SicGetProcessCpuInformation failed, err: FeadFileContent(/proc/27643/stat), No such file or directory
[INFO ] 2026-06-02 20:00:48.279 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10217, records=41
[INFO ] 2026-06-02 20:00:48.279 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432623,ok=432623,error=0, records=41
[WARN ] 2026-06-02 20:00:52.557 [29448] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:00:56.006 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:01:03.286 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10192, records=41
[INFO ] 2026-06-02 20:01:03.286 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432624,ok=432624,error=0, records=41
[WARN ] 2026-06-02 20:01:07.563 [29472] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:01:11.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:01:18.292 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 20:01:18.292 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432625,ok=432625,error=0, records=41
[WARN ] 2026-06-02 20:01:22.568 [29490] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:01:26.007 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:01:33.298 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10199, records=41
[INFO ] 2026-06-02 20:01:33.298 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432626,ok=432626,error=0, records=41
[INFO ] 2026-06-02 20:01:33.298 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21633/300s
[WARN ] 2026-06-02 20:01:37.575 [29514] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:01:41.008 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:01:48.303 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10160, records=41
[INFO ] 2026-06-02 20:01:48.303 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432627,ok=432627,error=0, records=41
[WARN ] 2026-06-02 20:01:52.581 [29497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:01:56.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.10MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:01:58.347 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21642/300s
[INFO ] 2026-06-02 20:02:03.312 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10235, records=41
[INFO ] 2026-06-02 20:02:03.312 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432628,ok=432628,error=0, records=41
[WARN ] 2026-06-02 20:02:07.587 [29549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:02:11.009 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:02:11.009 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21645/300s
[INFO ] 2026-06-02 20:02:18.318 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10220, records=41
[INFO ] 2026-06-02 20:02:18.318 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432629,ok=432629,error=0, records=41
[WARN ] 2026-06-02 20:02:22.592 [29550] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:02:24.413 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21633/300s
[INFO ] 2026-06-02 20:02:26.010 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.36MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:02:33.324 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10252, records=41
[INFO ] 2026-06-02 20:02:33.324 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432630,ok=432630,error=0, records=41
[WARN ] 2026-06-02 20:02:37.597 [29493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:02:41.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:02:48.329 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 20:02:48.329 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432631,ok=432631,error=0, records=41
[INFO ] 2026-06-02 20:02:48.919 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20832368},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 20:02:49.062 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 20:02:49.062 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 20:02:49.062 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 20:02:49.062 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 20:02:49.062 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 20:02:49.161 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 20:02:49.161 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 20:02:49.161 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 20:02:49.161 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 20:02:49.161 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 20:02:49.161 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 20:02:52.602 [29549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:02:56.011 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:03:03.335 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10172, records=41
[INFO ] 2026-06-02 20:03:03.335 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432632,ok=432632,error=0, records=41
[INFO ] 2026-06-02 20:03:06.089 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21643/300s
[WARN ] 2026-06-02 20:03:07.607 [29497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:03:07.891 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21643/300s
[INFO ] 2026-06-02 20:03:11.012 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:03:14.697 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21643/300s
[INFO ] 2026-06-02 20:03:18.343 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10156, records=41
[INFO ] 2026-06-02 20:03:18.343 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432633,ok=432633,error=0, records=41
[WARN ] 2026-06-02 20:03:22.611 [29497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:03:26.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:03:33.349 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10137, records=41
[INFO ] 2026-06-02 20:03:33.349 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432634,ok=432634,error=0, records=41
[WARN ] 2026-06-02 20:03:37.615 [29549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:03:41.013 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[ERROR] 2026-06-02 20:03:41.013 [908  ] core/ChannelManager.cpp:107: unkonw channel(alimonitor)
[INFO ] 2026-06-02 20:03:48.354 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10148, records=41
[INFO ] 2026-06-02 20:03:48.354 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432635,ok=432635,error=0, records=41
[WARN ] 2026-06-02 20:03:52.620 [29560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:03:56.014 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.55MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:04:03.360 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10201, records=41
[INFO ] 2026-06-02 20:04:03.360 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432636,ok=432636,error=0, records=41
[WARN ] 2026-06-02 20:04:07.625 [29560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:04:11.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:04:18.365 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 20:04:18.365 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432637,ok=432637,error=0, records=41
[WARN ] 2026-06-02 20:04:22.630 [29493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:04:26.015 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:04:33.370 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 20:04:33.370 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432638,ok=432638,error=0, records=41
[WARN ] 2026-06-02 20:04:37.634 [29549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:04:41.016 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:04:48.376 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 20:04:48.376 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432639,ok=432639,error=0, records=41
[WARN ] 2026-06-02 20:04:52.639 [29580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:04:56.017 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:05:02.377 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21647/300s
[INFO ] 2026-06-02 20:05:03.382 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10362, records=41
[INFO ] 2026-06-02 20:05:03.382 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432640,ok=432640,error=0, records=41
[WARN ] 2026-06-02 20:05:07.644 [29493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:05:11.017 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:05:18.388 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-02 20:05:18.388 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432641,ok=432641,error=0, records=41
[WARN ] 2026-06-02 20:05:22.649 [29580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:05:26.018 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:05:33.393 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10346, records=41
[INFO ] 2026-06-02 20:05:33.393 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432642,ok=432642,error=0, records=41
[WARN ] 2026-06-02 20:05:37.655 [29560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:05:41.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:05:43.657 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21638/300s
[INFO ] 2026-06-02 20:05:43.938 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21647/300s
[INFO ] 2026-06-02 20:05:48.398 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10352, records=41
[INFO ] 2026-06-02 20:05:48.398 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432643,ok=432643,error=0, records=41
[INFO ] 2026-06-02 20:05:49.062 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18022/300s
[INFO ] 2026-06-02 20:05:49.064 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20832284},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 20:05:49.231 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 20:05:49.231 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"HTTP":[],"PING":[],"TELNET":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 20:05:49.231 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 20:05:49.231 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 20:05:49.231 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 20:05:49.261 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 20:05:49.261 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 20:05:49.261 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 20:05:49.261 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 20:05:49.261 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 20:05:49.261 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 20:05:52.660 [29560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:05:56.019 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:06:03.403 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10238, records=41
[INFO ] 2026-06-02 20:06:03.403 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432644,ok=432644,error=0, records=41
[WARN ] 2026-06-02 20:06:07.665 [29580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:06:11.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:06:18.412 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10215, records=41
[INFO ] 2026-06-02 20:06:18.412 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432645,ok=432645,error=0, records=41
[WARN ] 2026-06-02 20:06:22.669 [29560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:06:26.020 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:06:33.417 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10264, records=41
[INFO ] 2026-06-02 20:06:33.417 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432646,ok=432646,error=0, records=41
[INFO ] 2026-06-02 20:06:33.417 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21634/300s
[WARN ] 2026-06-02 20:06:37.674 [29497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:06:41.021 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=31.56MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:06:48.422 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10248, records=41
[INFO ] 2026-06-02 20:06:48.422 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432647,ok=432647,error=0, records=41
[WARN ] 2026-06-02 20:06:52.678 [29549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:06:56.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:06:58.401 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21643/300s
[INFO ] 2026-06-02 20:07:03.428 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 20:07:03.428 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432648,ok=432648,error=0, records=41
[WARN ] 2026-06-02 20:07:07.684 [29560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:07:11.022 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:07:11.022 [908  ] common/ThreadWorker.cpp:160: worker <SelfMonitor> keep alive: 21646/300s
[INFO ] 2026-06-02 20:07:18.433 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10213, records=41
[INFO ] 2026-06-02 20:07:18.433 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432649,ok=432649,error=0, records=41
[WARN ] 2026-06-02 20:07:22.689 [29497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:07:24.598 [942  ] common/ThreadWorker.cpp:160: worker <common::Poll> keep alive: 21634/300s
[INFO ] 2026-06-02 20:07:26.023 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:07:33.438 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10204, records=41
[INFO ] 2026-06-02 20:07:33.438 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432650,ok=432650,error=0, records=41
[WARN ] 2026-06-02 20:07:37.693 [29560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:07:41.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:07:48.446 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10219, records=41
[INFO ] 2026-06-02 20:07:48.446 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432651,ok=432651,error=0, records=41
[WARN ] 2026-06-02 20:07:52.698 [29580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:07:56.024 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:08:03.452 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10222, records=41
[INFO ] 2026-06-02 20:08:03.452 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432652,ok=432652,error=0, records=41
[INFO ] 2026-06-02 20:08:06.157 [941  ] common/ThreadWorker.cpp:160: worker <LoggerTaskScheduler> keep alive: 21644/300s
[WARN ] 2026-06-02 20:08:07.703 [29560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:08:07.959 [930  ] common/ThreadWorker.cpp:160: worker <ExporterScheduler> keep alive: 21644/300s
[INFO ] 2026-06-02 20:08:11.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:08:14.764 [1026 ] common/ThreadWorker.cpp:160: worker <DetectSchedule> keep alive: 21644/300s
[INFO ] 2026-06-02 20:08:18.458 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 20:08:18.458 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432653,ok=432653,error=0, records=41
[WARN ] 2026-06-02 20:08:22.708 [29580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:08:26.025 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:08:33.462 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10216, records=41
[INFO ] 2026-06-02 20:08:33.463 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432654,ok=432654,error=0, records=41
[WARN ] 2026-06-02 20:08:37.714 [29497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:08:41.026 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:08:48.468 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10205, records=41
[INFO ] 2026-06-02 20:08:48.468 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432655,ok=432655,error=0, records=41
[INFO ] 2026-06-02 20:08:49.233 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20832204},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 20:08:49.396 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 20:08:49.396 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"PING":[],"TELNET":[],"HTTP":[],"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 20:08:49.397 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 20:08:49.397 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 20:08:49.397 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 20:08:49.461 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 20:08:49.461 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 20:08:49.461 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 20:08:49.461 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 20:08:49.461 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 20:08:49.461 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 20:08:52.719 [29580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:08:56.027 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=30.65MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:08:56.027 [908  ] core/self_monitor.cpp:195: will malloc_trim
[INFO ] 2026-06-02 20:09:03.476 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10226, records=41
[INFO ] 2026-06-02 20:09:03.476 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432656,ok=432656,error=0, records=41
[WARN ] 2026-06-02 20:09:07.724 [29580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:09:11.028 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=25.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:09:18.481 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10186, records=41
[INFO ] 2026-06-02 20:09:18.481 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432657,ok=432657,error=0, records=41
[WARN ] 2026-06-02 20:09:22.729 [29493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:09:26.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:09:33.488 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10228, records=41
[INFO ] 2026-06-02 20:09:33.488 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432658,ok=432658,error=0, records=41
[WARN ] 2026-06-02 20:09:37.734 [29497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:09:41.029 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:09:48.494 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10250, records=41
[INFO ] 2026-06-02 20:09:48.494 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432659,ok=432659,error=0, records=41
[WARN ] 2026-06-02 20:09:52.739 [29560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:09:56.030 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:10:02.380 [940  ] common/ThreadWorker.cpp:160: worker <LoggerTaskMonitor> keep alive: 21648/300s
[INFO ] 2026-06-02 20:10:03.500 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 20:10:03.500 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432660,ok=432660,error=0, records=41
[WARN ] 2026-06-02 20:10:07.745 [29497] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:10:11.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:10:18.506 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10244, records=41
[INFO ] 2026-06-02 20:10:18.506 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432661,ok=432661,error=0, records=41
[WARN ] 2026-06-02 20:10:22.750 [29493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:10:26.031 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:10:33.510 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10239, records=41
[INFO ] 2026-06-02 20:10:33.511 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432662,ok=432662,error=0, records=41
[WARN ] 2026-06-02 20:10:37.757 [29493] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:10:41.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=25.91MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:10:43.759 [932  ] common/ThreadWorker.cpp:160: worker <ModuleSchedulerWorker> keep alive: 21639/300s
[INFO ] 2026-06-02 20:10:43.944 [934  ] common/ThreadWorker.cpp:160: worker <TaskMonitor> keep alive: 21648/300s
[INFO ] 2026-06-02 20:10:48.588 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10236, records=41
[INFO ] 2026-06-02 20:10:48.588 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432663,ok=432663,error=0, records=41
[WARN ] 2026-06-02 20:10:52.762 [29549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:10:56.032 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:11:03.594 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10251, records=41
[INFO ] 2026-06-02 20:11:03.594 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432664,ok=432664,error=0, records=41
[WARN ] 2026-06-02 20:11:07.766 [29580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:11:11.033 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.60%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:11:18.600 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10229, records=41
[INFO ] 2026-06-02 20:11:18.600 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432665,ok=432665,error=0, records=41
[WARN ] 2026-06-02 20:11:22.771 [29580] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:11:26.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:11:33.605 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10224, records=41
[INFO ] 2026-06-02 20:11:33.605 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432666,ok=432666,error=0, records=41
[INFO ] 2026-06-02 20:11:33.605 [931  ] common/ThreadWorker.cpp:160: worker <CloudChannel> keep alive: 21635/300s
[WARN ] 2026-06-02 20:11:37.775 [29549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:11:41.034 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.47%[>=50.00% 0/4], memory=26.16MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:11:48.611 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10218, records=41
[INFO ] 2026-06-02 20:11:48.611 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432667,ok=432667,error=0, records=41
[INFO ] 2026-06-02 20:11:49.397 [928  ] common/ThreadWorker.cpp:160: worker <CloudClient> keep alive: 18023/300s
[INFO ] 2026-06-02 20:11:49.398 [928  ] cloudMonitor/cloud_client.cpp:265: will send heartbeat :{"systemInfo":{"serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","hostname":"iZj6c1151k3ad370bosnmsZ","localIPs":["172.31.172.6"],"name":"Linux (Red Hat)","version":"7.9.2009","arch":"x86_64","freeSpace":20832128},"versionInfo":{"version":"3.5.10"}}
[INFO ] 2026-06-02 20:11:49.566 [928  ] cloudMonitor/cloud_client.cpp:277: send heartbeat to [POST]https://cms-cloudmonitor.aliyun.com/agent/heartbeat success,len=253
[INFO ] 2026-06-02 20:11:49.566 [928  ] cloudMonitor/cloud_client.cpp:323: the heartbeat response is :{"metricHubConfig":{"url":"http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines","gzip":false,"useProxy":false},"metricConfig":{"url":"https://metrichub-cms-cn-shanghai.aliyuncs.com/agent/metrics/putLines","gzip":false,"useProxy":true},"PING":[],"TELNET":[],"HTTP":[],"collectConfig":{"processNames":[],"processConfigs":[],"httpConfigs":[]},"node":{"instanceId":"i-j6c1151k3ad370bosnms","serialNumber":"dc589fe4-745d-4944-a467-1e0f4b1086c9","aliUid":5385154882880207,"hostName":"launch-advisor-20201104","operatingSystem":"Linux","region":"cn-hongkong","ipGroup":"47.242.152.148,172.31.172.6","tianjimonVersion":"3.5.10","aliyunHost":true,"networkType":"vpc","internetTx":204800,"vpcInstanceId":"vpc-j6ci7fo2jp96bcean8z5z","availabilityZone":"cn-hongkong-b","vswitchInstanceId":"vsw-j6cu0lsap2hezl8k9tdl6","instanceTypeFamily":"ecs.g6","aegisStatus":1}}
[INFO ] 2026-06-02 20:11:49.566 [928  ] cloudMonitor/cloud_client.cpp:447: metricConfig is the same,no change!
[INFO ] 2026-06-02 20:11:49.566 [928  ] cloudMonitor/cloud_client.cpp:457: no hpcClusterConfig in the response json{}
[WARN ] 2026-06-02 20:11:49.566 [928  ] cloudMonitor/cloud_client.cpp:481: no fileStore in the response json
[INFO ] 2026-06-02 20:11:49.661 [1027 ] detect/detect_schedule.cpp:141: TelnetItems Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 20:11:49.661 [1027 ] detect/detect_schedule.cpp:142: TelnetItems ~Changed! Current TelnetItems num is 0
[INFO ] 2026-06-02 20:11:49.661 [1027 ] detect/detect_schedule.cpp:141: HttpItems Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 20:11:49.661 [1027 ] detect/detect_schedule.cpp:142: HttpItems ~Changed! Current HttpItems num is 0
[INFO ] 2026-06-02 20:11:49.661 [1027 ] detect/detect_schedule.cpp:141: PingItems Changed! Current PingItems num is 0
[INFO ] 2026-06-02 20:11:49.661 [1027 ] detect/detect_schedule.cpp:142: PingItems ~Changed! Current PingItems num is 0
[WARN ] 2026-06-02 20:11:52.780 [29549] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:11:56.035 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]
[INFO ] 2026-06-02 20:11:58.458 [933  ] common/ThreadWorker.cpp:160: worker <ScriptScheduler> keep alive: 21644/300s
[INFO ] 2026-06-02 20:12:03.617 [931  ] cloudMonitor/cloud_channel.cpp:412: send metric to http://metrichub-cn-hongkong.aliyun.com/agent/metrics/putLines success,len=10242, records=41
[INFO ] 2026-06-02 20:12:03.617 [931  ] cloudMonitor/cloud_channel.cpp:438: send metric summary,total=432668,ok=432668,error=0, records=41
[WARN ] 2026-06-02 20:12:07.785 [29560] core/ModuleScheduler.cpp:301: module(gpu) collect error,ret=-1
[INFO ] 2026-06-02 20:12:11.036 [908  ] core/self_monitor.cpp:174: pid: 872, cpuUsage=0.53%[>=50.00% 0/4], memory=25.67MB[>=200.00MB 0/4], openFiles=10[>=300 0/4]